Pro+ vs. usage-based pricing

Recalculated API Cost Per Million Tokens

The new estimates for cost per million tokens (CPM), factoring in the cost of Cache Read, are as follows:

Model               Est. Input CPM ($/M tokens)   Est. Output CPM ($/M tokens)
Grok-4              ~$3.21                        ~$16.05
Gemini 2.5 Pro      ~$2.26                        ~$6.78
Claude 4 (Sonnet)   ~$4.89                        ~$24.45
Claude 4 Thinking   ~$3.92                        ~$19.60
o3-pro              ~$25.35                       ~$76.05
o4-mini             ~$1.49                        ~$5.96
auto                ~$2.54                        ~$5.08
  • Note on Calculation: These estimates treat Cache Read as a billable event priced at 10% of the standard input rate. To separate the input rate from the output rate, an output-to-input price ratio was assumed for each model (e.g., 5:1 for Claude, 3:1 for Gemini). Models with a high volume of Cache Read tokens (like Gemini and Claude) therefore see a significant shift in their effective cost structure compared to the previous calculation.
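
The calculation described in the note can be sketched roughly as follows. This is a minimal illustration, not the exact method behind the table: it assumes a model's total billed cost and its token counts are known, and the function name, parameter names, and sample figures are all hypothetical. The idea is to convert all usage into "input-token equivalents" (Cache Read tokens weighted at 10%, output tokens weighted by the assumed output-to-input ratio) and divide the total cost by that weighted volume.

```python
def estimate_cpm(total_cost_usd, input_tokens, cache_read_tokens,
                 output_tokens, output_ratio, cache_read_weight=0.10):
    """Estimate (input_cpm, output_cpm) in dollars per million tokens.

    Assumptions (taken from the note, not from any provider's price list):
      - Cache Read tokens are billed at `cache_read_weight` x the input rate.
      - Output tokens are billed at `output_ratio` x the input rate.
    """
    # Express every token type as an equivalent number of input tokens.
    weighted_tokens = (input_tokens
                       + cache_read_weight * cache_read_tokens
                       + output_ratio * output_tokens)
    if weighted_tokens <= 0:
        raise ValueError("weighted token volume must be positive")

    # Dollars per input-equivalent token, scaled to one million tokens.
    input_cpm = total_cost_usd / weighted_tokens * 1_000_000
    output_cpm = input_cpm * output_ratio
    return input_cpm, output_cpm


# Hypothetical usage figures, chosen only to make the arithmetic easy to follow.
input_cpm, output_cpm = estimate_cpm(
    total_cost_usd=10.0,
    input_tokens=1_000_000,
    cache_read_tokens=5_000_000,
    output_tokens=200_000,
    output_ratio=5,  # assumed 5:1 output-to-input ratio
)
print(f"input ~${input_cpm:.2f}/M, output ~${output_cpm:.2f}/M")
```

With these made-up figures the weighted volume is 2.5M input-equivalent tokens, so the estimate comes out at about $4.00 per million input tokens and $20.00 per million output tokens. Because Cache Read tokens only count at 10%, a model with heavy cache usage ends up with a noticeably different per-token estimate than it would if those tokens were ignored or counted at full price.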