$20 for a single request

y4my4my4m · March 11, 2026, 7:39pm

I have multiple single shot requests that are costing me $15 ~ $18 a request…
I understand I am using opus 4.6 using MAX but it used to not be more than $0.80 ~ $1.5 per request..now it’s 10x ?

Is this normal? A bug?

MidnightOak · March 11, 2026, 8:15pm

You were doing 10M token requests with opus for $1.5?

Can you hover over the larger requests and show the token breakdown?
My Auto requests are like 99% cache reads. I wonder what yours are.

Scenario A — 90% cached

Suppose the 10M tokens are:

9M cached input
0.5M new input
0.5M output

Cost:

Cached input → 9 × $0.50 = $4.50
New input → 0.5 × $5 = $2.50
Output → 0.5 × $25 = $12.50

Total ≈ $19.50

Scenario B — 98% cached

Example:

9.8M cached
0.1M new input
0.1M output

Cost:

Cached → 9.8 × $0.50 = $4.90
Input → 0.1 × $5 = $0.50
Output → 0.1 × $25 = $2.50

Total ≈ $7.90

y4my4my4m · March 11, 2026, 8:20pm

Thanks this is useful. So are those prices (roughly) accurate how they calculate?

I’m not sure how many token, but the requests were essentially the same scale (obviously that doesn’t mean it didnt fetch/used much more input tokens).

All I meant was, the more i am paying this week, the more expensive the requests have gotten.

MidnightOak · March 11, 2026, 8:22pm

Hover over the tokens and it should show you the breakdown.

Yes those examples should roughly equate to how Cursor charges per request. This is from their website.

It would be worth comparing them to last weeks token usage and breakdown. Maybe the requests in general are using different amounts or ratios of tokens than normal.

y4my4my4m · March 11, 2026, 9:37pm

This was the $18.40… i’m being charged for cache read or what?.. no way having 20,079 output should cost that much right?

MidnightOak · March 11, 2026, 10:05pm

Yea this doesn’t add up.

Should be like $8 at most.

Breakdown by % of cost

Cache read: ~89% ($6.94)
Output tokens: ~6% ($0.50)
Cache write: ~4% ($0.35)
Input tokens: ~0%

There has to be something else going on. Do these requests have to be “MAX”, maybe “MAX” is causing other charges that are not normal.

Max Mode uses token-based pricing at the model’s API rate, so it consumes usage faster than the default context window. On individual plans, a 20% upcharge is added to the model’s API rate.

Colin · March 11, 2026, 10:14pm

Based on @y4my4my4m’s screenshot, Max Mode is enabled!

When using Max Mode, requests are twice as expensive when input exceeds 200k tokens, so I think the costs are largely expected here (if you double the manual calculations, you arrive at the number shown in the dashboard)!

MidnightOak · March 11, 2026, 10:39pm

Where is the pricing of Max Mode explained?
I don’t see much about it on the pricing page. Is it really as simple as 2x normal cost?

y4my4my4m · March 12, 2026, 6:09am

Just curious, what is to stop a generation from costing someothing like $250 ~ 350 in one request? Say if it can store cache in the millions. What are the limits?

If I leave it running for a long plan, many features, etc.
Can it just keep going up indefinitely?

ZulfikarHD · March 12, 2026, 6:20am

You have to hover over the (!) icon on each model, it shows descriptions about legacy models and 200k+ token pricing. I think Claude’s own pricing page also uses the same pricing method. Though I feel like we need some kind of limiter or cost reminder per process, because the fact that it can cost millions of tokens in one go raises a question, does that mean it’s caching the entire codebase? Is the indexing basically caching all of it? And does caching here work the same as Claude’s , where the first request costs more, and how long does the cache stay warm before it expires?
PS. I dont know if im miss it in docs or other forum discussion for this questions answer,

MidnightOak · March 13, 2026, 9:29pm

y4my4my4m · March 14, 2026, 1:44am

Does this mean it wasn’t supposed to charge 2x above 200k?

CC: @Colin

Serp · March 14, 2026, 7:16am

It came yesterday and the question is for 3 days old requests.

MidnightOak · March 14, 2026, 4:05pm

I don’t know when this promotion went into affect, but your previous request may have been before the promotion. Just saying for now, maybe you will not be charged 2x, so go ham.

Topic		Replies	Views
Opus 4.6 vs opus 4.6 Max Pricing Help max-mode , anthropic	1	624	March 18, 2026
Max mode vs non-max mode (context max, not thinking max) Feedback max-mode	5	416	April 2, 2026
Anthropic just announced 1M context GA at standard pricing for Opus 4.6 & Sonnet 4.6, when will Cursor reflect this? Discussions max-mode , anthropic	5	4271	March 19, 2026
Claude opus 4.5: Max vs Default mode Help max-mode , anthropic	1	278	January 30, 2026
Bad Usage Reporting on 3rd Party Extension Bug Reports	35	1108	July 27, 2025

$20 for a single request

Scenario A — 90% cached

Scenario B — 98% cached

Related topics