Extreme token usage

What specific issue do you have with it? What mode do you use? What prompts do you use?

I think we will not wait alot to find a good alternative, companies couldn’t compete with cursor before but not after their new pricing policy.

don’t use Claude-4 :slight_smile: just use grok-4
good output input and no stupid cache over 2m per request

i do not use memories

Yes, same here. I ran out of my “allowed subscription usage” in a matter of 2-3 days as well, without a clear explanation of why, and am still somehow charged additional costs on top of separate API calls due to running out of aforementioned subscription usage. I think a lot of us are lost on just how the pricing works.

I am trying it right now, it just created me a long plan for a simple problem that needs file reading, research and one code line fix

lol

yep, I set up spending cap of 10$, I exhausted them in 2 hours

Same issue of huge tokens usages starting 14th of July. Passing from less than $0.10 per request to $2

While memories are great to help the model have a longer-term understanding of your codebase, a lot of any context (memories, files, chat history, etc) can contribute to excessively high token usage.

In non-MAX mode, we limit the maximum context window to ensure this is minimised and requests are more predictable, but MAX requests (especially those with large context window models) have no cap, so can very high token usage surprisingly quickly.

To keep your token usage low, I’d recommend not using MAX mode unless it’s necessary (which it often isn’t!), routinely starting new conversations instead of continuing long ones on different topics, and trying to keep rules concise and to the point, as they are always included in every request you make!

The usage is INSANE. In the beginning I could code 16 hours a day for a month and only use around $50. Now we are half way through the month and I have spend $220..

Yeah, the token burn on Ultra is brutal, especially with Auto mode picking the priciest models. Quick tip: if you’re using Claude Sonnet 4 and not bringing your own API key, Cursor adds a 20% markup. You can avoid that by plugging in your own Anthropic key in settings. Same models, way cheaper usage. Helped me stretch my quota a lot further.


If you're using Claude Sonnet 4 via Cursor AI, you're probably paying 20% more than you need to

Yes, my usage increased also suspecially strong during the last 3 days…

I’m out. This is ridiculous. Unusable.

Oh, I love how that’s honest and up front on their page. Yep - I already cancelled. Cursor has been an absolutely awful and very expensive ride for me. The pricing is ridiculous and so is the usage.

They should offer full refunds for this month - it’s absolutely outrageous.

Hey all, just to clarify @0xHACKS’s message, Auto is actually free and unlimited on every plan, so does not have any effect on your usage!

Also, I’d recommend people to check Cursor - The AI Code Editor for how much their usage would’ve actually cost if you were using an API key direct with the model provider!

Ah, I see, appreciate the “clarification,” but that phrasing was deliberate. It caught attention, didn’t it?

Now I also see why you’re avoiding being upfront about the real markup Cursor adds when people don’t use their own API keys. That’s the actual issue, not just Auto mode.

Telling users “Auto is free” while skimming an extra 20% on Claude usage through your default config isn’t exactly transparent. People deserve to know what they’re really paying for, not just what sounds free on the surface.

Let’s not dress it up. Cursor’s a solid tool, but let’s be real about the business model.


If you're using Claude Sonnet 4 via Cursor AI, you're probably paying 20% more than you need to

Auto succeeds at simple to moderate tasks about 1 in every 20-30 prompts. Often times it goes on vision quests, hallucinates often. I don’t find it useful at all. Context seems to be lost constantly.

Yeah using ultra it went up a lot? More or less x10 tokens :confused:

Me too.Time to quit.


this is absurd levels of usage. it cannot be right. I never go more than 70-100k tokens a request. this was a big prompt but I can’t find a similar example in the past week