Why does Cursor consume an absurd amount of cache read tokens?

Colin · February 23, 2026, 5:40pm

Hey all!

This is effectively just going to rephrase @Andres_Cardona’s answer, but I hope it’s helpful. It might sound like it’s talking to beginners, but I want to make sure this is approachable for anybody reading!

When you hover over Tokens on your usage page, the number you see is the aggregate across every LLM call that contributed to that request, not a single call.

A single message in Cursor can (and typically does) trigger multiple LLM calls under the hood. The agent may read files, invoke tools, apply edits, or reason through a plan, and each of those steps constitutes a separate call. All of them are rolled up into a single row on the dashboard, and aggregation stops only when the request is complete (when you can type a new message).

To illustrate: suppose your first message sends 20k tokens of context, and overall it requires 10 LLM requests to finish. You’d see 20k input tokens and roughly 180k cached tokens, because each subsequent request reuses the same prefix the provider already has cached. Those cached tokens also carry forward to the next message within the same conversation.

This is also why you might see a total token count that exceeds the model’s context window. It isn’t one enormous call, but the sum of all calls made during that turn.

If you’re curious about what’s consuming your tokens, you can ask the agent directly: What’s in your context window right now? Be exhaustive.

We’re always looking to make the context window more efficient!

Topic		Replies	Views
Why is a simple edit eating 100,000+ tokens? Let’s talk about this Discussions	107	11327	February 11, 2026
Cursor high token usage Help context , byok , large-codebases	12	1604	June 26, 2026
Cursor Costs Are Climbing Without a Clear Reason Feedback	25	1115	May 29, 2026
Pro Plan Burned in 10 Minutes by Background Agent Calls — Completely Unacceptable Feedback	46	3904	July 23, 2025
Where can I see REAL TIME usage against my monthly subscription plan? Feedback	35	27317	February 4, 2026

Why does Cursor consume an absurd amount of cache read tokens?

Related topics