Whenever I resume a chat that I left overnight, my account is hit with a 10+ request usage event, roughly 10x the size of an average usage event. The size is similar to the first request of a chat, which carries a lot of initial data. It seems Cursor is re-seeding the chat in order to continue it.
Caching lasts 5 minutes with Anthropic, or an hour, but there's no way Cursor is paying for the hour-long option.
By default, the cache has a 5-minute lifetime. The cache is refreshed for no additional cost each time the cached content is used.
5-minute cache write tokens are 1.25 times the base input tokens price
1-hour cache write tokens are 2 times the base input tokens price
Cache read tokens are 0.1 times the base input tokens price
Regular input and output tokens are priced at standard rates. (Source: Prompt caching - Anthropic)
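To put rough numbers on why a resumed chat feels like a 10x usage event, here is a small sketch using the multipliers quoted above. The base input price and the 200k-token context size are assumptions for illustration, not Cursor's actual figures:

```python
# Rough cost comparison (hypothetical numbers): resuming a chat whose cache
# has expired (full cache write) vs. one whose cache is still warm (cache read).
# Assumes a base input price of $3 per million tokens, which is an assumption.

BASE_INPUT = 3.00 / 1_000_000        # $ per input token (assumed base rate)

CACHE_WRITE_5M = 1.25 * BASE_INPUT   # 5-minute cache write multiplier
CACHE_READ     = 0.10 * BASE_INPUT   # cache read multiplier

context_tokens = 200_000             # large seeded chat context (assumed)

cold_resume = context_tokens * CACHE_WRITE_5M  # cache expired: re-write everything
warm_resume = context_tokens * CACHE_READ      # cache alive: cheap read

print(f"cold resume: ${cold_resume:.2f}")
print(f"warm resume: ${warm_resume:.2f}")
print(f"ratio: {cold_resume / warm_resume:.1f}x")
```

Under these assumed numbers, a cold resume costs 12.5x a warm one, which lines up with the "10x of an average usage event" observation above.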
If I’m reading it correctly, the timer is refreshed each time the cache is used, so it might be possible to use tricks to keep it alive, but I doubt Anthropic would like that.
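The "trick" would look something like the following: periodically send a minimal request that reuses the cached prefix before the 5-minute TTL expires, since any cache hit refreshes the lifetime at no extra cost. This is only a sketch; `send_cached_request` is a placeholder for whatever call reuses the cached prefix, not a real Anthropic SDK function:

```python
# Hypothetical keep-alive loop: refresh the 5-minute prompt cache by touching
# the cached prefix before its TTL runs out. Likely against the spirit of the
# pricing, as noted above.
import threading

CACHE_TTL_SECONDS = 5 * 60
SAFETY_MARGIN = 30  # refresh slightly before expiry


def keep_cache_warm(send_cached_request, stop_event: threading.Event) -> None:
    """Ping the API with the cached prefix so the TTL keeps resetting."""
    while not stop_event.is_set():
        send_cached_request()  # any cache hit refreshes the 5-minute lifetime
        # Sleep until just before expiry, or exit early if asked to stop.
        stop_event.wait(CACHE_TTL_SECONDS - SAFETY_MARGIN)
```

In practice you'd run this in a background thread and set `stop_event` when the user actually resumes the chat.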
Ah yes, good point about prompt caching. It's used when you continue working in the same chat so that the whole conversation doesn't consume input tokens again. Though I don't know the caching time.
I don’t know what you call it internally. From my perspective: whenever I come back to a chat after “a while” (I haven’t determined how long I have to wait yet), I’m hit with large usage even though my request after coming back was just something like “continue”.
While I do not have insight into the precise settings, it’s likely that the chat cache (which depends on the AI provider) has timed out, so resubmitting the thread is counted by the provider as a cache write, since the cache is now empty.