Prompt caching with Claude

Orpheus · August 14, 2024, 5:25pm

Prompt Caching is a powerful feature that optimizes your API usage by allowing resuming from specific prefixes in your prompts. This approach significantly reduces processing time and costs for repetitive tasks or prompts with consistent elements.

Prompt caching with Claude allows developers to store frequently used context between API calls, reducing costs by up to 90% and latency by up to 85%.

amanrs · August 14, 2024, 6:48pm

We’re mainly excited about supporting longer context workloads at much lower latency. Expect features that use this in the coming weeks.

jc_codes · August 14, 2024, 7:22pm

Thank you!

Taidan · August 14, 2024, 7:39pm

and hopefully more or equivalent uses

deanrie · August 15, 2024, 6:07am

Cool! I can’t wait.

Jeremy · August 15, 2024, 3:50pm

caching cursor, nice ring to it

ilessio_aiflow · August 16, 2024, 9:00am

When ?

leoing · August 16, 2024, 7:13pm

Excited!

jshay21 · August 31, 2024, 3:14am

@amanrs Hi super excited for this feature. Is there an update on the timeline?

arpagon · September 3, 2024, 11:47pm

Any news about this?

sugoidesune · September 16, 2024, 5:42pm

At the moment the cached prompts have a lifetime of only 5 minutes.
So unless Cursor gets a special cache lifetime it’s not a huge improvement.
And caching costs 25% more so if you cache unimportant context you might end up paying more than if you didn’t as it expires in 5min.

TroyRob · September 17, 2024, 2:46am

Bumped. @amanrs Any updates on prompt caching with Claude? Has this been integrated?

Orpheus · September 27, 2024, 5:57pm

Gemini also has context prompt / context caching
Context caching | Gemini API | Google AI for Developers (and its pricing Gemini API 定價 | Google AI for Developers)

Orpheus · September 27, 2024, 6:05pm

JohnCena · February 15, 2025, 3:27am

Any updates on this topic?

danperks · February 25, 2025, 1:57pm

While we won’t discuss too many details here, we do utilize some efficiency features behind the scenes like Prompt Caching.

Topic		Replies	Views
How is the Claude-4-sonnet consumption record calculated Discussions	6	355	July 9, 2025
How does Claude 3 Opus in Cursor compare with Opus on the Claude.ai webui with context and output size? Discussions	5	3460	June 4, 2024
Cursor prompt unable to connect to claude api Discussions	3	166	December 31, 2024
How long is chat "cache" stored for? Question about resuming chats left for a few hours Discussions	5	138	June 19, 2025
Context and profiles How To	0	35	April 27, 2025

Prompt caching with Claude

Related topics