Max mode vs non-max mode (context max, not thinking max)

deanrie · March 10, 2026, 8:40am

Hey, this is expected behavior. Max Mode always applies a 1.2x multiplier to the API cost, no matter how much context is actually used in the conversation.

From the docs: Max Mode | Cursor Docs

Max Mode consumes usage at 1.2x the normal API rate for the selected model.

The 1.2x is the fee for access to Max Mode features like extended context, subagents with models other than Composer-1, and image generation on request-based plans, not for actually going over the base context window.

If you are not using subagents or image generation, it is usually easier to keep Max Mode off. Colin explains it in detail here: Claude opus 4.5: Max vs Default mode - #3 by Colin

Topic		Replies	Views
Opus 4.7 context window (increase non-max above 200k) Feedback max-mode , context , openai , anthropic	8	595	May 7, 2026
$20 for a single request Help max-mode , anthropic	13	638	March 14, 2026
Anthropic just announced 1M context GA at standard pricing for Opus 4.6 & Sonnet 4.6, when will Cursor reflect this? Discussions max-mode , anthropic	5	5168	March 19, 2026
Opus 4.6 vs opus 4.6 Max Pricing Help max-mode , anthropic	1	1110	March 18, 2026
Cursor automatically invokes Claude 4.5 long-context mode (context-1m-2025-08-07), increasing costs and hitting Bedrock rate limits Bug Reports java	19	1086	January 22, 2026

Max mode vs non-max mode (context max, not thinking max)

Related topics