Custom Models Set The Context Window to 1M

deanrie · May 8, 2026, 3:41pm

Yeah, that downstream effect is a fair point. With the 1M default in place, the auto-compaction threshold is calculated based on 1M. So if a model only supports 200 to 272K, the upstream API hits its real limit before our compaction kicks in, and the chat stalls. The Auto-Mode round trip is basically the only user-side workaround right now.

It’s the same root cause as the original report. Without a way for custom OpenAI-compatible models to declare their actual context size, both the indicator and the compaction trigger are wrong. The best place to add this info is the same feature request thread: Unlock Full Context Window with Own API Keys. Calling out the auto-compaction failure helps with prioritization, since it turns this from a nice-to-have into a workflow blocker for sub-agents.

Topic		Replies	Views
Cursor is blocking all custom models Bug Reports byok	10	1216	May 17, 2026
Subagents ignore user's own API key. Always bill against Cursor plan Bug Reports byok , anthropic , subagents	12	625	May 14, 2026
Unknown model ID: claude-opus-4-7-thinking-high Bug Reports anthropic	5	346	May 15, 2026
WARNING: Infinite "Cache Read" Loop in v2.4.x (Agent/Thinking Mode) – 96M Tokens Spike Bug Reports performance , anthropic	5	179	March 19, 2026
After integrating the Open API key, the existing AI becomes unusable Bug Reports byok	5	198	March 14, 2026

Custom Models Set The Context Window to 1M

Related topics