Token Usage Transparency, Context Controls & Approval Gates for Composer/Auto Mode

Mushin · February 11, 2026, 6:01pm

Feature request for product/service

Cursor IDE

Describe the request

Problem

A simple request can unexpectedly consume a massive number of tokens (millions).

While prompting style plays a role, users currently lack:

Clear visibility into what context is being sent
Fine-grained control over how much context is included
Approval mechanisms before large token calls execute
Tooling to trim context so that token budgets last longer and cost less

For teams working in large codebases, this creates:

Cost unpredictability
Fear of using Auto / Composer freely
Reduced trust in background agent behavior

Requested Features

1. Pre-Execution Token Estimate + Approval Gate

Before Composer / Auto executes a large call:

Show:
- Estimated input tokens
- Estimated output tokens
- Estimated cost
If the estimate exceeds a user-defined threshold, require confirmation

Example:

This action will use ~480,000 tokens (~$X).
Continue? [Approve] [Cancel] [Optimize Context]

User-configurable thresholds:

Hard cap (never exceed)
Soft cap (ask before exceeding)
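To make the gate concrete, here is a minimal sketch of the decision logic I have in mind. Everything in it is hypothetical: `TokenEstimate`, `Decision`, and `gate` are names I made up, not anything Cursor exposes, and the per-million-token prices in the usage note are placeholders, not real rates.

```python
from dataclasses import dataclass
from enum import Enum


class Decision(Enum):
    PROCEED = "proceed"  # at or under the soft cap: run without asking
    ASK = "ask"          # over the soft cap: show Approve / Cancel / Optimize Context
    BLOCK = "block"      # over the hard cap: never run


@dataclass(frozen=True)
class TokenEstimate:
    input_tokens: int
    output_tokens: int

    @property
    def total(self) -> int:
        return self.input_tokens + self.output_tokens

    def cost_usd(self, usd_per_million_input: float, usd_per_million_output: float) -> float:
        # Prices are passed in by the caller; no real price list is assumed here.
        return (
            self.input_tokens * usd_per_million_input
            + self.output_tokens * usd_per_million_output
        ) / 1_000_000


def gate(estimate: TokenEstimate, soft_cap: int, hard_cap: int) -> Decision:
    """Compare an estimate against the user's soft and hard caps."""
    if soft_cap > hard_cap:
        raise ValueError("soft_cap must not exceed hard_cap")
    if estimate.total > hard_cap:
        return Decision.BLOCK
    if estimate.total > soft_cap:
        return Decision.ASK
    return Decision.PROCEED
```

With a soft cap of 100,000 and a hard cap of 1,000,000, `gate(TokenEstimate(450_000, 30_000), 100_000, 1_000_000)` returns `Decision.ASK`, which is the point where the confirmation dialog above would appear.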

2. Context Visualization Panel

A live “Context Inspector” showing:

Files included
Snippets included
Conversation history length
System rules (.cursorrules)
Memory injections
External tool outputs

With the ability to:

Toggle items on/off
Collapse large files
Replace full file with summarized version
Pin essential files

This makes context intentional instead of opaque.
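As a rough illustration of what the inspector could track per item, here is a small sketch. The `ContextItem` shape and the `kind` labels are my own assumptions, and I am assuming that a pinned item always stays in context even if it is toggled off.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class ContextItem:
    name: str
    kind: str                  # e.g. "file", "snippet", "history", "rules", "memory", "tool_output"
    tokens: int                # size when sent in full
    enabled: bool = True       # the on/off toggle
    pinned: bool = False       # assumption: pinned items are always included
    summary_tokens: Optional[int] = None  # size of a summarized version, if one exists
    use_summary: bool = False  # send the summary instead of the full item

    def effective_tokens(self) -> int:
        """Tokens this item actually contributes to the next call."""
        if not self.enabled and not self.pinned:
            return 0
        if self.use_summary and self.summary_tokens is not None:
            return self.summary_tokens
        return self.tokens


def breakdown(items: List[ContextItem]) -> Dict[str, int]:
    """Effective tokens grouped by kind, which is what the panel would display."""
    totals: Dict[str, int] = {}
    for item in items:
        totals[item.kind] = totals.get(item.kind, 0) + item.effective_tokens()
    return totals
```

Toggling an item off or switching a large file to its summary changes `effective_tokens()` immediately, so the panel could show the saving before anything is sent.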

3. Smart Context Optimizer

Before sending context, allow:

“Optimize Context” button that:
- Deduplicates repeated code
- Removes irrelevant history
- Summarizes large files
- Replaces full files with embedding-based retrieval of the relevant chunks when possible
- Switches to structural diffs instead of raw file dumps

Goal: same intelligence, fewer tokens.
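Two of these steps are simple enough to sketch. This is only an illustration under my own assumptions: `rough_token_count` is a crude characters-divided-by-four stand-in for a real tokenizer, and `trim_history` keeps the most recent turns within a budget, which is a recency heuristic rather than a true relevance filter.

```python
import hashlib
from typing import Callable, List


def rough_token_count(text: str) -> int:
    # Crude stand-in for a real tokenizer: roughly four characters per token.
    return max(1, len(text) // 4)


def dedupe_chunks(chunks: List[str]) -> List[str]:
    """Drop chunks that repeat an earlier chunk, ignoring whitespace differences."""
    seen = set()
    kept = []
    for chunk in chunks:
        normalized = " ".join(chunk.split())
        key = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
        if key not in seen:
            seen.add(key)
            kept.append(chunk)
    return kept


def trim_history(
    turns: List[str],
    budget_tokens: int,
    count_tokens: Callable[[str], int] = rough_token_count,
) -> List[str]:
    """Keep the newest turns that fit in the budget, in their original order."""
    kept = []
    used = 0
    for turn in reversed(turns):
        cost = count_tokens(turn)
        if used + cost > budget_tokens:
            break
        kept.append(turn)
        used += cost
    kept.reverse()
    return kept
```

A real optimizer would need something smarter for relevance, but even these two passes would remove the most obvious waste: the same code pasted twice and old conversation turns that no longer matter.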

4. Persistent Context Strategy Modes

Allow users to choose per-project:

Strict Mode – minimal context, manual inclusion only
Balanced Mode – auto include but capped
Full Auto Mode – current behavior

This gives professionals control without disabling advanced features.
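Here is a small sketch of how the three modes could map to behavior. The names (`ContextMode`, `ContextPolicy`, `select_context`) and the 60,000-token cap for Balanced Mode are placeholders I chose for illustration, not existing Cursor settings.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List, Optional, Tuple

Item = Tuple[str, int]  # (name, token count)


class ContextMode(Enum):
    STRICT = "strict"
    BALANCED = "balanced"
    FULL_AUTO = "full_auto"


@dataclass(frozen=True)
class ContextPolicy:
    auto_include: bool                 # may the tool add files on its own?
    max_context_tokens: Optional[int]  # None means no cap


POLICIES: Dict[ContextMode, ContextPolicy] = {
    ContextMode.STRICT: ContextPolicy(auto_include=False, max_context_tokens=None),
    ContextMode.BALANCED: ContextPolicy(auto_include=True, max_context_tokens=60_000),  # placeholder cap
    ContextMode.FULL_AUTO: ContextPolicy(auto_include=True, max_context_tokens=None),
}


def select_context(mode: ContextMode, manual: List[Item], auto_candidates: List[Item]) -> List[Item]:
    """Manual items are always kept; auto candidates are added in order if the policy allows."""
    policy = POLICIES[mode]
    selected = list(manual)
    if not policy.auto_include:
        return selected
    used = sum(tokens for _, tokens in selected)
    for name, tokens in auto_candidates:
        cap = policy.max_context_tokens
        if cap is not None and used + tokens > cap:
            continue  # skip anything that would push past the cap
        selected.append((name, tokens))
        used += tokens
    return selected
```

The same request then produces three different context sizes depending on the project setting, and the user decides which trade-off they want.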

5. Composer Subagent Transparency

If Composer 1.5 spawns subagents or expands context automatically:

Show when it happens
Show why
Show how many tokens that branch used

This is not about restricting capability. It’s about transparency.
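The when, why, and how-many-tokens record could be as simple as a ledger like the one sketched below. `UsageLedger` and `BranchEvent` are hypothetical names, and I am assuming the tool can report a token count and a short reason for each branch.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class BranchEvent:
    branch: str  # e.g. "main" or a subagent identifier
    reason: str  # why the branch ran or why context was expanded
    tokens: int  # tokens that step consumed


@dataclass
class UsageLedger:
    events: List[BranchEvent] = field(default_factory=list)

    def record(self, branch: str, reason: str, tokens: int) -> None:
        self.events.append(BranchEvent(branch, reason, tokens))

    def totals(self) -> Dict[str, int]:
        """Total tokens per branch, in order of first appearance."""
        result: Dict[str, int] = {}
        for event in self.events:
            result[event.branch] = result.get(event.branch, 0) + event.tokens
        return result

    def report(self) -> str:
        """One line per event plus a grand total, suitable for a usage panel."""
        lines = [f"{e.branch}: {e.tokens:,} tokens ({e.reason})" for e in self.events]
        lines.append(f"total: {sum(e.tokens for e in self.events):,} tokens")
        return "\n".join(lines)
```

Even a plain-text report like this would answer the question users currently cannot: which branch spent the tokens, and why.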

Why This Matters

Working in large codebases requires token budgets that last.

Better context controls would:

Increase trust
Reduce billing shock
Encourage heavier usage (because users feel safe)
Make Cursor viable for long-term professional workflows

Right now context feels “magic.”
It needs to feel “intentional.”

If anyone else has experienced unexpected token spikes, I’d like to hear how you handled it.

michael-rentable · March 24, 2026, 3:33pm

A soft cap would be a great start. The unexpected million-token usage is brutal.
