Crazy high cost opus? Up to 70 requests for 1 prompt

Where does the bug appear (feature/product)?

Cursor IDE

Describe the Bug

I used all my tokens in several prompts and I don’t know how.

Steps to Reproduce

I used Opus in max mode

Expected Behavior

Several tokens, not hundreds.

Screenshots / Screen Recordings

Operating System

MacOS

Current Cursor Version (Menu → About Cursor → Copy)

Version: 2.2.20
VSCode Version: 1.105.1
Commit: b3573281c4775bfc6bba466bf6563d3d498d1070
Date: 2025-12-12T06:29:26.017Z (6 days ago)
Electron: 37.7.0
Chromium: 138.0.7204.251
Node.js: 22.20.0
V8: 13.8.258.32-electron.0
OS: Darwin arm64 25.1.0

For AI issues: which model did you use?

Opus 4.5

Does this stop you from using Cursor

No - Cursor works, but with this issue

Hey there!

Max Mode uses significantly more context per request, which is why you’re seeing higher usage than expected. It’s designed for situations where you need the model’s full capabilities and aren’t particularly cost-sensitive. Max mode will consume usage much faster than non-Max mode.

If you’d like to preserve your usage for more requests, consider sticking with regular (non-max) mode for most tasks.

It’s just said that I didn’t get a warning and that I’m already done for the month. Just by a small mistake :confused:. Just can’t believe the usage, i know its maybe more context, but if on other scenarios we are hitting above 1M tokens, with just 1 request. So seems too high

My record is 120.8 requests in a single prompt, I can only use max mode when I’ve burned through my included request and it is counting against on demand usage.

This topic was automatically closed 22 days after the last reply. New replies are no longer allowed.