Where does the bug appear (feature/product)?
Cursor IDE
Describe the Bug
For tasks that I previously used Sonnet 4.5 and had no issue with, Sonnet 4.6 uses a ridiculous amount of thinking tokens and has a very strong tendency to overthink and use a large amount of reasoning tokens without actually producing a meaningful output. I did a direct comparison with Sonnet 4.5, which was able to smoothly reach the solution. Not sure how this was intended to be ingrated but I think the 4.6 series in general (both Sonnet and Opus) needs to be further investigated and refined because many users have already made complaints and observed how many thinking tokens these models use for certain tasks.
I understand that Anthropic now has a thinking effort parameter and is deprecating the budget tokens, so tuning is not completely in your control, but I think perhaps adjustments in the prompt or certain parameters could be fruitful.
Steps to Reproduce
Ask for it to make a simple scraper or something that collects data sources from different platforms and analyses using an LLM. This is just an example but it should be fairly straightforward to reproduce.
Expected Behavior
Produce at least some sort of output that adheres to my request and not have a large amount of reasoning tokens used for simple tasks
Operating System
MacOS
Version Information
Version: 2.4.27 (Universal)
VSCode Version: 1.105.1
Commit: 4f2b772756b8f609e1354b3063de282ccbe7a690
Date: 2026-01-31T21:24:58.143Z
Build Type: Stable
Release Track: Default
Electron: 39.2.7
Chromium: 142.0.7444.235
Node.js: 22.21.1
V8: 14.2.231.21-electron.0
OS: Darwin x64 22.6.0
For AI issues: which model did you use?
Sonnet 4.6
Does this stop you from using Cursor
Sometimes - I can sometimes use Cursor