Sonnet 4 / GPT-5 usage period is too short, Auto model is highly problematic

Hi @matrix_code, thank you for the detailed post. Here are a few remarks from my experience:

  • Please check whether you can optimize your Sonnet 4 and GPT-5 usage by reducing context size and keeping Agent runs focused, as this will stretch your usage further. Check out Understanding LLM Token Usage for best practices; there is also a rough token-counting sketch after this list.
  • We are improving Auto mode by adding stronger models and better request handling. Feel free to file bug reports for any issues you find with Auto.
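
If you want a quick feel for how much context a given file adds, here is a minimal sketch using the tiktoken library. The `cl100k_base` encoding is an assumption on my part (models use different tokenizers), so treat the counts as rough estimates rather than exact Cursor accounting.

```python
# Rough estimate of how many tokens a file would add to the context.
# Assumes tiktoken is installed (pip install tiktoken); the encoding
# name is an assumption, so counts are approximate.
import sys
import tiktoken

def estimate_tokens(path: str, encoding_name: str = "cl100k_base") -> int:
    enc = tiktoken.get_encoding(encoding_name)
    with open(path, encoding="utf-8", errors="ignore") as f:
        return len(enc.encode(f.read()))

if __name__ == "__main__":
    for path in sys.argv[1:]:
        print(f"{path}: ~{estimate_tokens(path)} tokens")
```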

As to why usage differs from apps like ChatGPT:

  • Context matters a lot: a larger context means more unrelated code, comments, etc. are sent to the model for processing, which both consumes more tokens and makes the model more likely to get confused and make mistakes.
  • AI providers do apply usage optimizations in their own apps like Claude chat and ChatGPT, but those apps also have hourly limits after which they downgrade you to smaller models; in Cursor, switching to a smaller model is the user's choice.

Additionally, note that the cost is essentially set by the AI providers: powerful LLMs are very large and must run on machines with many expensive GPUs, so they cost far more to serve than smaller models. See the price difference between Sonnet and Opus, where the gap is roughly 5x.
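
As a rough illustration of that gap, here is a back-of-the-envelope calculation. The per-million-token prices below are assumptions based on published list prices at the time of writing and may change, so take the exact dollar figures with a grain of salt.

```python
# Back-of-the-envelope cost of one request at assumed list prices
# (USD per million tokens); actual provider pricing may differ.
PRICES = {
    "sonnet": {"input": 3.00, "output": 15.00},
    "opus":   {"input": 15.00, "output": 75.00},
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000

# Example: a 50k-token context with a 2k-token reply.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 50_000, 2_000):.2f}")
# sonnet: $0.18, opus: $0.90 -- a 5x difference at every scale.
```

The same math also shows why trimming context helps: input tokens dominate the cost of a large request, so halving the context roughly halves the bill regardless of model.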