I just got one, and I am still far from my limits.
To give you a better sense of the issues with your question:
The paid subscription gives you a hard-to-track number of coding sessions with magically fast code assist. Unless it is not magical.
When you are paying, Tab-to-complete makes this feel like the only editor you want to use. It is the sort of magic I dream of Apple Intelligence giving me in all apps, with my private data. So if you code long hours, you will enjoy it and get used to it, and it’s awesome.
When you hit your limit, you still have the slow requests and smaller models, but the experience loses its magical touch, and your mood and productivity are severely impacted. In my solid 3 months of experience (10+ hours daily), you will either accept it with a sad face and complain here and on social media to relieve the pain, or just realize it’s worth paying more to keep your accelerated and fun productivity.
But here is the catch: unless it’s not magical.
More and more, I see variable output quality from the same LLM dealing with the same problems. To be clear, this is what I expect from humans after a hangover, a vacation, or an accident. It took me a while to learn to always stay alert to the output, because it seems to vary wildly. To summarize: there are many good days and a few terrible days.
And I experience this with at least 3 different paid LLM subscriptions (ChatGPT Plus, Google Gemini Advanced, and Cursor Pro, which I include here even though it is different).
I am not sure of the cause. My prompting is really consistent (and has improved), and I love to structure a good question, so I can only suspect the platforms' inference costs. Like all cloud services, they have SLAs/SLOs and capacity that needs to be managed, so on a bad day there is likely less memory/GPU/time/etc. available, which reduces the output quality.
This matters especially with code. As Cursor mentions somewhere in the docs, it limits the window to 20K tokens most of the time (if the LLM supports it). For code we need a large window (larger than the input box), and this is of course one of the aspects that can be optimized.
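To make the window point concrete, here is a rough sketch in Python, not anything taken from Cursor's docs. It estimates whether a set of files would fit in a 20K-token budget. The 4-characters-per-token ratio is only an assumed rule of thumb (real tokenizers differ, especially on code), and the function names are made up for illustration.

```python
# Rough check of whether some source files fit in a context window.
CONTEXT_BUDGET_TOKENS = 20_000  # figure quoted above; check the current docs
CHARS_PER_TOKEN = 4  # assumption: rough rule of thumb, real tokenizers differ


def estimate_tokens(text: str) -> int:
    """Very rough token estimate from character count (ceiling division)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_window(texts, budget=CONTEXT_BUDGET_TOKENS):
    """Return (total_estimated_tokens, fits) for a list of file contents."""
    total = sum(estimate_tokens(t) for t in texts)
    return total, total <= budget
```

Under that assumption, two files of about 40,000 characters each already use up the whole budget, before the prompt itself and any chat history are counted.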
If you are aware of these issues and accept them, you will be a happy customer until something better appears. Because while it’s good, it’s rather uncomfortable to have to stay vigilant and to lose an entire day of work trying to solve a bug, creating a dozen new ones, and then the next day the same LLM, with the same prompt and the same problem, solves it in 2 turns.
Hope this helps,
Páris