I have to rewrite commands, stop it from generating responses, correct it, make it find solutions to simple answers that it was supposed to address easily..
I kept it running and it deployed a script deleting all of my functions from firebase.. completely out of what i have asked. wtf?
I noticed the same thing too. Just 3-4 weeks ago everything was working fine, but now I have to fix even simple scripts 5-6 times. It feels like it’s been intentionally made “dumber” so that users would spend more tokens. There’s simply no way a model could start making this many mistakes literally within a few weeks. And I’m not even talking about other issues.
No, Claude 4 Sonnet is not getting dumber, it performs really well, I use it daily without such issues on the personal Pro plan.
Creating a thread in Bug Reports would be great to help investigate your issue properly.
Cursor Team has stated repeatedly that there is nothing making models dumber and not intentionally. With a Bug Report the issue could be checked and made sure that models work well.
Please feel free to tag me there with @ so I don’t miss your Bug Report.
I see the same pattern in the last week. Hallucinating more then ever, doing things you tell it not to even multiple times, telling it to continue, asking it is it stuck (and with very simple things). I am on the point that I am asking myself if I am just scammed for more and more requests.
If the issues are related to following rules, yes, it has been less consistent in following them than the first two weeks, it’s possible that the cause isn’t Cursor(like system prompts), I’ve read an user in reddit that says new models have their full capacity at start to show muscles and get degraded slightly to optimize resource usage(and monetary extraction) -but I’ve not seen yet research confirming this-.
Its not cursor but the models themselves, you need to implement strict rules and knowledge for the model upon initially calling it, sometimes it will swap a fresh instance, when it does that the new instance will have no recollection of your convo/history even in the same chat unless you tell it to read the history.
Interesting I stopped using rules. Feels like there is no need for them anymore with latest Claude 4 Sonnet Thinking.
Well, I have one user rule now: Follow SOLID & DRY
Lacakdaisical is right, files for info storage: feature details, progress log, implementation plan. I ask Agent to read MD files I create or to write its own updates.
Usually my prompts are 1-2 sentences and then it works on that for 1-3h.
Implement plan for feature {approximate filename}.
i don’t like using memories. if they screw up, they save the memory with the error thinking that it was implemented right. i keep memory saving active, but constantly delete them all.
Oh, I didn’t mean the cursor memory, I have that off myself, but rather a file that stores what you need done/specifications/ect.. to add to T1000’s comment, lets say you put a prompt/implementation plan/ect into a file, tell the model to “run filename.md” or whatever extension your using and it will update, in the file you can tell it to check chat history to refresh memory and other things you need, add for it to update the file after delivery/implementations for consistency.