Previous version of the guide:
Overview
Auto
Great for quickly digesting codebases, light analysis, and simple edits. Use it whenever possible — it’s fast, reliable, and free. Seriously, there’s no reason not to default to Auto for routine tasks.
o4-mini
When Auto falls short or doesn’t quite get what you’re asking for, switch to o4-mini. It’s not great with long context, but you can count on it for straightforward code edits. Bonus: it’s about 3× cheaper than Gemini 2.5 Pro or Claude 4.
Claude 4
I’m not a huge fan of how proactive Claude can be. And I really don’t see the point in paying extra for the “Thinking” variant.
However, Claude 4 is a very good bloodhound when you need to dig into a problem or you’re too lazy to think for a prompt.
Gemini 2.5 Pro
The former king for complex multi-step tasks. Had persistent edit_tool issues in Cursor. It’s holistic approach to problem-solving makes it one of the most well-rounded models — right behind Grok 4 and o3-Pro.
o3-Pro
Think of this like a precision tool. Try it in Manual mode when you want deep, one-shot analysis over anything that you put in its context. Just remember — the minimum cost per request is $0.80. Are you sure the task is worth that?
Grok 4
The new champion. Cursor’s team is actively optimizing its integration — they even visited the XAI office to get it dialed in. Grok 4 clearly approaches problems differently. It does things that Gemini 2.5 Pro can’t.
Optimal Strategy
Budget-Friendly (Pro users)
- Auto for everything until it breaks.
- Gemini 2.5 Pro for complex tasks.
- Grok 4 for one-shot analysis in Manual mode.
Balanced Strategy (Pro+ users)
- Auto and o4-mini for minor edits.
- Claude 4 for medium tasks, lazy prompts or as QA-engineer without code editing
- Gemini 2.5 Pro for in-depth engineering tasks.
- Grok 4 when you want to GROK THE REAL PROBLEM.
Premium (Ultra users)
- Claude 4 or Gemini 2.5 Pro for quick edits and analysis.
- Grok 4 MAX for everything else.
Turn on bell in Watching to get updates about the guide!