Agent behavior much different for o4-mini, o3

rpggio · May 1, 2025, 5:21pm

The Agent seems to behave much differently when preparing context for o4-mini and o3, as compared to Sonnet 3.7.
Sonnet 3.7 (thinking) does quick visible thinking with the context I provided, and gets immediately to work, sometimes with no initial tool calls.

With o4-mini and o3, the agent makes 10-60 tool calls just grepping and reading files. It takes 3-20 minutes before I see any visible thinking, just so so many tool calls. And the model may forget the context that I provided at the beginning, launching into its own plan based on the context it assembled.

Am I missing something here? Is your agent code for Sonnet just that much better adjusted? The new OpenAI models have great instincts for coding, but something seems wrong with how you have them wired into agent.

The difference in agent behavior is the most confusing. I feel like I must have configured something incorrectly.
Why does it take many MINUTES to scan a bunch of local files? Could my file indexing be broken? It shows 1000+ files indexed.

121kr0x33 · May 1, 2025, 5:57pm

Same here, o4-mini is painfully slow, but it’s more accurate than gemini 2.5 pro in my case

rpggio · May 1, 2025, 9:18pm

Apart from the questions I raised above, this feature request would help the situation:

Topic		Replies	Views
Why is claude 3.5 sonnet superior for composer? Discussion	3	865	February 2, 2025
Sonnet still better than o3-mini? Discussion	2	868	February 7, 2025
Claude Sonnet 3.5 Agent is sooo much better than o3 mini high Discussion	13	2947	February 21, 2025
Sonnet 3.5 vs o3 mini Discussion	16	3190	February 22, 2025
Cursor is too slow or doesn't generate responses on paid plan Bug Reports	3	257	May 1, 2025

Agent behavior much different for o4-mini, o3

Related topics