Gemini and Claude do best in agentic mode. GPT4o did better before its update, and it was far more deterministic.
I would stick to the agents that function best in Cursor instead of beating a dead horse trying to work with ones that knowingly are sub-par in the environment.