Developers’ perspective: a comparative analysis of the applications of Claude‑3.7‑Thinking, Gemini‑2.5‑Pro, and the o3/o4‑High series models

Hi everyone—are there any developers who can share the differences between Claude‑3.7‑Thinking and Gemini‑2.5‑Pro in real-world use? Also, for the newly introduced o3‑High and o4‑High models, what kinds of coding tasks do they excel at in practice? (For example, I find that Gemini‑2.5‑Pro is better at understanding an entire codebase and offering suggestions and analysis. Claude‑3.7‑Thinking’s coding skills are indeed strong, but its reasoning can be overly divergent and thus needs very strict constraints. As for o3‑High and o4‑High, I haven’t used them yet.) I look forward to hearing your insights!

1 Like

In my opinion, if you are working on the frontend and looking for creative UI, Claude is the champion. If you are working on the backend, you can save costs by using GPT-4.1 / o4-mini for free. If the backend is very challenging, you can opt for Gemini 2.5 Pro. If that does not satisfy your needs, consider using Claude Sonnect 3.7 thinking but make sure you add complete and many context even in agent to prevent halu

2 Likes

actually this is a really good description of a good workflow!

1 Like