Thanks for sharing. In my experience, though, other models may do better on a small, narrowly scoped coding task. But if your task involves a lot of context, I think Gemini 3 Pro understands it better, and in my view understanding matters more.
I once tried Claude Sonnet 4.5, GPT-5.2, Gemini 3 Flash, and Gemini 3 Pro on the same fairly complex refactoring task. In the end, only Gemini 3 Pro accurately understood my intent and completed the task in a “reasonable” way. The hardest thing for AI in programming (and in other tasks too, of course) is being “reasonable,” by which I mean the result feels like the way it should have been done. That is hard to achieve, and it depends on the model’s ability to understand complex context.
Some say Claude Sonnet 4.5 excels at programming, but I’ve always held the view that programming isn’t just about writing code—it’s more about “understanding.” For example:
- Understanding the goals of your project
- Understanding your intentions
- Understanding what the code means in the real world (for example, better grasping the spatial structure of a UI)
These abilities actually extend beyond mere coding. That’s why a model with more parameters and multimodal capabilities typically performs better in this regard than a purely programming-focused model.
This is why I trust Gemini 3 more.
Some people here have said that Gemini 3 Flash is better than Pro. I agree, but not for every task. Gemini 3 Flash sometimes cuts corners, and in my experience Pro is more likely than Flash to get a complex task right on the first try.
Where I think Gemini 3 Flash beats Pro is that Flash doesn’t overthink. It’s as smart as Pro but more “straightforward,” which on some tasks fits your intuition better. Pro, on the other hand, is more “deliberate.” Its suggestions may not match your first intuition, but that’s not a flaw. Sometimes the reason they don’t match is simply that we’re “not good enough” to see right away what the expert was getting at.