Composer 2.5 is totally not good at conversations. You have to nudge and push and remind it every turn of the original intent, what is working, what is distracting and it often acts like you just hired a rideshare driver instead of a capable high-reasoning 2026 model, just above 40k tokens of context used.
Attention to detail is like 95% of Opus 4.7.
Simplicity of implementation is like GPT-5.5.
But there is always a snag, a missed point, a dice roll for what some words mean, a wild ride outside the box when it gets distracted by tool calls and forgets user imperatives. Single user usage is very cheap, I calculated budgets close to 2000$ on the Cursor Pro+ plan, that’s enough for 35 days of work every month.
But I can never leave it unattended or without review as a single mistake will lose hours of work. I don’t need to double-check Opus or GPT at all, at a glance I can see they understood the task, thus it is finished correctly. Not so with Composer. Teenage behavior.
Cursor needs to work more on sub-agents, or sub-sub-agents and agentic workflows to increase the costs and results quality, as less than 100$ a month for full time senior engineer is not a sufficient value transfer.