Does anyone actually use different models for different tasks?

Definitely.

Auto or GPT-5-mini (free, I still have unlimited Auto), Composer 1.5 if those struggle, and then Sonnet 4.6 (non thinking) if I am doing something particularly complex or the previous models are failing. This keeps my costs very low and saves my usage for when I really need it.

I’ve used other models on accident at times because the selector got changed to defaults, usually with restarts of the app and-or upgrades. It’s taught me how superior Opus is, and Claude in general, to all other models. Every time it’s happened, I figured it out almost immediately because the agent was off-track. For coding, Claude is King.

1 Like

Very true. Just more expensive.

Naaa, not in AntiGravity, unless you are a very, very heavy user.

I thought antigravity was free right now? Kind of confused on its pricing from an individual’s perspective. I get the sense that antigravity is still in a preview/promotional phase meaning it’s cost per requests per model may not be representative long term, but yea free is cheaper than anything that cost.

I believe they have a free tier, like Cursor, but those limits are probably fairly strict. I should have been more specific. I am on the Ultra plan, which is currently on sale for $200 a month. When the sale ends, the assessment of value will change somewhat, perhaps. But it will still be way cheaper than Cursor. I don’t mean to harp on Cursor. It’s great. But they have to get the cost down. Another thing I like about AntiGravity is when you hit a limit, it is temporary. The limits refresh after a period of time.

2 Likes

Yes, sometimes I need speed to get work done faster. Kimi K2.5 mostly aligns with my workflow, and it’s fast enough for most of my tasks. If it’s not enough, then I switch to Codex as my go-to model.

If other models don’t work after 1–2 prompts, I usually start a new chat, refer to the previous conversation, and continue from there.

I don’t use expensive models because I don’t want to depend on them. My currency is weak, so it’s hard to compete with Opus or Sonnet-level pricing haha.

If I had more capital, I wouldn’t mind using more expensive models. At the end of the day, it really depends on the budget we have as engineers.

Based on @Erkan_Arslan , is it a good idea to switch models for every other query in the same chat to save on tokens and costs? My understanding is that switching models breaks prompt caching, forcing the new model to re-process the entire chat history as new input. Since this increases costs and different models handle history differently, is switching models within the same chat actually advisable?

1 Like