Claude 3.5 Sonnet routing?

m955 · October 29, 2024, 7:20am

Asked it because the answers were obviously not as it used to, and to my surprise, I can see why now…

Ethras · October 29, 2024, 7:29am

I think it should show a warning when redirected to another model (due to outage, or anything else). Also I hope it doesn’t count toward the monthly count.

deanrie · October 29, 2024, 8:41am

Hi @m955

Anthropic had a few incidents today, so you were automatically switched to another model:

Dreams · October 29, 2024, 11:34am

Rerouting models without user knowing isn’t very transparent and, in my opinion, is misleading and untrustworthy. I would rather have failed requests to Anthropic and be certain that the model I’m using is actually Anthropic. Sometimes I really feel like I’m not getting what I’m paying for.

Sonnet seems to perform poorly at times, even when it’s up, going in endless loops without solving my problem. And when I switch with my API key, it’s doing it in 0-shot. This is really strange, and it’s disappointing to see that there’s logic for this in Cursor, so that could be true.

deanrie · October 29, 2024, 12:03pm

Didn’t you get any notifications about the model change? It should be there, and it’s up to you to decide whether to wait for the model you need to work or choose another one.

ssmits · October 29, 2024, 1:23pm

Did you try with o1-mini or Gemini? I don’t think it’s the model but the complexity of the problem + changes in prompting methods could also help

Dreams · October 29, 2024, 2:34pm

I am talking here from a coding session 1 week ago where i used normally my Fast-Requests and Anthropic had no downtime and it did not solve my problem after 5+ prompts. Then i added the same files for context and did the same prompt, and apparently, with my Anthropic API key, it solved it 0-shot. This is somehow weird if you ask me.

ssmits · October 29, 2024, 2:53pm

They updated to the new Sonnet version yesterday, maybe that was it then.

amanrs · October 30, 2024, 4:10am

We never switch to GPT-3.5 when anthropic has downtime. We switch to GPT-4o only when anthropic has downtime. The downtime was momentary enough yesterday that we did not show the error message to users, but we’ll be sure to do this next time.

The model will tell you it is GPT-3.5 because the switch is done in a way that re-uses our sonnet prompts. So when GPT-4o sees a claude-3.5 prompt, it sees 3.5 and thinks GPT-3.5

In general we’ve been very frustrated with Anthropic’s reliability and hope as they scale they will have fewer incidents. It’s most frustrating when they have downtime that isn’t reported on the status page so users think we’re switching models just to save money.

Ammar_Sabah · October 30, 2024, 2:08pm

Same for me. For three months, I have been a pro subscriber. I used cursor to fix a script with Sonnet 3.5’s long context (200k) chat and ended up in a vicious cycle: 7 prompts with nothing only made things worse. Then I tried the same prompt with the same context using Cline, and it resolved it in two shots. I am considering switching back to VScode and canceling my pro subscription. I believe that Cursor is over-advertised. The only drawback of Cline is that it consumes a large number of tokens, but it is worthwhile because it resolves the issue and saves the project.

Topic		Replies	Views
Anthropic is experiencing a partial outage right now. To avoid downtime, we've temporarily fallen back to an OpenAI model Discussions	7	378	November 7, 2024
Model fallback? Bug Reports	6	123	October 25, 2024
Unable to reach Anthropic March 2025 Bug Reports	21	1684	April 14, 2025
We've hit a rate limit with Anthropic Discussions	13	7097	March 13, 2025
Please Turn Claude Back On Discussions	5	785	October 13, 2024

Claude 3.5 Sonnet routing?

Related topics