Hey, a few users are reporting something similar. There’s a related thread: "Has anyone noticed performance issues with Opus 4.6 (Max Mode) recently?"
A couple things to keep in mind:
- Opus 4.6 with extended thinking is naturally slower than other models because it does extra reasoning steps.
- There were some recent issues on the Anthropic API side (Claude Status: "Elevated errors on requests to Claude Opus 4.6"), which could have affected speed.
- The team is aware of the Opus speed reports. No specific ETA yet, but every report helps us prioritize.
To look into your specific requests, we’ll need Request IDs from slow sessions. You can get them from the top right of the chat > Copy Request ID. Two or three examples are enough.
Also, are you using Opus in Max Mode or normal mode? And roughly how long are replies taking: around 30 seconds, or more than a minute?
As a workaround, try starting fresh chats instead of continuing long threads. Large accumulated context can add a lot of latency.