Claude-4-sonnet thinking is extremely slow since yesterday

astralski · June 4, 2025, 10:12am

Hi,

I’m using the Pro version of Cursor.

Since about two days ago, the Claude-4-Sonnet (thinking) model has been extremely slow — with an average TPS around 0.3, which makes it nearly unusable for productive work.

The non-thinking version of Claude-4-Sonnet is significantly faster and works fine.

Please look into this — the performance degradation is really noticeable and slows down everything.

Macos 15.5

Version: 0.50.7

VSCode Version: 1.96.2

Commit: 02270c8441bdc4b2fdbc30e6f470a589ec78d600

Date: 2025-05-24T18:19:58.349Z

Electron: 34.3.4

Chromium: 132.0.6834.210

Node.js: 20.18.3

V8: 13.2.152.41-electron.0

OS: Darwin arm64 24.5.0

condor · June 4, 2025, 10:19am

Hi there,

in general the TPS depends mostly on the AI provider, Anthropic and their regional load in your area. Naturally its not good to see TPS dropping.

Could you check following to exclude potential other causes:

Have you used up your 500 included requests per month?
What amount of tokens did the chat have when it started getting slow?
Can you reproduce the issue with a new chat? (With privacy mode disabled for this one request, so the Cursor Team can review the process, then turn it back on and post here the Request ID)

astralski · June 4, 2025, 10:59am

I didn’t check the API directly, but the same model works just fine in the web app.

I’ve used around 250 out of 500 included requests.
The slowdown happens from the very beginning of any chat, both old and new.
It’s not related to long chat history or context size.
Sorry, I can’t reproduce the issue right now — it suddenly started working super fast again ^^

No idea what changed, but TPS is back to normal.

yurkomik · June 4, 2025, 7:55pm

I can confirm that for the last two days, the issue has been the same. Before that, we had messaging that there are vertex rate limits, then just 1 token per second speed. It’s in fast requests pay-as-you-go mode. Sometimes it gets better, but I feel that Anthropic could make the model dumber to fix the load. It usually happens late at night. And in the mornings. Most likely when Europe and Asia are heavily using API. I’m in Toronto, Canada.

mikler · June 4, 2025, 10:01pm

Yeah. Same here. It is borderline unusable. I am trying to work with Claude 4 in the MAX mode. With usage-based spend. Seems to depend on time of the day. It was working fine in the morning, but around 3PM Pacific - TPS just tanked.

seonedir · June 5, 2025, 12:51am

Did Sonnet 4 come to the slow pool?

yurkomik · June 5, 2025, 3:32pm

It’s possible that Claude is limiting competitors to force users to switch to their own coding tool. They released Claude code for all Pro users. It’s hard to say. I will definitely install Claude Code in terminal and try to switch that when cursor speed is impossible.

eburgwedel · June 6, 2025, 8:52am

I have been observing the same problem over the last couple of days. Sometimes it’s super-fast, sometimes it’s super-slow. Sometimes completely unusable. Not sure where the source of the problem is.

alberduris · June 9, 2025, 10:38am

This is still an issue. At some times, 50% of the messages are throttled to <1 TPS. Does anyone have more information on this?

condor · June 9, 2025, 10:52am

As most users are not affected by this issue, more information is needed to check the cause.

If your requests are repeatedly slow could you please reproduce it in one request with privacy mode disabled so the Cursor Team can look into the details as otherwise they don’t see that.

Please post then those Request IDs here

olegKusov · June 9, 2025, 12:16pm

having same issue. claude 4 very slow (fast requests)

condor · June 9, 2025, 12:21pm

@olegKusov could you also provide the Request ID (with privacy off) so the Cursor Team can investigate.

I’m using Claude 4 Sonnet and cant reproduce the slow speed, more info is required.

topdown · June 9, 2025, 1:01pm

Claude Sonnet 4 is my daily driver. Currently on usage-based spend because I ran out of gas on my Pro account.

I am not experiencing any issues with slow responses.

In fact it just completed a task while I wrote this post.

astralski · June 11, 2025, 8:28am

UPDATE:

It’s been a week since then. Since that time, I’ve been experiencing these kinds of TPS drops every day, for what seems to be around 30 minutes to 1.5 hours per day, during which the performance becomes completely unacceptable.

In my opinion, even if the root cause lies with Anthropic, Cursor should be cutting off connections that are too slow.

Right now, you’re making me wait 15 minutes for a single response — which I’ll likely cancel midway — and I still get charged for faulty requests that are ultimately useless to me.

I get several to a dozen such requests every day. Let’s assume it’s just 5. If this were to happen daily, I’d be losing around 150 requests per month…

Especially since this seems to be a global issue. Other users mentioned experiencing it in Toronto and in the Pacific Time zone.

I’m based in Europe — Warsaw, Poland — and it feels like these drops happen between 10 AM and 2 PM CEST.

onurbolukbas · June 11, 2025, 9:57am

I’m also experiencing slow responses from Sonnet 4 here in Istanbul.

effi · June 11, 2025, 10:27am

Same, in Israel

cidxb · June 11, 2025, 12:01pm

And the performance has really dropped, forcing me to use other models…

rterry · June 30, 2025, 8:11pm

I am also having this issue…thinking seems fast enough, but code generation/file editing are impossibly slow.

condor · July 1, 2025, 11:40am

@rterry please file a full Bug report in a separate thread as this is not related to the issues above. The Bug report allows us to check what is happening as your text suggests. It’s not with Claude but with some tools, make sure to add a request ID, preferably with disabled privacy setting for this one request so the Cursor Team can investigate as otherwise they have no access to the tool calls in your request

Pascaltib · July 3, 2025, 3:33pm

Same issue for me! Sonnet 4 with thinking is nearly unusable. Without thinking works fine.

Topic		Replies	Views
Cursor really really really slow Bug Reports	14	1032	June 11, 2025
Claude 4.5 Sonnet <thinking> no longer thinking? Bug Reports	6	651	November 12, 2025
The cursor is running very slowly. Tasks that used to take 2 to 3 minutes to complete now often time out and feel laggy Bug Reports performance	10	330	February 26, 2026
Claude 4T extremely slow/breaking Bug Reports	2	232	July 24, 2025
Slow Request Causes Infinite Loop in Claude 3.5 Sonnet Bug Reports	2	310	August 18, 2025

Claude-4-sonnet thinking is extremely slow since yesterday

Related topics