Claude 3.5 Sonnet updated, and Claude 3.5 Haiku soon

Orpheus · October 22, 2024, 3:27pm

The updated Claude 3.5 Sonnet shows wide-ranging improvements on industry benchmarks, with particularly strong gains in agentic coding and tool use tasks. On coding, it improves performance on SWE-bench Verified from 33.4% to 49.0%, scoring higher than all publicly available models—including reasoning models like OpenAI o1-preview and specialized systems designed for agentic coding. It also improves performance on TAU-bench, an agentic tool use task, from 62.6% to 69.2% in the retail domain, and from 36.0% to 46.0% in the more challenging airline domain. The new Claude 3.5 Sonnet offers these advancements at the same price and speed as its predecessor.

Robert137498 · October 22, 2024, 3:36pm

Is cursor directing queries to the latest model automatically?

HYAZEUS · October 22, 2024, 3:40pm

Is the new model being used by cursor yet?

ssmits · October 22, 2024, 4:17pm

This feature should’ve been long coming imo, but I really need to use BOTH the models served by Cursor Pro and custom API Key at the same time. You can’t switch quickly because you have to go to the settings each time. For now I want to use the best model for most complex cases but for easier tasks I’d like to use the previous Sonnet 3.5

hmoran · October 22, 2024, 4:34pm

the blog post does say it’s available today but i wonder if cursor will have to update the api endpoint?

javicitou · October 22, 2024, 4:39pm

They just have to change the model to use the latest version, I imagine they will do it soon.

Orpheus · October 22, 2024, 4:54pm

I just opened Cursor and saw the model there.

truell20 · October 22, 2024, 5:56pm

Yep, new Sonnet should be out in the model dropdown

hmoran · October 22, 2024, 6:35pm

does the new model need to be tuned for cursor?

It’s not following instructions properly. I’m asking it to create a markdown file for requirements and it ends up modifying other files instead.

truell20 · October 22, 2024, 6:59pm

Interesting, could you Cmd + P > Report AI Action, select the right request, and add a comment?

hmoran · October 22, 2024, 7:29pm

For some reason it’s not showing up in the list since it’s been a couple hours and it’s only showing the most recent.

here I’m asking Composer to create a new SOP based on some prompt, but instead it goes and tries to edit route.ts

It happened at least four or five times consistently before I gave up

zerk1 · October 23, 2024, 3:56pm

Having this problem consistently with the new model unfortunately, extremely annoying.

matija2209 · October 24, 2024, 10:22am

Will we get Haiku 3.5 in Cursor soon? I’m burning through tokens so fast on Sonnet.

deanrie · October 24, 2024, 10:27am

It’ll be in the API by the end of the month, and I think it’ll show up in Cursor right away too.

matija2209 · October 24, 2024, 12:46pm

That’s great news. Hopefully it is as great in coding as they say.

pegiadise · October 24, 2024, 3:31pm

Knowledge cutoff still seems to be June 2023 for the claude-3-5-sonnet-20241022 (which was the previous model’s cutoff date). New claude-3-5-sonnet at https://claude.ai/ cutoff date is at April 2024. Are we sure this works?

pegiadise · October 24, 2024, 3:45pm

It was the long context beta setting, make sure it’s off. all good

boni · October 25, 2024, 9:55am

when will the long context claude also be updated to the new version? right now we only have claude-3-5-sonnet-200k which I assume is still pointing to the old version (?)

LionSR · October 25, 2024, 12:14pm

strongly hope the long-context new sonnet 3.5 will be added asap. Should be a no-brainer and free improvements?

primate · October 26, 2024, 12:01pm

New Claude Sonnet 3.5 has proven to be awesome in all my coding tasks so far. It’s just the “naming convention” or lack of it which is pretty stupid.

Waiting to see when Claude Sonnet 3.5 “new” long context is available.

Topic		Replies	Views
Claude 3.7 Now Available! Discussions	58	19530	March 2, 2025
Claude 3.7 PLEASE ADD IT Discussions	6	660	February 25, 2025
Is cursor using sonnet-3-5 v2 or the old one Discussions	2	396	January 11, 2025
New Claude 3.5 already worse? Bug Reports	5	1382	October 28, 2024
Sonnet works, but not Haiku Bug Reports	4	54	February 3, 2025

Claude 3.5 Sonnet updated, and Claude 3.5 Haiku soon

Related topics