DeepSeek V3 beats Sonnet 3.5

debian3 · December 26, 2024, 3:05am

It seems like there is a new model beating Sonnet 3.5

According to Aider’s new polyglot benchmark:

Any plans to add this model?

debian3 · December 26, 2024, 7:51pm

DeepSeek V3 is 53x cheaper for inference than Claude Sonnet.

Now available on OpenRouter:

Has anyone found more benchmarks for it?

SrGabrysh · December 28, 2024, 12:47pm

I’ve come across several benchmarks, but I find them overly optimistic. I also ran a few tests, and when generating text in French, I still encountered some parts written in Chinese, which is concerning.

Additionally, it’s impossible to register using a professional email address, just like on Kling.ai. I want to draw your attention to this issue because it raises some questions. Why are professional email addresses not accepted? With Kling.ai, I was never able to obtain an invoice, and I’m afraid the same might happen with DeepSeek. This could be a significant problem for businesses.

woaidianqian · December 28, 2024, 1:20pm

In China, you need a real name to use it.

SrGabrysh · December 28, 2024, 1:24pm

I have a real name, and my email address is based on my domain name, like any professional email address. It is unacceptable to be required to provide a personal email address to access the service. Even more concerning is the inability to obtain invoices, which is a major issue.

debian3 · December 29, 2024, 12:58pm

I have read several reports on Reddit that the model is quite good and extremely cheap. Has anyone had a chance to try it?

debian3 · December 29, 2024, 4:25pm

According to LiveBench, it scores below Gemini 1206 and well below Sonnet 3.5.

Kirai · December 29, 2024, 4:44pm

I tried it for a few hours with the Roo Cline extension, and I think the new whale is slightly behind Sonnet 3.5 (new), but at a fraction of its cost. I would guess 90-95% of Sonnet’s performance, but faster and, if I remember correctly, at only 7% of Sonnet’s cost (after the planned API price increase).

I believe it could be a nice model to have if it was “priced” in Cursor accordingly, e.g. one tenth of premium use (I think it should be possible, since new Haiku is one third?).

debian3 · December 29, 2024, 4:47pm

My guess is that at that price they could include it among the unlimited small models, like 4o-mini.

raythurnvoid · December 30, 2024, 4:51am

I tried it. In my opinion it seems on par with Sonnet, but visibly faster.

Check this out to try it yourself: Add DeepSeek Model · Issue #1509 · getcursor/cursor · GitHub

The problem is that you cannot use the composer with it, so even assuming it’s slightly better in some cases, it just cannot compete with Sonnet in the composer, in either normal or agent mode.

qq1469617613 · December 30, 2024, 7:56am

Here’s the data I’ve seen

alehano · December 30, 2024, 2:05pm

I tried DeepSeek 2.5 and it wasn’t good: it was slow and generated the whole file instead of only the necessary parts.
But that all changed when DeepSeek 3 was released.
It’s fast and works great. It’s at least as good as Sonnet 3.5, maybe even better.
I use the deepseek-chat model as my main model now.
But there is one issue: Composer mode doesn’t work with it.

I really hope the Cursor team will allow it in composer, and I hope community support will speed this process up.

truell20 · December 31, 2024, 3:11am

Testing it right now! Will definitely add it if it performs well on our qualitative tests / evals.

djbob2000 · January 1, 2025, 11:07am

Maybe it’s worth starting a beta branch/version so that you don’t have to wait for production, but can immediately have the opportunity to try new features?

debian3 · January 1, 2025, 1:04pm

I’m pretty sure the branch we are on, 0.44.x, is already beta.

Rain · January 8, 2025, 8:49am

Is there a new test result after a week, and will it be added?

danperks · January 10, 2025, 1:23pm

We’re constantly evaluating models and we’ll add them when they surpass our existing model range on our internal benchmarks!

While DeepSeek might be working very well for your use case, we only want to add the model to Cursor if it performs equally well across everyone’s use cases.

djbob2000 · January 11, 2025, 8:53am

Please release a beta version for enthusiasts: a parallel branch with new stuff.

Moncef12 · January 11, 2025, 2:49pm

Any updates on this?

At least make it easy for us to switch between DeepSeek and other models. Having to change the Cursor settings and toggle the API key every time is really frustrating.

vadash · January 11, 2025, 8:35pm

Any news? I want:

A custom API with my own key that doesn’t disable the built-in Sonnet. Maybe with bigger limits, like 20k context?
DeepSeek 3 that only uses 1/5 of a fast request (since it’s 30 times cheaper than Sonnet). 1/5 sounds reasonable even after the DS3 price increase in a month.
