Hi, I want to ask whether deepseek v3.1 will be added to cursor soon as it is just being offically annonced.
Evaluation
Category
Benchmark (Metric)
DeepSeek V3.1-NonThinking
DeepSeek V3 0324
DeepSeek V3.1-Thinking
DeepSeek R1 0528
General
MMLU-Redux (EM)
91.8
90.5
93.7
93.4
MMLU-Pro (EM)
83.7
81.2
84.8
85.0
GPQA-Diamond (Pass@1)
74.9
68.4
80.1
81.0
Humanity’s Last Exam (Pass@1)
-
-
15.9
17.7
Search Agent
BrowseComp
-
-
30.0
8.9
BrowseComp_zh
-
-
49.2
35.7
Humanity’s Last Exam (Python + Search)
-
-
29.8
24.8
SimpleQA
-
-
93.4
92.3
Code
LiveCodeBench (2408-2505) (Pass@1)
56.4
43.0
74.8
73.3
Codeforces-Div1 (Rating)
-
-
2091
1930
Aider-Polyglot (Acc.)
68.4
55.1
76.3
71.6
Code Agent
SWE Verified (Agent mode)
66.0
45.4
-
44.6
SWE-bench Multilingual (Agent mode)
54.5
29.3
-
30.5
Terminal-bench (Terminus 1 framework)
31.3
13.3
-
5.7
Math
AIME 2024 (Pass@1)
66.3
59.4
93.1
91.4
AIME 2025 (Pass@1)
49.8
51.3
88.4
87.5
HMMT 2025 (Pass@1)
33.5
29.2
84.2
79.4
Note:
Search agents are evaluated with our internal search framework, which uses a commercial search API + webpage filter + 128K context window. Seach agent results of R1-0528 are evaluated with a pre-defined workflow.
SWE-bench is evaluated with our internal code agent framework.
Nope. That one was using the wrong name by the community. The current “V3.1“ is actually “DeepSeek V3 0324”, released on March 24, 2025, with a context length of ~60K.
We’ll need to wait for the Cursor team to add the real “DeepSeek V3.1” (Aug 2025) with 128K context support.
The new model DeepSeek V3.1 has just been released, and I would love to see it supported in Cursor as soon as possible.
However, I’d like to point out a potential source of confusion: the model DeepSeek V3-0324 is already listed in Cursor as “DeepSeek V3.1.” Since the newly released model is also officially named DeepSeek V3.1, this may cause ambiguity for users.
Could you please consider clarifying the naming in the model list while adding support for the new DeepSeek V3.1? That way, users will clearly understand which version they are using.