[Solved] Add Claude 3 models

Well, GPT-4 is general purpose; I think coding models have to be trained specifically for those tasks. By the way, have you seen StarCoder2 on Hugging Face? The description says it was trained specifically for coding (on 17 languages, but that doesn't matter much, since models can pick up other syntax if you simply supply a PDF or plain-text cheat sheet of syntax rules in context). I don't know how to use it, though; probably locally, but I'm too lazy (after playing with Stable Diffusion locally and getting a headache from all the customization options and concepts, I'd better leave this to the professionals :laughing:). Let's wait until the Cursor backend gets even better.

How do they benchmark the Haiku model? Could be interesting. If it's better than GPT-4 at that cost, we could try a map-reduce approach: break one request down into many smaller, more specific requests and run them in parallel through that lighter model (each with the 200k context limit, it should be able to give exact, correct factual information based on the latest docs/facts supplied from official sources), and then fact-check and fix the results only once, using the "best" model (say Claude, or whatever the next best choice turns out to be). All of that would result in a very good experience for us in Cursor. Hey, how do we ping the admins? We should file this as a feature request and vote on it :grin: :fire: :rocket: going to be supercool.
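The map-reduce idea above can be sketched in a few lines. This is a minimal, hypothetical outline: `cheap_model` and `best_model` are placeholder stubs standing in for real Haiku/Opus API calls, and the function names are my own, not anything from Cursor or Anthropic.

```python
import concurrent.futures

# Placeholder stubs -- in practice these would be real API calls
# to a cheap model (e.g. Haiku) and a stronger model (e.g. Opus).
def cheap_model(sub_question: str) -> str:
    return f"draft answer for: {sub_question}"

def best_model(prompt: str) -> str:
    return f"reviewed:\n{prompt}"

def map_reduce(question: str, sub_questions: list[str]) -> str:
    # "Map": fan the sub-questions out to the cheap model in parallel.
    with concurrent.futures.ThreadPoolExecutor() as pool:
        drafts = list(pool.map(cheap_model, sub_questions))
    # "Reduce": one pass through the stronger model to
    # fact-check and merge the drafts into a final answer.
    combined = "\n".join(drafts)
    return best_model(f"Question: {question}\nDrafts:\n{combined}")
```

The appeal is cost: many cheap parallel calls plus one expensive review call, instead of sending everything through the expensive model.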

so cool! can you show exactly what you did? i’m really curious to test claude 3 in cursor for some of my work.

I keep seeing on YouTube and Reddit that people report Claude 3 Opus is more accurate than GPT-4 Turbo at coding, and that it isn't lazy, so it outputs the full code every time.

Here are two benchmarks I trust. One shows Claude just behind GPT-4, the other shows it ahead:

https://evalplus.github.io/leaderboard.html


Add Claude 3 models - #11 by dioro :smiley:

By the way, aren't you worried that Cursor will send the full 200k max context, or even more, like your whole codebase and docs, every single time to the API, and that it'll cost a lot of $ by mistake? I'm still worried about understanding what exactly, and how much content, the IDE sends to the AI API endpoint :eyes:. Maybe someone could shed light on that? :pray: (Or does it mimic the OpenAI API limits, so it only sends the max amount allowed by the OpenAI endpoint? In that case Claude's 200k simply won't be used even if you connect OpenRouter :man_shrugging:)
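One cheap way to sanity-check the cost worry above is a back-of-the-envelope estimate. The ~4 characters per token figure is only a rough heuristic for English-ish text (the provider's real tokenizer, e.g. `tiktoken`, gives exact counts), and the price-per-million-tokens is a parameter you'd fill in from the provider's pricing page:

```python
def estimate_tokens(text: str) -> int:
    # Rough heuristic: ~4 characters per token for English-ish text.
    # Use the provider's actual tokenizer for exact counts.
    return max(1, len(text) // 4)

def estimate_cost_usd(text: str, usd_per_million_tokens: float) -> float:
    # Input-side cost only; output tokens are billed separately.
    return estimate_tokens(text) / 1_000_000 * usd_per_million_tokens
```

So if the IDE really did ship a 200k-token context on every request, you could multiply that by the per-million-token input price yourself and see how fast it adds up.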

No idea; I'm personally not gonna mess with the API for that reason. I just hope the Cursor team manages to add the new models to the Cursor Pro plan, or an even higher tier if necessary. I would gladly pay $50+ for a GPT-4-level model with 200k context.


Cursor team, are there any updates you can give regarding the Claude 3 models?

  • Have you done any benchmarks on your side? Thoughts on how the models performed compared to GPT-4/3?
  • Do you have an ETA for when they might be available? A few days / a week / 2 weeks…
  • Thoughts on higher tier plans for bigger context models?

Thanks


lol this is a good follow up but it’s weekend time :fox_face: :clinking_glasses: :smiley_cat:
Let's see what happens next week. Very curious to try Haiku on my tasks as well :heavy_check_mark: probably the only model whose pricing won't bankrupt us. I'm trying Sonnet on OpenRouter today; it's pretty good, and feels like GPT-4 at understanding longer prompts and tasks in those longer, complicated requests.


You can use it via the API; here are my settings:
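For anyone who can't see the screenshot: OpenRouter exposes an OpenAI-compatible chat-completions endpoint, so the request shape looks like the sketch below. This is only an illustration, not Cursor's actual internals; the API key is a placeholder and the model id should match whatever OpenRouter currently lists for Claude 3:

```python
import json

BASE_URL = "https://openrouter.ai/api/v1/chat/completions"

def build_request(prompt: str, model: str = "anthropic/claude-3-opus"):
    # Same shape as an OpenAI chat-completions request, which is
    # why swapping the base URL and API key in the settings works.
    headers = {
        "Authorization": "Bearer <OPENROUTER_API_KEY>",  # placeholder
        "Content-Type": "application/json",
    }
    body = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return headers, json.dumps(body)
```

In Cursor's settings this amounts to pointing the OpenAI base URL at OpenRouter and pasting your OpenRouter key where the OpenAI key normally goes.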


Interesting. Could something be wrong in my settings? I just get a blank response in the chat.

Everything looks correct, but what about your balance on OpenRouter? Are there enough funds?


Same problem, funds: more than a few dollars.

Just to be clear: when you do the setup shown above, you put your OpenRouter API key in place of the OpenAI API key?

Yes exactly

Does it work here: Playground | OpenRouter

Does Claude allow setting a budget like OAI does? As in, I'd rather run out of my pre-paid credits than sell a kidney because I asked a question in Cursor in the wrong code base and now owe Claude two bazillion dollars.

Yes.

Maybe Claude shouldn't be used in chat, but rather for a limited number of large-context, whole-codebase reasoning calls, when you really need some very holistic reasoning?

See https://twitter.com/itsandrewgao/status/1766891500921671850 for inspiration.

They don't use GPT-4's maximum context window (I think they limit it to around 10k tokens), which is plenty for a lot of situations. The only time I wish the context were bigger is when you pass a @Doc + @Codebase.