Is Cursor using the full power of DeepSeek R1?

The 671B model is probably not being used. Instead, it looks like a small distilled model is being used.

Instead of speculating, give proof with prompts and responses as one-shots.

I don’t know. Based on my very limited testing, it looks like the full-fat R1 to me. There was one issue, though: Cursor seems to have a timeout, probably around 5 minutes, which, combined with slow inference speeds, can lead to terrible results (responses cut off mid-thinking, with no final answer).