The 671B model probably wasn't used. More likely, a small distilled model was used instead.
Instead of speculating, give proof with prompts and responses as one-shots.
I don’t know. In my opinion, at least based on my very limited testing, it looks like a full-fat R1. There was one issue, though: Cursor seems to have a timeout, probably around 5 minutes, which, combined with slow inference speeds, can lead to terrible results (responses cut off mid-thinking, with no final answer).