Can someone explain what "Premium models" are?

I have used Cursor on some internal projects for days, and I never noticed that I have a choice of which models to use.

I have sent hundreds of requests, including easy tasks like creating a simple dialog and complex tasks like writing complicated code for a complicated UI. But I was never asked anything like “Do you want to use fast requests?”

Can somebody explain this to me? Do I have any control over this myself? What would happen if the quota reached 500/500 and I don’t pay extra money?

Thanks!

Hey,

In Cursor, we have two groups of models: “premium” and “non-premium”.

Premium models are the smartest, but the most expensive to run. In Cursor, you get 500 “fast” requests to premium models, which are executed instantly. Once you run out of fast requests, you can still use “slow” requests, which may be put in a queue before they are completed.

If you want more fast requests, you can enable our usage pricing toggle in the dashboard, and pay $0.04 for every premium request outside of your allowance, knowing they are fast and will be executed immediately.
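To make the arithmetic concrete, here is a minimal sketch of the overage cost. It assumes the figures quoted above (500 included fast requests, $0.04 per extra premium request); the function and constant names are made up for illustration and are not part of any Cursor API.

```python
from decimal import Decimal

# Figures quoted in the reply above; the names are hypothetical.
FAST_ALLOWANCE = 500               # included fast premium requests
OVERAGE_PRICE = Decimal("0.04")    # USD per premium request beyond the allowance


def overage_cost(premium_requests_used: int) -> Decimal:
    """Return the extra cost in USD, assuming usage pricing is enabled."""
    extra = max(0, premium_requests_used - FAST_ALLOWANCE)
    return extra * OVERAGE_PRICE


# 450 requests stay inside the allowance, so nothing extra is charged.
print(overage_cost(450))
# 650 requests are 150 over the allowance: 150 * 0.04 = 6.00 USD.
print(overage_cost(650))
```

With usage pricing turned off, the extra requests would not be billed at all; they would fall back to slow requests instead.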

Our non-premium models are unlimited and always fast, and while they are not as smart as our premium models, they can often handle a lot of what you may be working on in Cursor!

Let me know if you have any follow-up questions!

Hi Danperks,

Thanks for the reply. May I ask how I control which requests are fast and which are not? Is it allocated automatically?

Are premium models smarter?

Sometimes I feel the AI agent is a bit dumb, although most of the time it works great.

Could you please provide more details?

Thanks a lot!

Your requests always start out fast and stay fast until you run out of your usage allowance, at which point they become slow.

The premium models are the ones that perform best on benchmarks, so they are usually the best at figuring out what you want them to do and at writing code with as few bugs or issues as possible.

However, they are definitely not immune to these issues, and can still run into them if they lack the context they need or are trying to implement something too complex in one go. In that case, breaking the task down into smaller ones and doing them one by one often works much better.

Hi Danperks,

Thanks very much for your fast reply.

I have been using Cursor heavily to create some complicated projects (involving many files, npm packages, …), and though I don’t remember the exact number, I am sure I’ve sent more than 49 requests to the agent. So how do you measure the count “49”? Just wondering.

Your usage refreshes every month, so it may be that your usage has already reset.

However, each request to the agent composer uses 2 fast requests’ worth of usage, whereas the normal composer chat and CMD+K each use 1 fast request.
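As a rough illustration of how the counter could add up, here is a small sketch using the weights mentioned above (agent composer = 2, normal composer chat and CMD+K = 1 each). The names and the example request counts are invented for illustration; this is not how Cursor's dashboard is actually implemented.

```python
# Fast-request weight per request type, taken from the reply above.
# The dictionary keys and function name are hypothetical.
REQUEST_WEIGHTS = {"agent": 2, "chat": 1, "cmd_k": 1}


def fast_requests_used(counts: dict[str, int]) -> int:
    """Sum the fast-request usage for a mix of request types."""
    return sum(REQUEST_WEIGHTS[kind] * n for kind, n in counts.items())


# Made-up example: 20 agent requests, 5 chat requests, 4 CMD+K requests.
usage = fast_requests_used({"agent": 20, "chat": 5, "cmd_k": 4})
print(usage)  # 20*2 + 5*1 + 4*1 = 49
```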

It may also be that the dashboard is not showing your live usage and will “catch up” to your real usage in a few hours, if you have sent a lot of requests very recently.

I see. Thanks for the information!

Which models are premium and which are non-premium?

All the good models are premium; cheap models that no one uses, like 4o-mini and cursor-small, are not.