Understanding Model Usage and Request Limits for Claude 3.5 Sonnet in Cursor

Hey, in Cursor, fast requests are limited (e.g., 500 requests per month), while slow requests are potentially unlimited but depend on the current server load. This means slow requests may get lower priority, so the model’s response can be delayed: usually by a few seconds, but longer during periods of high demand.

You can use only the Claude model if you prefer, but for simple tasks or generating documentation, I’d suggest keeping one or more non-premium models enabled.

Purchasing additional requests increases your fast request limit. Once those are used up, Cursor automatically switches to slow requests.
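
If it helps to picture the fallback, here is a rough Python sketch of the behavior described above. It is only an illustration and not how Cursor actually implements it: `RequestQuota`, `purchase`, and `next_request_tier` are made-up names, and 500 is just the example figure mentioned earlier.

```python
from dataclasses import dataclass


@dataclass
class RequestQuota:
    """Hypothetical model of a monthly fast-request allowance (not Cursor's code)."""

    fast_limit: int = 500  # example figure only; the real limit depends on your plan
    fast_used: int = 0

    def purchase(self, extra: int) -> None:
        """Buying additional requests raises the fast-request limit."""
        if extra < 0:
            raise ValueError("extra must be non-negative")
        self.fast_limit += extra

    def next_request_tier(self) -> str:
        """Use a fast request while any remain, otherwise fall back to slow."""
        if self.fast_used < self.fast_limit:
            self.fast_used += 1
            return "fast"
        # Slow requests are not counted here; their delay depends on server load.
        return "slow"


quota = RequestQuota(fast_limit=2)
tiers = [quota.next_request_tier() for _ in range(3)]  # third call falls back to slow
quota.purchase(1)  # one more fast request becomes available
after_purchase = quota.next_request_tier()
```

The point of the sketch is just that the switch is automatic: nothing fails when the fast allowance runs out, requests simply move to the slow queue until the limit is raised or the allowance resets.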

If you have any other questions, feel free to ask here.