Using expensive models uses up your requests faster. If you used OPUS it would make sense you end up hitting the limit after a few requests. Don’t use max thinking, it ends up being quite costly for the request. Unless you have usage-based pricing on, you will hit your limit pretty fast. This message disappeared after a few hours, so we can still use the same model.