I used the Cursor Admin API to pull token cost data so that I can forecast my budget under the new Teams pricing that starts on 9/15.
However, the request cost doesn’t match when I check the consumed tokens against the official price list. Could you explain how [totalCents] is calculated for the 3 examples below?
1) Why is the ≤ 200K tier always applied even though [Cache read tokens] is much higher than 200K? Is the tier determined by the model selection or by the way Cursor makes LLM requests? For example, if I choose claude-4-sonnet (200K context window), the >200K pricing will never apply, right?
2) As you said, Cursor bills less than the calculated result. Is that a fixed discount or a dynamic one? What is it based on?
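
For clarity, the verification described above can be sketched roughly as follows. This is only an illustration: the parameter names and the per-million-token rates are placeholders, not actual Admin API fields or the official price list, and the sketch assumes a single flat tier per request.

```python
from dataclasses import dataclass


@dataclass
class Rates:
    """USD per one million tokens. Placeholder values, not official pricing."""
    input: float
    output: float
    cache_write: float
    cache_read: float


def expected_cents(input_tokens: int, output_tokens: int,
                   cache_write_tokens: int, cache_read_tokens: int,
                   rates: Rates) -> float:
    """Price each token category at its list rate and return the sum in cents."""
    usd = (
        input_tokens * rates.input
        + output_tokens * rates.output
        + cache_write_tokens * rates.cache_write
        + cache_read_tokens * rates.cache_read
    ) / 1_000_000
    return usd * 100


def implied_discount(expected: float, billed: float) -> float:
    """Fraction by which the billed amount falls below the expected amount."""
    return 1 - billed / expected


# Hypothetical request: all numbers are made up for illustration.
rates = Rates(input=3.0, output=15.0, cache_write=3.75, cache_read=0.30)
expected = expected_cents(1_000, 2_000, 0, 500_000, rates)
billed = 15.0  # stand-in for the [totalCents] value returned for the request
print(f"expected={expected:.2f} cents, discount={implied_discount(expected, billed):.1%}")
```

Running this per request and comparing the implied discount across requests would show whether the gap is a constant percentage or varies.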