When usage based pricing is on, once you reach your plans limit it will use usage based pricing, similar to before.
Here is a short intro into recent plan updates:
- Plans switched from monthly 500 fast requests to no monthly request limit plan with usage based rate limit depending on model and token usage.
- There is a burst limit that allows intensive short period usage but refills slow and a local limit that refills regularly but with smaller amounts.
- Once rate limit is reached usage based pricing takes over to continue uniterrupted usage, model power and token usage is charged at API pricing (+20% markup), the Usage records with $ should have on hover the exact token amounts used.