At this point, they’ve simply provided a formal response, but not an actual solution to the problem — and they still refuse to acknowledge it.
When we make a small request of 50KB and receive a small response of 1–2KB, we’re also charged for some imaginary cache ranging from 500K to 2 million tokens, which inflates the request cost to $1.
As customers, we’re paying for either unlimited or a limited number of fast requests, with slower access afterward — not for speed restrictions that last for days.
From what I understand, they decided to base everything on the API pricing model, which is now many times more expensive — compared to $0.04 per request — and that’s the root of the problem we’re now feeling in our wallets.
Previously, when we paid for the product, we could use it and get value from it. Now, it feels like the product is using us — draining money from us instead.