GPT-4.5-preview: each request consumes 50x premium

Yesterday, I used the GPT-4.5 preview for the first time to troubleshoot a problem in my React project. To my surprise, it burned through 50 premium fast requests without providing a solution. :skull: Ultimately, I switched to GPT-4.1 and resolved the issue with a single request. :rofl:

This raises a question: why do we have advanced models that incur such high costs but fail to deliver effective outcomes?

Oh, why would you use 4.5? 4.1 is much better, has a larger context window, and is the latest model in that line.
Not sure why OpenAI uses such confusing naming; 4.5 is actually much older than 4.1.

Some people have adjusted their prompts for 4.5 and can use it. But it's going to be retired by OpenAI.

OpenAI will retire the GPT-4.5-preview model from the API on July 14, 2025. The model was deprecated in April 2025, with developers notified about its upcoming removal.

Thanks for the update. Why do we still have it in Cursor, then? Shouldn't there be a warning about the cost and about it being useless?

Also note that preview models are not stable releases. Some people want to try them to test the model or adjust their prompts.

Hey. I think you should check the usage tab in the dashboard (link from the new docs).
For me, those "many request" scenarios seem to happen when I give prompts that trigger many tool calls (like file scanning). It seems to end an API request and then issue a new one after every tool use, so the request costs add up.

Would that match your observations?
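If that's what is happening, the cost of one prompt would scale with the number of tool calls it triggers. A minimal sketch of that accounting, assuming (hypothetically) one new API request per tool call and a fixed per-request multiplier per model; the multiplier values here are illustrative, not Cursor's actual pricing:

```python
# Hypothetical cost model: each tool call ends the current API request
# and starts a new one, each billed at the model's request multiplier.
# These multipliers are assumptions for illustration only.
MULTIPLIER = {
    "gpt-4.1": 1.0,
    "gpt-4.5-preview": 50.0,
}

def prompt_cost(model: str, tool_calls: int) -> float:
    """Premium requests consumed by one prompt:
    the initial request plus one follow-up per tool call."""
    requests = 1 + tool_calls
    return requests * MULTIPLIER[model]

# One prompt that triggers 4 tool calls:
print(prompt_cost("gpt-4.1", 4))          # 5.0 premium requests
print(prompt_cost("gpt-4.5-preview", 4))  # 250.0 premium requests
```

Under this model, even a zero-tool-call prompt on a 50x model costs 50 premium requests, which would match what the original post describes.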

Only MAX mode before Cursor version 0.50 counted each tool call as a separate request. This changes with 0.50, which is now in beta and being rolled out.

GPT-4.5 wasn't available in MAX mode; it is still not listed as supporting it, and it will be removed once OpenAI no longer provides it.

What I meant was: the dashboard does not say "tool calls". But internally, I think a model can only call one tool per output, so a single human prompt may in fact be executed as many, many requests, and that is what is now visible to us.

What I think is cool is that some requests cost as little as 0.2 requests. In the linked dashboard you can also see token counts when hovering over the cost cell, which is what led me to the conclusions above.