Can anybody explain how to get fast requests? Have I missed something in the configuration?
Hey, which models are you using? Fast premium models include GPT-4, GPT-4 Turbo, GPT-4o, Claude 3.5 Sonnet, and Claude 3.5 Haiku.
No, that’s not the case. Non-premium models are fast, yes, but you’re not using what you paid for.