I can’t bring myself to use Opus instead of Sonnet or Gemini Pro. Especially for Background Agents. It’s just so much more expensive. Do y’all think the performance is worth the cost?
Assume you are paying ~$20/monmth btwn players.
How can I get my AI to game all the costs as a pipe funnel.
(ill leave this seed)
I feel like Claude 4 Opus in MAX mode is the most intelligent / reliable model available now, but it’s too expensive. It depends on the task though.
what if i have an mcp to factoring a toekn marketplace layer
you have a monthly/credits system… farm out my unused credits for funds
what if there is a token marketplace for ai calls… that like txt minutes can be transcribed among sim cards such that I can have it take a bunch of sim call minutes…
yall not being dystopyian cyber punk enough
do you even understand the ayalas in the PH in cell SMS culture much
do you know what the numebnt 1 thing jonny ive did with the iphone
he killed the lanyard hook on all apple devices
that led to the grift of broken ■■■■
its time for task cost as a service
troll alert?
I did try it twice on bigger tasks but unfortunately I haven’t done a side-by-side. $15 per task, yeah… For now I just use Sonnet. If I already know what needs to be done Sonnet seems good enough.
edibles
im not a troll… edibles
EDIT:: edible and ALL_THE_THINGS is fun…
but i have me som ~150 projects
(mayve from moblands voice)
anyway…
just BUILD and get GibsonAI to schema-it out for you…
holy_carp what we can build with an utterance
I resort to Opus if Sonnet gets wedged, or if I want a lot of thinking for high-level problem-solving or research on an issue. I find Sonnet to be actually better at straight-ahead focused engineering.
Even auto mode works very well most of the time if you’re specific enough in your prompts.
Honestly I gotta give auto mode a try. Does it err on the side of saving money?
Auto mode uses a premium model, but it would not use Opus since it requires Max mode.
Auto mode helps when some models/providers are overloaded, as it automatically selects a premium model that has capacity. It does not additionally save money apart of costing 1 request per request
For most tasks Claude 4 Sonnet (regular, at 1 request / request) or other premium models like it are fine and wouldnt go into high cost.
for my work, shader code mostly, opus way way way overthought things, way overengineered, and kinda failed more often than sonnet or g2.5. for my purposes currently seems c4sonnet non max mode for the win. g2.5 very close behind. they alt on sorting where the other fails. also note that the pricing is wow-ouch w opus… at least compared to better (my exp) or equal (being overly fair to opus) answers are avail at 1/10th the price per prompt. I only did a few prompts and granted w many tool calls per (due to it’s own overthinking) and ran up a $50 tab for the ride. at most those prompts would have been $3 using c4-sonnet non max.
not complaining though, I said yes to the experiment, hoping to talk to closer-to-shader-god than c4-sonnet. instead opus seemed like an engineer I once employed. Brilliant coder and mind but would always over engineer to the point where not even god could avoid the house of cards before the thing fully worked. when you ever engineer at every stage you end up w a rocket that will neither function as a rocket non an amusement ride.
With everything code and AI – your milage may vary… greatly w an e+10 at times.