I don’t like to be conspiratorial, but I feel like something should be said about this. Even the 8B model beats GPT-4o…
GPT-4o and Claude 3.5 are the only models that can do Tool Use aka Agent functionality in Cursor. Their business model is to sell 500 fast requests to either of these models for agent functionality for $20.
Groq released their benchmarks here Showing Llama 3.3 70B Tool Use was beating Claude 3.5 and GPT-4o in benchmarks consistently.
Is there an actual reason why Cursor team has not implemented Tool Use for Llama models?
I feel that there may be a conflict of interest. Perhaps they grew too fast, and are locked into their current growth model?