[Feature Request] Pro Subscription support for Haiku due to RepoQA benchmark

ssmits · April 28, 2024, 12:18pm

Hi Cursor team,

After the creators of the RepoQA benchmark implemented a new evaluation method, they found that Haiku performs exceptionally well. I was wondering if it would be possible to integrate Haiku with a context size of around 20-50k and allow for 10 times the number of requests compared to Opus (i.e., 10 Haiku requests for every 1 Opus request). This approach should still be more cost-effective overall.

Essentially, this would equate to 5,000 Haiku requests, as Haiku is 60 times cheaper than Opus. Even with a 50k context size, it would remain cheaper, and for 20k, it would be significantly more affordable.

You can find more information about the evaluation results here: RepoQA for Evaluating Long-Context Code Understanding

fun_strange · April 28, 2024, 7:46pm

Isn’t it already possible in the long context chat? Both Sonnet and Haiku are available there with 200k context length, and Opus is also available there via your own API using this method:

Or are you talking about something else?

ssmits · April 28, 2024, 11:36pm

Yes, it is generous of them to add more Haiku (20) / Sonnet (50) calls. However, I would like there to be an intermediate level, as I stated, with a context size of ideally 50k and 20k at minimum. Empirically, quality deteriorates quite a bit as the context grows past 100k toward 200k (even though the model passes the needle-in-a-haystack tests).
I don’t mind 1 Opus call being equal to 5 Haiku calls even though Haiku is 60x cheaper (which makes it 12x cheaper if the context goes from 10k to 50k).
This is quite far-fetched, but it would solve a big use case for me: quickly iterating on bugs with a large context window and a high chance of fixing them.
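
For anyone who wants to check the arithmetic, here is a rough sketch in Python. It takes the 60x price ratio and the 10k baseline context from the posts above as given, and it assumes that cost scales linearly with the number of context tokens (output tokens are ignored). The function names are made up for illustration and are not part of any Cursor or Anthropic API.

```python
# Price ratio quoted in this thread: Opus costs about 60x what Haiku does per token.
OPUS_TO_HAIKU_PRICE_RATIO = 60

# Baseline Opus context used in the thread's comparison (assumption: 10k tokens).
BASELINE_OPUS_CONTEXT = 10_000


def effective_savings(haiku_context, opus_context=BASELINE_OPUS_CONTEXT):
    """How many times cheaper one Haiku request is than one Opus request,
    assuming cost grows linearly with context size."""
    return OPUS_TO_HAIKU_PRICE_RATIO * opus_context / haiku_context


def haiku_batch_cost(haiku_requests, haiku_context, opus_context=BASELINE_OPUS_CONTEXT):
    """Cost of a batch of Haiku requests, measured in units of one Opus request."""
    return haiku_requests / effective_savings(haiku_context, opus_context)


print(effective_savings(50_000))  # 12.0, the "12x cheaper" figure
print(effective_savings(20_000))  # 30.0

# 5 Haiku calls at 50k context vs. 1 Opus call at 10k context
print(haiku_batch_cost(5, 50_000) < 1)   # True: still cheaper than one Opus call
# 10 Haiku calls at 50k context vs. 1 Opus call at 10k context
print(haiku_batch_cost(10, 50_000) < 1)  # True under these assumptions
```

Under these assumptions, both the 5-for-1 and the 10-for-1 exchange rates stay below the cost of a single Opus call, even at 50k context.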
