Hi,
Just a thought…
When diffs are suggested, removals are in red and additions are in green.
What if there were a second-opinion dropdown in chat, where you could select a second model and the same prompt would be sent to that model behind the scenes?
You might see the diff suggestions from two different models (say Claude and O1) using different colour schemes on your source file.
If they agree on the edits, that’s useful feedback that the approach is valid. If they disagree, you can just stick with the model that’s heading in the better direction.
I know that would double your token quota usage, but it might be a useful tool occasionally when you want to double-check something important.
That’s a good idea, in theory, to increase accuracy.
Practically speaking, though, part of the problem is that it’s hard to tell from the output alone whether the models actually agree on the edits. i.e. determining “if they agree on the edits” is not trivial.
Two models may both suggest viable approaches to the same problem. You might be able to judge one solution as better than the other (or functionally equivalent) by introducing a third model as a judge (“LLM as judge”), but even that gets increasingly complex once you consider the edits within the broader application.
i.e. a solution may look more elegant when viewed in isolation, but within the context of your app the more complex solution may be the correct one.
Sorry, I wasn’t exactly clear. What I mean is that getting two suggestions from different AIs is great when you can easily evaluate both solutions and know in advance what the best solution is.
However, both AIs might offer novel solutions to the same problem. Except in basic cases, it is hard to know which solution is better or if the solutions are equivalent.
For example, you might ask the AI for code that manages the active state of a menu. Each AI might propose solutions that are functionally identical but look quite different at the code level.
It is very hard to evaluate this programmatically or using a third LLM (“LLM as judge”), except in simple cases.
Said differently: in simple cases, having two LLMs propose solutions for you can work. But in simple cases, this is not particularly useful.
It is more useful in complex cases. However, in complex cases, it is much harder to evaluate which of the LLMs’ solutions is functionally superior—or if the solutions are equivalent.