`sonic` Ghost Model Discussion

yakovw · August 20, 2025, 2:00pm

That explains a lot. I’m glad many people had a positive experience, but… I tested it again! And again!
I got terrible results.

Did anyone on your team get better results than I did?
Especially regarding compliance, following the rules, and sticking to instructions?

yakovw · August 20, 2025, 2:01pm

I find it hard to follow — who do you actually call a “partner”?
A true business partner? Or even a company you work with, like OpenAI, is also called a partner?

yakovw · August 20, 2025, 2:02pm

Did you check my complaints?
Did it follow the rules?
Did it stick to the instructions without deviating?
Right now, only GPT-5 is good at that.

condor · August 20, 2025, 2:26pm

@yakovw A partner can be anyone you consider a partner. Not just a business or life partner but also someone you work with or collaborate with on a project. In this case, yes it is about providing test access to users to explore a model and its capabilities.

yakovw · August 20, 2025, 3:26pm

@danperks

Can you tell me if your team got a different impression than I did?

yakovw · August 20, 2025, 3:27pm

I think this could cause them embarrassment…
Is this post shared with them, and will they see it?
After all, that’s supposed to be the purpose of the experiment.
It would be better for them to fix or improve it first.

condor · August 20, 2025, 3:30pm

Nothing to worry about (the admin note and reveal). We merged a few threads to make it easier for people to find the discussion.

Artemonim · August 20, 2025, 5:58pm

It’s a pity that the rates are hidden. It is impossible to estimate the quality/cost.

Artemonim · August 20, 2025, 6:02pm

Really fast! It’s only a 3 minutes of wrong work.

Artemonim · August 20, 2025, 6:43pm

Sonic is about as dumb as GPT-5 Mini: both were given a task to make a change to a PowerShell script; looked at its output; didn’t realize they hadn’t completed the task; and reported that the task had been completed.

I gave up and did it through GPT-5-low.

For simple tasks it is worth trying to run Sonic, but you need to check what it did. It can lie in the reports.

DogSkull · August 20, 2025, 7:21pm

The Sonic model called itself Grok’s brother tuned for coding, and I have to admit it’s not that bad. I think it’s somewhere around the Auto level. I gave it a few simple tasks and it did them without any problem, but it struggles more with the complex ones.

Murgur · August 20, 2025, 8:19pm

aldinokemal · August 20, 2025, 10:50pm

ah I see

yakovw · August 21, 2025, 3:21am

Did you do some kind of manipulation?
Did you ask it to write that?
Because I can’t manage to reproduce it at all.
Although unfortunately, I can’t think of another company that could create such a successful code model.

aldinokemal · August 21, 2025, 3:50am

nothing manipulation at all

yakovw · August 21, 2025, 3:55am

Interesting why I can’t manage to get the same result.

aldinokemal · August 21, 2025, 4:12am

Hi here is my full chat

___
I just ask again, and still getting the same result lol

yakovw · August 21, 2025, 4:18am

You convinced me.
In fact, there’s no need to convince me anymore.
I have a specific question that only Grok answers,
and now I checked—
indeed, it’s one hundred percent him.
You can guess what I mean if you’d like.
Good luck, and thank you for taking the time.

haris-musa · August 21, 2025, 6:31am

When you ask it, it confidently says xAI, but it feels very Anthropic

Alphayellowcat · August 21, 2025, 7:18am

The processing speed is very fast, but the ability to handle complex projects is insufficient. It may be necessary to reconstruct the project from the beginning and use standardized documentation to demonstrate its capabilities.

Topic		Replies	Views
Grok free on Cursor - Feedback needed Discussions	71	4953	September 24, 2025
Proprietary openai model, git patch, and sonet Discussions	0	127	April 8, 2024
Sonnet's Reign is Over, We need smarter MODELS Discussions	1	580	January 27, 2025
Model fallback? Bug Reports	6	166	October 25, 2024
A CRACKED Prompt to Drastically Improve Sonnet 3.7 Accuracy Discussions	23	1766	March 17, 2025

`sonic` Ghost Model Discussion

Related topics