Hi Jack,
This is a confirmed bug. The model names you pass in the /best-of-n CSV are not being applied to each runner. All subagents currently fall back to the same model (typically the parent model) instead of using the distinct models you specified.
Our team is actively working on a fix. No ETA to share yet, but it’s being treated as a high priority.
Other users have reported the same behavior in this thread and this one. Unfortunately there’s no workaround for getting different models on each runner right now.