Skip to content

Actual models tested? #6

@boxwrench

Description

@boxwrench

Practical question: for people running this with a cheaper model as the task agent (cost reasons), does the teacher need to match the family to get the diagnostic benefit, or does any strong frontier model work equally well as teacher? The MarkTechPost writeup hints at same-model pairing mattering but the two runs used different harnesses so it's hard to tell. Would love to know the actual teacher models used and whether you've tested cross-family pairings.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions