Skip to content

Latest commit

 

History

History
58 lines (42 loc) · 2.46 KB

File metadata and controls

58 lines (42 loc) · 2.46 KB
title status authors based_on category source tags
Oracle and Worker Multi-Model Approach
emerging
Nikola Balic (@nibzard)
Sourcegraph Team
Orchestration & Control
multi-model
cost-optimization
strategic-reasoning
architecture

Problem

Relying on a single AI model creates a trade-off between capability and cost. High-performance models are expensive for routine tasks, while cost-effective models may lack the reasoning power for complex problems.

Solution

Implement a two-tier system with specialized roles:

  • The Worker (Claude Sonnet 4): Fast, capable, and cost-effective agent handling bulk tool use and code generation
  • The Oracle (OpenAI o3 / Gemini 2.5 Pro): Powerful, expensive model reserved for high-level reasoning, architectural planning, and debugging complex issues

The Worker can explicitly request Oracle consultation when stuck or needing better strategy. The Oracle reviews the Worker's approach and suggests course corrections without polluting the main agent's context.

graph TD
    A[User Request] --> B[Worker Agent]
    B --> C{Need Oracle?}
    C -->|Yes| D[Oracle Consultation]
    C -->|No| E[Direct Execution]
    D --> F[Strategic Guidance]
    F --> G[Worker Implements]
    G --> H[Task Complete]
    E --> H
Loading

Evidence

  • Evidence Grade: emerging
  • Most Valuable Findings: Validated in production at Sourcegraph (~90% cost reduction vs. all-frontier); academic foundation from model cascading research (FrugalGPT: up to 98% cost reduction with quality parity)
  • Unverified: Optimal Oracle invocation thresholds remain application-specific

How to use it

Development environments, complex coding tasks, architectural decisions, debugging sessions where initial approaches fail. Also known in literature as model cascading, weak-strong model routing, or hierarchical model systems.

Trade-offs

  • Pros: Cost-efficient use of frontier models; sophisticated problem-solving; specialized AI team approach
  • Cons: Additional orchestration complexity; potential latency from model switching; requires careful Oracle invocation logic

References