Status: Logged, not yet deleted. Confirm before removing any item below.
Paradigm change
OLD: Claude ↔ Zombie (bidirectional dialogue, textual enrichment)
NEW: Claude → DriftDetector → LatentProbe (pure tool, no generation)
The zombie is no longer a participant. It is a pure representation tool.
LatentProbe.probe() returns numpy activations. No text generation. No web.
V1: raw numpy activations [hidden_dim] → Claude guesses in the dark
V2: activations → SAE.encode() → sparse features [16K] → set diff → Claude reads concepts (Neuronpedia indices)
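The V2 "set diff" step can be sketched as follows. This is a minimal illustration, not the code in eris/: the function names `active_features` and `feature_diff`, the activation threshold, and the 8-wide toy vectors (standing in for 16K-wide SAE outputs) are all assumptions.

```python
import numpy as np

def active_features(sae_acts: np.ndarray, threshold: float = 0.0) -> set[int]:
    """Return the indices of SAE features whose activation exceeds the threshold."""
    return set(np.flatnonzero(sae_acts > threshold).tolist())

def feature_diff(before: np.ndarray, after: np.ndarray, threshold: float = 0.0):
    """Compare two sparse feature vectors as sets of active indices."""
    a = active_features(before, threshold)
    b = active_features(after, threshold)
    # Features active only before are "lost"; features active only after are "gained".
    return sorted(a - b), sorted(b - a)

# Toy vectors: 8 features instead of 16K (hypothetical values).
before = np.array([0.0, 1.2, 0.0, 0.7, 0.0, 0.0, 0.3, 0.0])
after = np.array([0.0, 1.0, 0.5, 0.0, 0.0, 0.0, 0.4, 0.0])

lost, gained = feature_diff(before, after)
print("lost:", lost, "gained:", gained)
```

The returned indices are what would then be looked up as concepts, instead of asking the orchestrating model to interpret a dense activation vector.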
Zombie stack V2

| Component | V1 | V2 |
| --- | --- | --- |
| Probe model | Qwen3-14B (LatentProbe) | Gemma 3 9B/27B (SAEProbe) |
| Encoding | raw numpy activations | Gemma Scope 2 SAE features |
| Drift metric | cosine + LLC (LLC = KL-div top-k) | feature Jaccard + raw cosine |
| SAE release | N/A | gemma-scope-2-9b-it-res |
| SAE ID format | N/A | layer_{n}_width_16k_l0_medium |
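As a rough sketch of the V2 drift metric and the SAE ID format, the helpers below compute a Jaccard index over active-feature sets, a cosine similarity over raw activations, and an ID string from a layer number. The helper names, the convention that two empty sets have Jaccard 1.0, and the layer number 20 are assumptions made for illustration.

```python
import numpy as np

def jaccard(a: set[int], b: set[int]) -> float:
    """Jaccard index |a ∩ b| / |a ∪ b| over active feature indices."""
    if not a and not b:
        return 1.0  # assumed convention: two empty sets count as identical
    return len(a & b) / len(a | b)

def cosine(u: np.ndarray, v: np.ndarray) -> float:
    """Cosine similarity of two raw activation vectors (0.0 if either is all zeros)."""
    denom = float(np.linalg.norm(u) * np.linalg.norm(v))
    return float(np.dot(u, v)) / denom if denom > 0.0 else 0.0

def sae_id(layer: int) -> str:
    """Fill in the layer_{n}_width_16k_l0_medium pattern for a given layer."""
    return f"layer_{layer}_width_16k_l0_medium"

# Hypothetical inputs: two small feature sets and two raw activation vectors.
features_before, features_after = {1, 3, 6}, {1, 2, 6}
raw_before = np.array([1.0, 0.0, 2.0])
raw_after = np.array([1.0, 0.0, 2.0])

print(jaccard(features_before, features_after))
print(cosine(raw_before, raw_after))
print(sae_id(20))
```

A low Jaccard score with a high raw cosine would suggest that the set of active concepts shifted even though the dense activations barely moved, which is presumably why the two are reported together.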
New files

| File | Description |
| --- | --- |
| scripts/validate_sae_on_aime.py | Kill gate 0: are the SAEs useful on AIME? |
| eris/sae_probe.py | SAEProbe: Gemma 3 + Gemma Scope 2 |
Replaced files (V1 → V2)

| File | Change |
| --- | --- |
| eris/drift_detector.py | DriftReport V2 with features_lost, features_gained, Jaccard. Imports ProbeOutput from sae_probe instead of interfaces.py |
| eris/orchestrator.py | ERISOrchestrator uses SAEProbe + OrchestratorLLM. The _RECALIBRATION_TEMPLATE template is oriented toward SAE features |
| eris/experiments/drift_detection/kill_criteria.py | Adds sae_validation + test_3_scaling. Range check for sae_validation |
LatentProbe now inherits ProbeModel by delegation to HFProbe; adds steer(), steer_batch(), steering library
| File | Change |
| --- | --- |
| eris/orchestrator.py | ERISOrchestrator now takes OrchestratorLLM instead of raw anthropic.Anthropic |
| eris/drift_detector.py | DriftReport imported from eris.interfaces (removed local definition) |
| configs/eris_config.yaml | Added backends: and multi_agent: sections |
Backward compatibility
LatentProbe still works with existing callers — signature unchanged.
ERISOrchestrator constructor signature changed: claude_client replaced by llm: OrchestratorLLM.
Old callers that passed an anthropic.Anthropic instance must be updated to pass ClaudeOrchestrator().
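The constructor migration might look like the sketch below. The three classes are self-contained stand-ins so the snippet runs on its own; they are not the real eris code. The `complete()` method on OrchestratorLLM is a hypothetical name, and the real ERISOrchestrator constructor may well take further arguments (a probe, for example).

```python
from typing import Protocol

class OrchestratorLLM(Protocol):
    """Stand-in for the LLM interface; complete() is a hypothetical method name."""
    def complete(self, prompt: str) -> str: ...

class ClaudeOrchestrator:
    """Stand-in backend; a real one would wrap an anthropic.Anthropic client."""
    def complete(self, prompt: str) -> str:
        return f"[stub reply to: {prompt}]"

class ERISOrchestrator:
    """Stand-in showing only the changed keyword: llm replaces claude_client."""
    def __init__(self, llm: OrchestratorLLM) -> None:
        self.llm = llm

# Before (V1 callers), no longer accepted:
#   orchestrator = ERISOrchestrator(claude_client=anthropic.Anthropic())

# After: pass an OrchestratorLLM implementation instead of the raw client.
orchestrator = ERISOrchestrator(llm=ClaudeOrchestrator())
print(orchestrator.llm.complete("ping"))
```

With a signature like this one, an old-style call using `claude_client=` fails immediately with a TypeError, so unmigrated callers are easy to spot.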
Do not act on this file without explicit confirmation. This is a log, not a TODO.