Pinned Loading
-
-
observable-replay-lab
observable-replay-lab PublicObservable-only no-meta epistemics lab: deterministic replay + reproducible audit logs, gate-based growth simulation, and identifiability/uncertainty benchmarks.
TeX
-
-
audit-closed-ai-scientist
audit-closed-ai-scientist PublicBenchmark for statistically valid AI scientist systems, using audit-closed protocols, transparency logs, and sequential inference to prevent false discoveries in autonomous research agents.
Python
-
search-stability-lab
search-stability-lab PublicTheory-to-experiment lab for search stability in long-running agents under finite context, with exact simulator tests and lightweight mechanistic probe tasks.
Python
-
split-inference-bench
split-inference-bench PublicFixed-budget multi-agent inference benchmark harness for studying when split inference helps or hurts versus a strong single-agent baseline under local context ceilings, using local Ollama gemma3:1…
Python
If the problem persists, check the GitHub status page or contact support.