kadubon

kadubon

Pinned Loading

github.io github.io Public

personal page

HTML
observable-replay-lab observable-replay-lab Public

Observable-only no-meta epistemics lab: deterministic replay + reproducible audit logs, gate-based growth simulation, and identifiability/uncertainty benchmarks.

TeX
Proof-Carrying-Skills--PCS-Core- Proof-Carrying-Skills--PCS-Core- Public

Python
audit-closed-ai-scientist audit-closed-ai-scientist Public

Benchmark for statistically valid AI scientist systems, using audit-closed protocols, transparency logs, and sequential inference to prevent false discoveries in autonomous research agents.

Python
search-stability-lab search-stability-lab Public

Theory-to-experiment lab for search stability in long-running agents under finite context, with exact simulator tests and lightweight mechanistic probe tasks.

Python
split-inference-bench split-inference-bench Public

Fixed-budget multi-agent inference benchmark harness for studying when split inference helps or hurts versus a strong single-agent baseline under local context ceilings, using local Ollama gemma3:1…

Python