EduSafeBench: open, reproducible reliability benchmarks for AP CSA/AP CSP AI learning assistants (datasets, eval tooling, reports, and external review workflow).
python open-source github-pages education benchmark machine-learning reproducibility ai-safety ap-csp ap-csa
-
Updated
Apr 27, 2026 - Python