Hi, I'm Satya. I build tools for agent reliability and evals. See here for more.
making agents reliable
- Pale blue dot
- satyaborg.com
- in/satyaborg
- @satyaborg
Pinned Loading
-
healthbench-physician-disagreement
healthbench-physician-disagreement PublicCode accompanying the paper "Decomposing Physician Disagreement in HealthBench" https://arxiv.org/abs/2602.22758
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.




