Skip to content

ctxyao/chrishohoho

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 

Repository files navigation

Yucheng "Chris" Yao

Research engineering for AI-agent evaluation, context health, and reliable AI workflows.

I build local-first evaluation artifacts that make AI workflow inputs easier to inspect before an agent acts on them. My current focus is CtxGov: Agent Context Health Evaluation for AI Workflows.

Current Artifacts

Research Interests

  • LLM and agent evaluation
  • Context engineering and context-health checks
  • Model behavior measurement
  • Reproducible evaluation infrastructure
  • AI safety evaluation artifacts with explicit limitations

Boundaries

CtxGov is not a security scanner, universal benchmark, provider compatibility matrix, hosted runtime, or automatic remediation agent. Current eval materials are public v0.2 scaffold data plus a v0.3 review-ready packet until independently reviewed trace-derived labels, hard negatives, and administered holdout results exist.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors