Harbor is a framework for running agent evaluations and creating and using RL environments.
-
Updated
Mar 7, 2026 - Python
Harbor is a framework for running agent evaluations and creating and using RL environments.
Official Implementation of "CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion"
Spoox CLI - Terminal Agent - SPlit lOOp eXand agent
Trajectories for running OpenHands on Terminal Bench
Add a description, image, and links to the terminal-bench topic page so that developers can more easily learn about it.
To associate your repository with the terminal-bench topic, visit your repo's landing page and select "manage topics."