A minimal sandbox to run, score, and compare AI agent outputs locally.
python experimental minimal deterministic ai-agents local-tools research-tools agent-evaluation agent-comparsion agent-playground
-
Updated
Dec 19, 2025 - Python