Experimental research framework for running AI benchmarks at scale
benchmarking machine-learning automation elixir otp research ai beam reporting reliability test-harness ensemble-methods statistical-testing experiment-automation llm research-automation experiment-orchestration nshkr-crucible
-
Updated
Dec 1, 2025 - Elixir