Run Benchmarks¶

Benchmarks are configured with YAML files under configs/benchmarks/.

python -m experiments.run_benchmarks --suite all --output-dir results/

To run a pinned suite:

python -m experiments.run_benchmarks --config configs/benchmark_suites/default.yaml --seed 0

CI vs full runs¶

Outputs include results.csv, aggregate.csv, figures, metadata.json, and an HTML summary.