Benchmark Suites
The benchmark runner produces comparable results across estimators.
Run a suite
Or run the canonical suite:
CI vs full benchmarks
- CI smoke benchmarks live in benchmarks/ci_smoke and run via make benchmarks-smoke.
- Full benchmarks live in benchmarks/full and run via make benchmarks-full.
Outputs
- results.csv / aggregate.csv
- report.html
- metadata.json (git SHA, versions, seeds, config path)
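To make the metadata.json contents concrete, here is a minimal sketch of how such a file could be assembled. It is not the runner's actual implementation: the function names (current_git_sha, build_metadata, write_metadata) and the JSON field names are hypothetical, and the real schema may differ. It only assumes that git is available on the PATH and falls back to "unknown" when it is not.

```python
import json
import platform
import subprocess
from pathlib import Path


def current_git_sha() -> str:
    """Return the current commit SHA, or "unknown" outside a git checkout."""
    try:
        out = subprocess.run(
            ["git", "rev-parse", "HEAD"],
            capture_output=True,
            text=True,
            check=True,
        )
        return out.stdout.strip()
    except (OSError, subprocess.CalledProcessError):
        return "unknown"


def build_metadata(seeds, config_path) -> dict:
    """Collect the provenance fields listed above into one dictionary."""
    # Hypothetical field names; the real metadata.json schema may differ.
    return {
        "git_sha": current_git_sha(),
        "versions": {"python": platform.python_version()},
        "seeds": list(seeds),
        "config_path": str(config_path),
    }


def write_metadata(out_dir, seeds, config_path) -> Path:
    """Write metadata.json into out_dir and return its path."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    path = out_dir / "metadata.json"
    path.write_text(json.dumps(build_metadata(seeds, config_path), indent=2))
    return path
```

Calling write_metadata("results/run_001", seeds=[0, 1, 2], config_path="configs/example.yaml") (both paths are placeholders) would place the file next to the other outputs, so each result set carries the information needed to reproduce it.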
Best practices
- Pin seeds and config files.
- Store results with the git SHA and version.
- Use synthetic benchmarks to validate estimator changes.
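As an illustration of the first and last points, the sketch below builds a synthetic dataset with a known true effect from a pinned seed and measures how far an estimator lands from it. Everything here is an assumption made for the example: the helper names, the Gaussian data model, and the difference-in-means estimator stand in for whatever estimators and suites the project actually ships.

```python
import random
import statistics


def make_synthetic(n, true_effect, seed):
    """Generate control and treated samples whose true mean gap is known."""
    rng = random.Random(seed)  # pinned seed: same seed, same data
    control = [rng.gauss(0.0, 1.0) for _ in range(n)]
    treated = [rng.gauss(true_effect, 1.0) for _ in range(n)]
    return control, treated


def diff_in_means(control, treated):
    """Stand-in estimator: difference between the two sample means."""
    return statistics.fmean(treated) - statistics.fmean(control)


def absolute_error(n=5000, true_effect=2.0, seed=0):
    """Absolute gap between the estimate and the known ground truth."""
    control, treated = make_synthetic(n, true_effect, seed)
    return abs(diff_in_means(control, treated) - true_effect)
```

Because the ground truth is known, a change to the estimator can be checked by comparing absolute_error before and after the change on the same seeds. Re-running with an identical seed should reproduce the same number exactly, which is what makes results comparable across commits.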