EqualifyEverything / equalify-reflow

docs(benchmarks): publish Deliverable 3 pilot benchmark (#126)
Establishes docs/reference/benchmarks/ as the home for published benchmark runs. Adds the v0.1.0-beta.6 pilot (30 UIC documents, 175 pages, $12.99, 235 issues graded by severity) as the first entry: report, manifest, aggregate summary, flat per-document CSV, and 30 qualitative review notes. Generalizes scripts/batch_run.py to accept either a manifest file (reproducible corpora) or a directory glob (ad-hoc runs); drops the hardcoded UIC-specific paths. Adds docs/how-to/run-the-benchmark.md for reproduction. Phase 2 improvement roadmap links to #82 rather than duplicating it. Closes #80 Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Dylan Isaac Dylan Isaac committed on Apr 22, 2026, 06:21 PM
Showing 38 changed files +2489 additions -95 deletions
Browse files at this commit →