run-harness-bench.sh now loops VARIANTS × BENCH_REPEATS (default 5), writes each run's row to RESULTS-campaign.md.data immediately (survives interruption), and aggregates per-variant median/mean/min/max/stdev + median duration into RESULTS-campaign.md. Frees each run's repo/transcripts after tallying.
9 lines
123 B
Plaintext
9 lines
123 B
Plaintext
# runtime
|
|
.ao-state/
|
|
__pycache__/
|
|
*.pyc
|
|
*.tmp
|
|
RESULTS-harness.md.tmp
|
|
RESULTS-campaign.md.data
|
|
RESULTS-campaign.md.data.hdr
|