agent-orchestrator-benchmark/.gitignore at 8e6290e7e0600453cbcd12ee196cd54ee1e90c9f - agent-orchestrator-benchmark - Git with solidaritea

recipe-maintainers/agent-orchestrator-benchmark

Files

mfowler 3bf3316572 docs: FINDINGS.md — benchmark synthesis; track raw results data

Capstone summary of the Builder/Adversary prompt + verification-cadence study:
- adversary EXISTENCE costs ~4.7x (solo 2.8M vs ~13M); cadence is ~token-neutral
- context hygiene is the one clean -22% win; minimal prompts -25% but test less
- deferred review saves nothing (the one comprehensive pass is expensive) + late
- cost is process not product (tokens~duration 0.83, ~commits 0.79, ~LOC -0.04)
All results now in-repo: FINDINGS.md + RESULTS-campaign.md + raw .data + runners.
(deferred N=3, finalizing to N=5.)

2026-06-16 01:53:34 +00:00

7 lines

69 B

Plaintext

Raw Blame History

 # runtime
 .ao-state/
 __pycache__/
 *.pyc
 *.tmp
 RESULTS-harness.md.tmp