From 02ffbd933657f0b39b0501feabbfa25331aaf1f9 Mon Sep 17 00:00:00 2001 From: autonomic-bot Date: Sat, 13 Jun 2026 19:03:08 +0000 Subject: [PATCH] =?UTF-8?q?status(aotest):=20##=20DONE=20=E2=80=94=20all?= =?UTF-8?q?=205=20DoD=20Adversary-verified=20PASS=20@2026-06-13T19:00Z=20(?= =?UTF-8?q?cdcece9),=20no=20VETO?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Co-Authored-By: Claude Opus 4.8 --- machine-docs/STATUS-aotest.md | 56 +++++++++++++++++++++++++++++------ 1 file changed, 47 insertions(+), 9 deletions(-) diff --git a/machine-docs/STATUS-aotest.md b/machine-docs/STATUS-aotest.md index e739ca9..52671fc 100644 --- a/machine-docs/STATUS-aotest.md +++ b/machine-docs/STATUS-aotest.md @@ -6,10 +6,48 @@ --- -## Gate: aotest CLAIMED, awaiting Adversary +## DONE -The committed test suite is in `tests/` of the deliverable repo. All 5 Definition-of-Done items -are satisfied; cold-verify per the HOW/EXPECTED/WHERE below. +All 5 Definition-of-Done items are Adversary-verified with a fresh PASS (@2026-06-13T19:00Z) on +deliverable commit `cdcece9a9ac64b458103194025f2c22ba830ce15`. No findings, no VETO — the Adversary +cold-cloned to `/tmp` and re-ran the unit suite + both live smokes + isolation check inside +`nix develop` (Python 3.11.11, tmux 3.5a) and independently confirmed every item. Full +cold-verification evidence is in `REVIEW-aotest.md`. + +The `agent-orchestrator` harness now ships a committed test suite under `tests/`: 51 unit tests +(pure logic — config/defaults, kickoff assembly, phase machine, limit/WAITING-UNTIL parsing, +claude+opencode activity detection), isolated live smokes that bring a throwaway project up THROUGH +`agents.py` on the real claude and opencode backends (unique session prefix, dedicated opencode +port `:4097`, full cleanup), and `tests/run.sh` (unit always + smokes when available + isolation +sanity), documented in the README `## Testing` section. + +### WHERE (verification inputs) +- Repo: `https://git.autonomic.zone/recipe-maintainers/agent-orchestrator.git` +- `main` HEAD → `cdcece9a9ac64b458103194025f2c22ba830ce15` (commit `cdcece9`, on top of `289ef07` v0.1.0) +- New files: `tests/test_unit.py`, `tests/smoke_claude.sh`, `tests/smoke_opencode.sh`, + `tests/run.sh`; README updated (file-map line + a new `## Testing` section). +- Backends present on this host: `claude` → `/home/loops/.local/bin/claude` (v2.1.177); + `opencode` → `/home/loops/.local/bin/opencode`; creds at `/srv/cc-ci/.testenv`. + +### HOW to cold-verify (fresh /tmp clone, exactly as the plan specifies) +``` +cd /tmp && rm -rf aotest-cold +git clone https://git.autonomic.zone/recipe-maintainers/agent-orchestrator.git aotest-cold +cd aotest-cold && git rev-parse HEAD # → cdcece9a9ac6... +nix develop -c python3 -m unittest discover -s tests # DoD-1: unit tests +nix develop -c ./tests/run.sh # full suite: unit + both smokes + isolation +``` +Individual smokes (each is also invoked by run.sh): +``` +nix develop -c bash tests/smoke_claude.sh # DoD-2 +nix develop -c bash tests/smoke_opencode.sh # DoD-3 (own server on :4097, ≠ live :4096) +``` +Post-run isolation check (DoD-4): +``` +tmux ls | grep '^aotest-' # EXPECTED: no output (no leftover sessions) +ss -ltn | grep ':4097 ' # EXPECTED: no output (port freed) +tmux ls | grep -E 'cc-ci-orchestrator|cc-ci-watchdog|cc-ci-assistant3' # EXPECTED: all 3 present +``` ### WHERE (verification inputs) - Repo: `https://git.autonomic.zone/recipe-maintainers/agent-orchestrator.git` @@ -65,10 +103,10 @@ Working tree of the deliverable clone is clean and pushed. ## Gate status -| Gate | Status | Claimed | +| Gate | Status | Verified | |---|---|---| -| DoD-1 Unit tests PASS (clean /tmp, nix develop) | CLAIMED | 2026-06-13T18:56Z | -| DoD-2 Claude smoke PASSES via harness | CLAIMED | 2026-06-13T18:56Z | -| DoD-3 opencode smoke PASSES (dedicated port) | CLAIMED | 2026-06-13T18:56Z | -| DoD-4 No leftover aotest-* sessions/ports; cc-ci intact | CLAIMED | 2026-06-13T18:56Z | -| DoD-5 Test suite + runner committed + documented | CLAIMED | 2026-06-13T18:56Z | +| DoD-1 Unit tests PASS (clean /tmp, nix develop) | PASS | 2026-06-13T19:00Z | +| DoD-2 Claude smoke PASSES via harness | PASS | 2026-06-13T19:00Z | +| DoD-3 opencode smoke PASSES (dedicated port) | PASS | 2026-06-13T19:00Z | +| DoD-4 No leftover aotest-* sessions/ports; cc-ci intact | PASS | 2026-06-13T19:00Z | +| DoD-5 Test suite + runner committed + documented | PASS | 2026-06-13T19:00Z |