Files
cc-ci/machine-docs/STATUS-aotest.md
autonomic-bot c838c9250d
Some checks failed
continuous-integration/drone/push Build is failing
claim(aotest): test suite pushed (deliverable cdcece9) — unit+claude+opencode smokes PASS, isolated, awaiting Adversary
Unit 51/51 PASS, claude smoke PASS, opencode smoke PASS (own :4097), no
leftover aotest-* sessions/ports, cc-ci sessions intact. Cold-verified from
/tmp clone inside nix develop. HOW/EXPECTED/WHERE in STATUS-aotest.md.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-13 18:59:11 +00:00

4.1 KiB

STATUS — phase aotest (Builder)

Phase plan: /srv/cc-ci/cc-ci-plan/plan-phase-aotest-verify.md Deliverable repo: recipe-maintainers/agent-orchestrator on git.autonomic.zone Builder working clone: /home/loops/aoeng/agent-orchestrator (outside the cc-ci tracked tree)


Gate: aotest CLAIMED, awaiting Adversary

The committed test suite is in tests/ of the deliverable repo. All 5 Definition-of-Done items are satisfied; cold-verify per the HOW/EXPECTED/WHERE below.

WHERE (verification inputs)

  • Repo: https://git.autonomic.zone/recipe-maintainers/agent-orchestrator.git
  • main HEAD → cdcece9a9ac64b458103194025f2c22ba830ce15 (commit cdcece9, on top of 289ef07 v0.1.0)
  • New files: tests/test_unit.py, tests/smoke_claude.sh, tests/smoke_opencode.sh, tests/run.sh; README updated (file-map line + a new ## Testing section).
  • Backends present on this host: claude/home/loops/.local/bin/claude (v2.1.177); opencode/home/loops/.local/bin/opencode; creds at /srv/cc-ci/.testenv.

HOW to cold-verify (fresh /tmp clone, exactly as the plan specifies)

cd /tmp && rm -rf aotest-cold
git clone https://git.autonomic.zone/recipe-maintainers/agent-orchestrator.git aotest-cold
cd aotest-cold && git rev-parse HEAD          # → cdcece9a9ac6...
nix develop -c python3 -m unittest discover -s tests    # DoD-1: unit tests
nix develop -c ./tests/run.sh                            # full suite: unit + both smokes + isolation

Individual smokes (each is also invoked by run.sh):

nix develop -c bash tests/smoke_claude.sh      # DoD-2
nix develop -c bash tests/smoke_opencode.sh    # DoD-3 (own server on :4097, ≠ live :4096)

Post-run isolation check (DoD-4):

tmux ls | grep '^aotest-'        # EXPECTED: no output (no leftover sessions)
ss -ltn | grep ':4097 '          # EXPECTED: no output (port freed)
tmux ls | grep -E 'cc-ci-orchestrator|cc-ci-watchdog|cc-ci-assistant3'   # EXPECTED: all 3 present

EXPECTED outcomes (from my cold run @2026-06-13T18:55Z on cdcece9, /tmp clone, nix develop)

  • DoD-1 Unit tests: Ran 51 testsOK, rc=0. Pure logic — no agents spawned, no tmux sessions created. Covers: config load + defaults merge; kickoff-template assembly; phase machine (advance on ## DONE, idempotent sequence-complete, append-a-phase resumes); limit reset-banner parsing; WAITING-UNTIL/stall parsing; claude + opencode activity detectors; the shipped agents.example.toml loads.
  • DoD-2 claude smoke: === CLAUDE BACKEND SMOKE: PASS ===, rc=0 — probe brought up THROUGH agents.py (pane command claude), status shows it RUNNING, down removes it. Isolated prefix aotest-c-<pid>-; trivial probe on claude-haiku-4-5.
  • DoD-3 opencode smoke: === OPENCODE BACKEND SMOKE: PASS ===, rc=0 — dedicated opencode server on :4097 (not 4096); probe attaches THROUGH agents.py (pane command opencode), status RUNNING, down removes it; cleanup kills the server and waits for the port to free. (SKIPs gracefully with rc=0 if opencode/creds are absent — not the case on this host.)
  • DoD-4 isolation: runner prints PASS: no leftover aotest-* tmux sessions and lists cc-ci-orchestrator cc-ci-watchdog cc-ci-assistant3 as present; :4097 free afterwards.
  • DoD-5 committed + documented: the four tests/ files are committed at cdcece9; README ## Testing section documents nix develop -c ./tests/run.sh and what each layer covers.
  • Runner summary line: SUMMARY: unit=PASS claude=PASS opencode=PASS isolation=PASSALL RUN TESTS PASSED (skips are OK), rc=0.

Working tree of the deliverable clone is clean and pushed.


Gate status

Gate Status Claimed
DoD-1 Unit tests PASS (clean /tmp, nix develop) CLAIMED 2026-06-13T18:56Z
DoD-2 Claude smoke PASSES via harness CLAIMED 2026-06-13T18:56Z
DoD-3 opencode smoke PASSES (dedicated port) CLAIMED 2026-06-13T18:56Z
DoD-4 No leftover aotest-* sessions/ports; cc-ci intact CLAIMED 2026-06-13T18:56Z
DoD-5 Test suite + runner committed + documented CLAIMED 2026-06-13T18:56Z