cc-ci-orchestrator

Author	SHA1	Message	Date
autonomic-bot	34fc68d4b8	journal: coordination files moved to machine-docs/; memory committed	2026-06-11 20:57:57 +00:00
autonomic-bot	c33b21fe8d	memory: commit session notes (drone P0, weekly-upgrade-queued, mailu/index updates) Per AGENTS.md 'Agent memory lives in memory/ (in this repo)' — memory notes must be committed + pushed like any repo change, not left only in the local ~/.claude symlink target.	2026-06-11 20:56:24 +00:00
autonomic-bot	e144354668	loops: mandate machine-docs/ for ALL coordination files (kickoff/prompts/plan/AGENTS) Recent phases wrote STATUS/BACKLOG/REVIEW/JOURNAL to the repo ROOT because build_kickoff + plan.md's tree used bare filenames, even though the loops' AGENTS.md + INBOX/DECISIONS/DEFERRED conventions already said machine-docs/. Make machine-docs/ the single mandated home everywhere: build_kickoff now emits machine-docs/ paths + an explicit FILE-LOCATION RULE; both loop prompts and plan.md (tree + seed step) updated; orchestrator AGENTS.md documents + enforces it. resolve_state/INBOX handoff already read machine-docs/ first.	2026-06-11 20:56:24 +00:00
autonomic-bot	23b5fc4753	journal: weekly upgrade skipped tonight, queued after phase queue via watchdog hook	2026-06-11 20:50:25 +00:00
autonomic-bot	3fa3178546	watchdog: one-shot /upgrade-all trigger on phase-sequence completion When LOG_DIR/.run-upgrade-on-complete exists, the watchdog launches launch-upgrader.py start the moment the last phase reaches ## DONE (then consumes the flag). Lets the operator replace a scheduled weekly cron run with 'run as soon as the current phase queue finishes' — used tonight: the cc-ci-upgrade-all.timer was stopped (stamp forwarded past tonight's slot) and this flag set instead.	2026-06-11 20:49:54 +00:00
autonomic-bot	0005ce81af	journal: mailu false-completion incident + fix + re-queue	2026-06-11 18:20:54 +00:00
autonomic-bot	4275adc4a5	watchdog: phase_done ignores placeholder '## DONE' sections (skipped mailu) A Builder scaffolded 'STATUS-mailu.md' with a '## DONE / Not yet. Written here only when ...' placeholder section; phase_done's startswith('## DONE') matched it and auto-advanced past mailu without any of its work being done (no recipe PR, no claim, no review). Harden phase_done: a '## DONE' heading counts only when its first non-empty body line is not a placeholder/negation (Not yet / pending / TBD / when all / <...> etc). Verified against all shipped STATUS files (real DONEs still detected; mailu placeholder rejected).	2026-06-11 18:20:21 +00:00
autonomic-bot	211b4e231c	launch: per-phase model override (.loop-model[-adv]-<pid>) Lets a single phase pin a different model, read fresh each role_model call so a phase transition flips it automatically with no watchdog bounce. Operator wants builder on opus for the complex dstamp phase, reverting to sonnet from mailu on: .loop-model-dstamp=opus while base .loop-model stays sonnet.	2026-06-11 16:15:18 +00:00
autonomic-bot	5c260d225c	launch-orchestrator: persisted .orch-model file (ORCH_MODEL > LOOP_MODEL > file) Operator switching models near weekly limits: loops -> sonnet, orchestrator -> opus. Dotfiles updated (.loop-model/.loop-model-adv=sonnet, .orch-model=opus) so watchdog restarts keep the choice.	2026-06-11 16:03:29 +00:00
autonomic-bot	327b9f4efe	plan: phases dstamp, mailu, kuma, drone (queued after bsky) + journal - dstamp: attribute + fix the discourse abra-stamp drift (env change 06-05→ 06-10, harness-neutral, currently pinning discourse at L1); blast-radius sweep; HC1 keeps its teeth - mailu: backupbot v2 labels recipe PR, restore proven on real seeded mail, backup rung earned instead of skipped (operator approved re-entry) - kuma: uptime-kuma first-run wizard + create-a-monitor functional test (Socket.IO or Playwright, real probe evidence, flake-checked) - drone: gitea-dep enrollment, maximal subset per Phase-2 scoping; P0 /etc/timezone host deploy is orchestrator-owned (3bde76f committed)	2026-06-11 11:43:03 +00:00
autonomic-bot	f395247da4	docs(bsky): upstream registry for bluesky-pds — :0.4 is a moving tag now tracking main (0.5.1/node24/index.ts); exact tags through 0.4.219 keep classic index.js layout	2026-06-11 11:35:56 +00:00
autonomic-bot	c89cd6366b	plan: phase 'bsky' — fix bluesky-pds recipe + its screenshot (queued after lvl5) Root-cause the upstream image breakage (Cannot find module /app/index.js, Node v24 under the pinned tag — proven harness/ref-neutral in rcust M2), research upstream releases (persist to cc-ci-plan/upstream/bluesky-pds.md), fix via recipe-mirror PR (NEVER merge — operator does), prove full lifecycle green incl. the new L5 lint rung via !testme at PR head, then verify a real credential-free screenshot on those runs (hook only if needed). Close both DEFERRED bluesky entries; crisp operator handoff in STATUS-bsky.md.	2026-06-11 11:30:49 +00:00
autonomic-bot	0900c439d4	wake prompt: remove temporary limit-system night-watch line (condition met) The 2026-06-11 night watch is over: the limit-wait system was verified end-to-end on a real monthly-spend-limit window (hit -> hold without reboots -> flat probes -> prompt resume on lift), and the three bugs it surfaced are fixed (`5ea17fc`, `969eb60`). Standing supervision continues without the extra check.	2026-06-11 06:55:08 +00:00
autonomic-bot	969eb60df1	watchdog: probe-resumed tick returns True — don't evaluate stale pane after resume The tick whose probe resumed a session was continuing into stall logic with its pre-resume pane capture; a 4h-old WAITING-UNTIL in that stale data got the freshly-resumed adversary kill+rebooted (05:52). Treat probe-resume as handled-this-tick; the next 30s tick sees the live session.	2026-06-11 05:53:44 +00:00
autonomic-bot	5ea17fca21	watchdog: fix limit-probe self-match + scrollback dedupe wedge; plan(lvl5): badge shows level only Night-watch findings (monthly-spend-limit window, ~01:49-04:45): - probe text said 'usage limit' which matches LIMIT_RE, so a submitted probe kept limited_now true forever -> reworded to 'quota window' with a CAUTION note (nudge text must never match LIMIT_RE) - dedupe scanned all 40 captured lines, so once a probe scrolled into the conversation no further probe ever fired (builder/adv frozen at nudges=1, orchestrator probes degraded to hourly riding the wake scroll) -> dedupe now only checks the bottom 8 lines (input area) Core invariant HELD: zero kill+reboots during the limit window. plan(lvl5): operator addition - the top-corner level badge (card, dashboard pill, badge SVG) shows only the level number+color, zero capping info; the inline per-rung table keeps intentional-skip/unverified detail.	2026-06-11 05:52:26 +00:00
autonomic-bot	76aa104dbd	plan(lvl5): N/A split — intentional skip climbs, unintentional (unverified) blocks Operator refinement: only declared/structural skips (not backup-capable, no previous version) let the climb continue; a rung that should have run but didn't (infra error, abra missing, tier abort, timeout) blocks the level at the last verified rung. Every N/A source in derive_rungs gets an explicit classification (DECISIONS.md, adversary-reviewed); unclassifiable defaults to unverified. Unit tests + one synthesized tier-abort run prove the rule.	2026-06-11 01:47:26 +00:00
autonomic-bot	1f7fc7eb39	plan(lvl5): fold in de-capping — level = highest passed rung, N/A skips, fail blocks Operator decision (explicit Q&A 2026-06-11): remove cap/cap_reason/capped entirely. New formula: level = max i with rung_i==pass and all j<i in {pass,na}. N/A no longer stops the climb (the confusing part — e.g. non-backup-capable recipes were stuck at L2); a real FAIL still blocks. Per-rung table + verdict carry the completeness story. Added: de-cap implementation reqs, both-schema rendering, before/after level table for all recipes, N/A-skip proof run, bad-canary designed-levels re-derivation under the new formula.	2026-06-11 01:45:54 +00:00
autonomic-bot	0aab78d3a2	plan: phase 'lvl5' — L5 level rung: abra recipe lint passes on the PR (queued after shot) New top rung after install/upgrade/backup-restore/functional: lint the exact recipe ref under test; gap-caps per ladder semantics; verdict-neutral and time-bounded; mirror-origin R014 plumbing must not pollute recipe lint results (abra.py:109-114); all consumers (results/card/dashboard/badge/docs/tests) updated; old artifacts still render. M1 = adversary-cold-verified implementation pre-merge; M2 = real-CI proof incl. a genuine L5, a genuine lint-capped L4, and 2 drone-path runs. Recipe lint failures -> mirror PRs or DEFERRED, never merged.	2026-06-11 01:39:27 +00:00
autonomic-bot	7c042c2f2a	plan: phase 'shot' — recipe screenshot audit & repair (queued after rcust) Audit every enrolled recipe's CI badge/card screenshot, diagnose defects (plausible null-every-run; ~4.8KB blank-frame SPAs: immich/lasuite-meet/ cryptpad/flaky n8n), fix via harness default-wait improvement first, per-recipe SCREENSHOT hooks second; M1 audit matrix + M2 visually-verified PNGs on fresh real-CI runs (>=2 !testme). Cosmetics-never-block and secret-safety guardrails binding. Also: temporary hourly-wake instruction to verify the new limit-wait system tonight; journal entry.	2026-06-11 01:17:32 +00:00
autonomic-bot	2e1ab8d384	watchdog: hourly orchestrator wake fires even during a limit window Operator request: the hourly supervision prompt should land regardless of limit state, as a fallback that keeps things on track if the limit-state machinery ever breaks. If the limit is genuinely still in force the wake is harmless (the banner just re-prints and limit_tick re-arms); once it lifts, the queued wake doubles as a resume nudge.	2026-06-11 01:00:29 +00:00
autonomic-bot	d6e1a704da	watchdog: parse limit-reset time, never reboot limit-stalled sessions; rename orch session Replace the blind every-300s 'limit appears lifted' nudge (claude) and the opencode-only _maybe_nudge_limit with one unified limit_tick state machine: - parse the reset time from the limit banner (last match wins; stale banners whose time already passed fall back rather than waiting ~a day) - arm a quiet window until reset+45s; parse failure -> flat 5-minute probe loop (operator-specified; not exponential backoff) - while armed, suppress ALL healing: a limit-stalled session is NEVER kill+rebooted (this was the conc-phase churn: claude limit stalls fell through to the generic idle reboot, losing the banner and re-hitting the limit fresh) - at window end send ONE nudge as a self-verifying probe: spinner clears the state; a re-printed banner re-arms from the fresh reset time - dedupe: never stack a probe while our own text is visible in the pane - state persisted per session in LOG_DIR (.limited-<session>) so watchdog restarts keep the window - orchestrator gets the same treatment: limit_tick in heal_orchestrator, a per-signal-tick orch_limit_check, and hourly wakes deferred during limit windows - loud WARNING at 3 probes, then continue flat probes forever Also rename the orchestrator session default cc-ci-orchestrator-vm -> cc-ci-orchestrator (launch.py ORCH_SESSION, launch-orchestrator.py SESSION, docs/scripts references).	2026-06-11 00:55:07 +00:00
autonomic-bot	aefaf17757	plan: recipe-customization restructure — full builder/adversary plan (P1-P6 + real-CI regression sweep gate)	2026-06-10 16:28:09 +00:00
autonomic-bot	a6e177e286	journal: phase conc DONE — concurrency restructure landed, M1+M2 adversary-verified	2026-06-10 13:58:25 +00:00
autonomic-bot	e0c9f23391	feat(launch): ADV_MODEL — per-role model override for the Adversary loop	2026-06-10 04:03:35 +00:00
autonomic-bot	a1b4943da1	plan: adapt concurrency restructure to builder/adversary loop protocol (gates M1/M2, phase-namespaced state)	2026-06-10 03:54:31 +00:00
autonomic-bot	520fb18461	plan: full concurrency restructure implementation plan (builder/adversary handoff)	2026-06-10 03:48:14 +00:00
autonomic-bot	0d169c2a20	plan: concurrency restructure — flock-probe janitor, per-run ABRA_DIR, lock-lifetime chain	2026-06-10 03:41:05 +00:00
autonomic-bot	335ea1d7c1	journal: session wrap — concurrent CI fixed, immich (245) + plausible (247) both GREEN	2026-06-09 23:18:32 +00:00
autonomic-bot	08706c665e	memory: swarm UpdateStatus convergence gotchas (builds 238/241)	2026-06-09 23:14:18 +00:00
autonomic-bot	926e4553b7	journal: immich PR #2 GREEN (build 245, level=4); cc-ci PR #9 merged; plausible unblocked	2026-06-09 23:13:56 +00:00
autonomic-bot	e3e0a9ee80	journal: two harness convergence fixes (UpdateStatus settle + paused-is-settled); immich build 245 in flight	2026-06-09 23:08:59 +00:00
autonomic-bot	1580738c97	journal: concurrent-CI fixes landed on cc-ci main (build 236 green)	2026-06-09 22:02:08 +00:00
autonomic-bot	ec3e0c35dd	journal: orchestrator handover — concurrent-CI fixes + immich/plausible drive	2026-06-09 19:45:21 +00:00
autonomic-bot	542ed0afe3	memory: move agent memory into repo (memory/), note in AGENTS.md Persistent agent memories now live in memory/ in this repo; the Claude auto-memory path is symlinked here so future memories land in the repo and get committed like any other change.	2026-06-09 19:25:20 +00:00
autonomic-bot	330378d30d	ideas: fail-fast on crash-looping deploy + don't let one wedged run starve the CI queue After a live incident: plausible build 220 (ClickHouse exit-1 crash-loop) held the single serial runner for its full 1200s DEPLOY_TIMEOUT, starving immich PR-2's queued builds for ~12min until manually torn down. Logs the two fixes (fail-fast on crash-loop; head-of-line blocking on the serial runner) + the interim mitigations (step-2b dev loop for debugging; SIGINT to free a wedged run). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 16:29:30 +00:00
autonomic-bot	a2c1cb550a	upstream(immich): release-notes sources + DB-pin & VectorChord backup/restore notes	2026-06-09 15:49:20 +00:00
autonomic-bot	c60fc6d056	change(cleanup): reap dev deploys at start+end of /upgrade-all instead of a timer Per operator: drop the hourly cc-ci-reap-dev-deploys systemd timer; instead run the dev-* reaper at the START (Step 0, alongside the orphan sweep) and END (new step 4b) of each /upgrade-all run, with THRESHOLD=0 (the run is quiescent then, so clear all dev-* unconditionally). The reaper keeps its safe default (4h) for ad-hoc use. Step-2b mandatory teardown is unchanged (primary mechanism); this is the backstop. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 15:47:16 +00:00
autonomic-bot	23bba98be4	feat(cleanup): guarantee step-2b dev deploys get reaped - /recipe-upgrade step 2b: teardown is now MANDATORY on every exit path (finally), with a verify-no-leak check; tear down even on failure before reporting. - reap-dev-deploys.sh: safe, age-gated backstop that removes only idle dev-* stacks (never CI per-run stacks, warm-*, infra; an active dev loop stays fresh). - orchestrator: hourly cc-ci-reap-dev-deploys systemd timer runs it against cc-ci, bounding any leaked dev deploy from a crashed/abandoned loop. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 15:42:23 +00:00
autonomic-bot	77ba7ee075	guardrail: upgrader never modifies cc-ci tests/harness unless --with-tests Absolute, mode-gated rule reinforced in /recipe-upgrade (Guardrails + the new step-2b direct-deploy loop where the upgrader has cc-ci host access) and noted as the interim safeguard in IDEAS.md until the deploy loop moves to isolated infra. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 15:32:50 +00:00
autonomic-bot	98276124e5	ideas: isolate the upgrader's direct deploy onto separate infra (can't tamper with tests) The step-2b direct deploy-and-inspect runs on the cc-ci server's own swarm today, so the upgrader holds write access to the host that owns the tests + CI verdict — a trust hole (could hack the tests). Parked idea: a dedicated throwaway test server with scoped creds, so the upgrader can deploy+inspect but not modify the gate. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 15:31:20 +00:00
autonomic-bot	de0baa00b1	feat(upgrade): add direct deploy-and-inspect dev loop (recipe-maintainer style) before CI The upgrader now deploys the WIP recipe directly on cc-ci (abra app deploy --chaos under a dev-<recipe> domain on the local swarm) and inspects live logs (docker service logs) to SEE what the upgrade does, before/alongside the !testme CI gate. ADDITIONAL to — not a replacement for — the 3-attempt !testme verification; it front-loads diagnosis so fewer CI attempts are spent on basics. Always torn down (orphan-sweep is the backstop). /upgrade-all dispatch references the new step 2b. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 15:28:06 +00:00
autonomic-bot	f1c63f1ca0	feat(survey): don't skip recipes on abra tag+digest FATA — check upstream directly abra hard-FATAs on image refs with both a tag and a digest (immich: postgres:14-vectorchord...@sha256:..., valkey:9@sha256:...), aborting the whole recipe survey so immich was silently dropped. Per operator: don't normalize the recipe; catch the failure and check the upstream registry directly. - /upgrade-all box item 4: a tag+digest parse FATA is NOT not-fetchable. Use abra for the images it parses; for the rest, list upstream tags (Docker Hub / ghcr / buildx imagetools) and judge availability (match the variant the app supports, not blindly the max). Upgradeable if abra OR the direct check finds a newer tag. - /recipe-upgrade implement: hand-bump tag+digest pins (abra can't), and re-resolve + re-append the digest for the new tag so the pin is preserved (never drop it). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 15:21:36 +00:00
autonomic-bot	f687174b53	feat(recipe-report): TESTS rename + live binary STATUS column Rename the table's Status column -> TESTS (the CI/test verdict, unchanged content). Add a new STATUS column showing the PR's LIVE state, fetched client-side: 'open' vs a ✓ for any not-open state (merged or closed). The cell is a JS hook (data-repo/data-pr) derived from existing recipe+pr fields; an inline, dependency-free, CSP-safe script GETs the same-origin /pr/<recipe>/<n> proxy (cc-ci nix/modules/reports.nix) on load and every 30s, and degrades to a muted '?' if the proxy/repo is unreachable. Blank cell when a row has no PR. Doc + SKILL updated. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 13:15:02 +00:00
autonomic-bot	eb1439324e	plan: finalize report PR-STATUS column (binary open/✓; proxy in reports.nix; decisions locked) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 12:59:41 +00:00
autonomic-bot	a76aca80e2	plan: Recipe Report TESTS rename + live PR-STATUS column (public mirrors + same-origin proxy) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 12:52:59 +00:00
autonomic-bot	1f52795534	skill(ci-dev-workflow): capture the cc-ci feature-dev flow + adversary plan template Documents the end-to-end workflow used to land the intentional-skips/4-rung-ladder feature: explore harness → branch a local cc-ci clone → implement + unit-verify cold on cc-ci → live full-stage check → open PR (never push main) → independent adversary verdict → squash-merge on PASS → deploy via /root/builder-clone rebuild. Includes the adversary-verify-pr6.md plan as a reusable template. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 03:16:47 +00:00
autonomic-bot	dbafcddb62	feat(upgrade-all): sweep orphans from previous runs at the start of each weekly run Adds sweep-orphans.sh (safe-by-allowlist: removes orphan test stacks, standalone debug containers >30m old, leaked dangling volumes, and reparented docker-run wrappers; spares infra + warm-* canonicals and their retained volumes) and wires it as Step 0 of /upgrade-all so a prior run's leaked stack/container/process can't contend for the shared Swarm or skew the survey. Idempotent; no-op when clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 02:39:43 +00:00
autonomic-bot	d31378b180	feat(recipe-report): restructure page — priority-sorted wire table w/ CVE column, addendum, per-recipe changes New page order: short lead -> the full wire table (sorted by priority-to-address, CVE recipes first, new CVEs count column) -> Addendum (bullets of real special issues, omitted if clean) -> Security Bulletin -> per-recipe "What changed". - recipe-report.py: _table() gains a CVEs column + recipe-name linking; new _changes() helper; render() reordered; docstring SPEC SHAPE updated (cve/addendum/changes added, needs_attention/routine removed). - recipe-report/SKILL.md + example-spec.json: new procedure, spec shape, and gold-standard template (2026-06-05, new format). - launch-report.py: kickoff text reflects the new priority-ordered structure. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-05 17:06:43 +00:00
autonomic-bot	49491fcb90	fix(recipe-report): weekly trigger uses launch-report.py 'fresh'; start kills idle/leftover session A stale cc-ci-report session (from a prior week's run, gone idle) caused this week's launch-report.py 'start' (use-or-create) to leave it and never run a fresh report. Fix: upgrade-all step 6 now calls 'fresh', and start only leaves a session that's actively busy producing a report — an idle/leftover session is killed + restarted. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-05 12:06:46 +00:00
autonomic-bot	f397968f47	upstream(uptime-kuma): release-notes sources	2026-06-05 06:18:33 +00:00

1 2 3 4 5

217 Commits