Files

autonomic-bot bf71420106 Add cc-ci-upgrader agent: observable one-shot weekly upgrade-run agent

The weekly upgrade run now executes inside a dedicated, remote-control agent
(cc-ci-upgrader) — viewable/steerable at claude.ai/code like the Builder — rather
than buried in headless cron output.

- launch-upgrader.sh: spins up the cc-ci-upgrader tmux session under
  --remote-control with a kickoff that runs /upgrade-all (DEFAULT mode) to
  completion. On finish the agent STOPS and stays idle (does NOT self-terminate)
  so the run + summary stay reviewable in the web UI. `start` = use-or-create:
  leaves an in-flight (busy) run alone, else clears a finished/idle/wedged
  session and runs fresh; `fresh` always restarts. UPGRADER_ARGS passes flags
  (e.g. --dry-run); never --with-tests.
- launch.sh: orchestrator_alive() now also skips the cc-ci-upgrader
  remote-control name, so the upgrader job isn't mistaken for the orchestrator.
- upgrade-all skill: documents it runs as the cc-ci-upgrader agent; the weekly
  cron invokes `launch-upgrader.sh start` (not /upgrade-all inline).
- Phase 5: V8a verifies the agent lifecycle (launch → run to completion → stay
  idle/viewable → next start clears it); V9 stops the verification session.
- cron memory: weekly task = launch-upgrader.sh start at 0 3 * * 6 UTC.
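
The `start` / `fresh` semantics described for launch-upgrader.sh above can be sketched as a small decision function. This is only an illustration, not the real script: the function name `decide_action` and the state labels (`busy`, `finished`, `idle`, `wedged`, `absent`) are assumptions made for the example, and how the real script detects a session's state is not shown here.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the use-or-create decision; NOT the real launch-upgrader.sh.
# decide_action MODE STATE -> prints "leave" or "restart"
#   MODE:  start | fresh
#   STATE: busy | finished | idle | wedged | absent   (assumed labels)
decide_action() {
  local mode="$1" state="$2"
  case "$mode" in
    fresh)
      echo "restart"            # fresh always restarts
      ;;
    start)
      if [ "$state" = "busy" ]; then
        echo "leave"            # an in-flight run is left alone
      else
        echo "restart"          # finished/idle/wedged/absent: clear and run fresh
      fi
      ;;
    *)
      echo "unknown mode: $mode" >&2
      return 2
      ;;
  esac
}
```

Keeping the decision separate from the tmux handling would make the "never interrupt a busy run" rule easy to check in isolation.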

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-05-29 21:12:47 +01:00

prompts

watchdog: reboot idle-wedged loops via self-reported WAITING-UNTIL markers

2026-05-29 19:05:29 +01:00

systemd

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

brief.md

Initial commit: cc-ci autonomous orchestrator

2026-05-26 20:46:28 +01:00

IDEAS.md

rename the opt-in heavy-tests flag: --extra-tests -> --extra (operator 2026-05-29)

2026-05-29 10:36:04 +01:00

kickoff.md

Watchdog handoff signalling: ping the waiting loop on gate-claim / verdict (kill double-idle)

2026-05-27 06:15:25 +01:00

launch-orchestrator.sh

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

launch-upgrader.sh

Add cc-ci-upgrader agent: observable one-shot weekly upgrade-run agent

2026-05-29 21:12:47 +01:00

launch.sh

Add cc-ci-upgrader agent: observable one-shot weekly upgrade-run agent

2026-05-29 21:12:47 +01:00

plan-lasuite-drive-oidc-robustness.md

plan: lasuite-drive recipe-robustness PR sub-plan (collabora healthcheck + perms + lazy OIDC)

2026-05-29 12:58:36 +01:00

plan-lasuite-drive-recipe-pr.md

lasuite-drive PR: scope the repeated-green/3x bar to lasuite-drive (flakiness proof) — NOT the general standard (operator 2026-05-29)

2026-05-29 13:25:10 +01:00

plan-orchestrator-migration.md

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

plan-phase1b-review-lint.md

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

plan-phase1c-full-reproducibility.md

Phase-1c: true verification = Adversary deletes the throwaway VM, creates a fresh one, full install

2026-05-27 16:05:54 +01:00

plan-phase1d-generic-test-suite.md

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

plan-phase1e-harness-corrections.md

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

plan-phase2-recipe-tests.md

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

plan-phase2b-test-performance.md

Add Phase-2b plan: test performance (measure, attribute, improve empirically)

2026-05-27 04:26:27 +01:00

plan-phase2pc-image-cache.md

2pc: drop the pull-through registry cache — single host makes it marginal; keep PC1 prune-policy only

2026-05-29 09:24:56 +01:00

plan-phase2w-warm-canonical-quick.md

plan(2w): WC1.2 — pre-deploy auto-upgrade safety gate (major/manual-migration -> alert, hold)

2026-05-29 00:02:28 +01:00

plan-phase3-results-ux.md

Add Phase-2b plan: test performance (measure, attribute, improve empirically)

2026-05-27 04:26:27 +01:00

plan-phase4-final-review-polish-cleanup.md

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

plan-phase5-verify-upgrade-flow.md

Add cc-ci-upgrader agent: observable one-shot weekly upgrade-run agent

2026-05-29 21:12:47 +01:00

plan-prepull-images.md

plan: per-test image pre-pull sub-plan (warm images before deploy + upgrade; cheap on warm cache)

2026-05-29 14:55:21 +01:00

plan-sso-dep-testing.md

rename the opt-in heavy-tests flag: --extra-tests -> --extra (operator 2026-05-29)

2026-05-29 10:36:04 +01:00

plan.md

watchdog: reboot idle-wedged loops via self-reported WAITING-UNTIL markers

2026-05-29 19:05:29 +01:00

README.md

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

reboot-log.sh

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

REBOOTS.md

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

test-e2e-testme-acceptance.md

orchestrator: reboot-resilience + session auto-resume + full session plan/tooling

2026-05-28 20:28:10 +01:00

README.md

cc-ci-plan

Self-contained handoff package for building the cc-ci Co-op Cloud recipe CI server with two autonomous Claude loops (a Builder and an adversarial Reviewer) running over days.

Start here

1. Read `plan.md`: the full plan and single source of truth (mission, Definition of Done, architecture, milestones, the two-agent coordination protocol, loop discipline).
2. Read `kickoff.md`: how to launch and supervise the loops.
3. Run `./launch.sh start` to bring up both loops + the watchdog.

Files

File	Purpose
`plan.md`	The Phase-1 plan (build the CI server). Agents treat it as their single source of truth.
`plan-phase1c-full-reproducibility.md`	Phase 1c (runs first): make the VM fully reproducible from git (all secrets incl. the wildcard cert in sops, in a separate private `cc-ci-secrets` repo as a flake input; base stays well-parameterized) and do the genuine throwaway-VM live rebuild to close D8 honestly (the "infeasible by design" was overstated).
`plan-phase1b-review-lint.md`	Phase 1b (after 1c): deterministic linting/formatting in CI + a white-box review checklist (real tests, DRY harness, idempotent Nix, no footguns/secrets), ending in a full cold re-verification of all D1–D10 — now covering 1c's refactor.
`plan-phase1d-generic-test-suite.md`	Phase 1d (after 1b, before 2): a generic install/upgrade/backup/restore suite that runs on any recipe with zero config, with a recipe's own `test_<op>.py` overriding or extending the generic (Builder's call) and reusing the generic's deployment — no redeploy, plus optional custom install-steps; recipes needing special setup fail the generic form gracefully. The test-architecture foundation Phase 2 builds on.
`plan-phase1e-harness-corrections.md`	Phase 1e (after 1d, before 2): three operator-review corrections to the shared generic harness — (HC1) upgrade goes previous-release → PR head via `deploy --chaos`; (HC2) repo-local PR code runs only for approved recipes (default = cc-ci overlays + generic only); (HC3) the generic runs by default alongside an overlay, skipped only via explicit opt-out.
`plan-phase2-recipe-tests.md`	Phase 2 (after Phase 1e): build on the corrected generic suite — author the recipe overlays (port recipe-maintainer tests as `test_*.py`) + define custom install steps where a recipe fails generically.
`plan-phase2b-test-performance.md`	Phase 2b (after Phase 2, before Phase 3): empirically measure where test time goes and reduce it (image cache, readiness tuning, dedup deploys, warm infra, concurrency) — no weakened tests.
`plan-phase3-results-ux.md`	Phase 3 (after Phase 2b): beautiful YunoHost-style results — per-run level, image-forward PR comment (badge + summary card + app screenshot), polished dashboard.
`IDEAS.md`	Deferred/future ideas, parked out of current scope.
`brief.md`	The original one-page brief (context only; `plan.md` supersedes it).
`kickoff.md`	Launch & supervision guide.
`launch.sh`	Starts both loops + a watchdog; restarts dead loops; stops on `## DONE`.
`prompts/builder.md`	Builder loop prompt (fed to `claude` by the script).
`prompts/adversary.md`	Adversary loop prompt.

Before launching

- Set the org in `plan.md` (`git.autonomic.zone/recipe-maintainers/cc-ci`) and lock the six proof recipes (§8).
- Ensure the launching shell has: SSH+sudo to cc-ci, the Gitea token, git.autonomic.zone access.
- Preconfigure test-app DNS + TLS (plan §4.0): point a wildcard `*.ci.commoninternet.net` record at a gateway that TLS-passthroughs to cc-ci, and pre-issue the wildcard cert (`*.ci.commoninternet.net` + `ci.commoninternet.net`, via Gandi DNS-01) into `/var/lib/ci-certs/live/` on cc-ci. The agent handles everything else on cc-ci (Traefik file provider → that cert, swarm, routing) and does no ACME; renewal (~90 days) is an out-of-band operator task, so the DNS token never goes to the agent.
- `export CC_CI_REPO=https://git.autonomic.zone/recipe-maintainers/cc-ci.git` so the watchdog can detect `## DONE`.
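
How the watchdog actually looks for `## DONE` is not specified in this README. As a minimal sketch, assuming the marker is a line consisting of exactly `## DONE` in some file of the repo, the check could be as simple as the following. The function name `has_done_marker` and the file it is pointed at are hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; the real watchdog's detection logic may differ.
# has_done_marker FILE -> exit 0 if FILE contains a line that is exactly "## DONE"
has_done_marker() {
  # -q: quiet, -x: match the whole line, -F: treat the pattern as a fixed string
  grep -qxF '## DONE' "$1"
}
```

A watchdog built this way would fetch a copy of `$CC_CI_REPO`, run `has_done_marker` on the relevant file, and tear the loops down only when it returns success. Matching the whole line avoids false positives from prose that merely mentions the marker.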

What "done" means

The loops stop only when all of plan.md §2 (D1–D10) hold and the Adversary has independently re-verified each within 24h. The watchdog then tears the loops down automatically.
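
The 24h freshness condition can be illustrated with a small check over verification timestamps. This is a sketch under stated assumptions: it supposes each of D1–D10 has a recorded verification time as a Unix epoch, and the function names `verified_recently` and `all_verified` are invented for the example. Where the real loops record these times is not described here.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the "re-verified within 24h" rule; not taken from plan.md.
# verified_recently VERIFIED_EPOCH NOW_EPOCH -> exit 0 if the verification
# is not in the future and is at most 24 hours old.
verified_recently() {
  local verified="$1" now="$2"
  local max_age=$((24 * 60 * 60))
  [ "$verified" -le "$now" ] && [ $(( now - verified )) -le "$max_age" ]
}

# all_verified NOW_EPOCH EPOCH... -> exit 0 only if every timestamp is recent.
all_verified() {
  local now="$1"
  shift
  local ts
  for ts in "$@"; do
    verified_recently "$ts" "$now" || return 1
  done
  return 0
}
```

In use, `all_verified "$(date +%s)" ...` would be passed one timestamp per criterion, so a single stale verification is enough to keep the loops running.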
