cc-ci-orchestrator

Author	SHA1	Message	Date
autonomic-bot	027fdbd161	fix(upgrade skills): run abra over a pseudo-TTY (fixes FATA inappropriate ioctl) abra over plain 'ssh cc-ci abra ...' has no TTY -> FATA 'inappropriate ioctl for device' (the abra error). The working harness (runner/harness/abra.py) wraps abra in util-linux 'script' for a pseudo-TTY + passes -n. Apply the same in the recipe-upgrade and upgrade-all skills: every abra call becomes ssh cc-ci 'script -qec "abra <args> -n" /dev/null'. Confirmed: abra server ls FATAs plain, works pty-wrapped. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 04:06:38 +00:00
autonomic-bot	ad7ba8375a	fix(recipe-upgrade): extend open upgrade PRs by commit-on-top, no force-push Instead of force-pushing HEAD onto the existing PR branch (history rewrite), add a commit ON TOP of the branch tip (fast-forward) when it already exists, so the PR's history is preserved and it re-tests. Fresh branches still push normally. The only remaining force-push is the mirror-main->upstream sync (intentional mirroring), never a PR branch. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 01:58:10 +00:00
autonomic-bot	5f814307ad	fix(recipe-upgrade): default to extending an existing open upgrade PR, not a parallel one When an open upgrade PR already exists for a recipe (branch upgrade-*), push the new work onto ITS branch and update+re-test that PR — one evolving upgrade PR per recipe instead of spawning a second parallel PR. Only open a fresh upgrade-<version> PR when none exists. Unrelated open PRs (e.g. backup fixes) are still never touched; merged-upstream PRs still close. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 01:54:58 +00:00
autonomic-bot	19fda8d2b8	fix(recipe-upgrade): stop auto-closing superseded/unrelated open PRs Per operator: opening a new upgrade PR should stack ON TOP of any other still-open PRs, not close them. Only PRs already merged into upstream main are closed (merging them is a no-op). This prevents the phase-7 incident where an unrelated open ghost PR was auto-closed as 'superseded'. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 00:07:05 +00:00
autonomic-bot	1f96eba577	fix(ci-test-review): resolve PR ref to commit sha in verify-pr.sh Resolve the recipe branch/ref to its head commit sha via the Gitea API before invoking the cold full-suite run, so the upgrade tier deploys the exact PR head. From the phase-5 upgrade-flow verification. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 21:46:29 +00:00
autonomic-bot	9574972f1d	feat(skill): add Hetzner server recovery playbook	2026-06-01 13:48:23 +00:00
autonomic-bot	a896ee9476	fix(testme-on-pr): wait for a fresh cc-ci status update	2026-06-01 13:03:41 +00:00
autonomic-bot	2486b7c368	fix(ci-test-review): resolve remote cc-ci worktree	2026-06-01 13:03:41 +00:00
autonomic-bot	df6ca04611	feat(recipe-upgrade): add stale-test PR helpers	2026-06-01 03:48:05 +00:00
autonomic-bot	6910b197d0	fix(testme-on-pr): read cc-ci/testme context URL not first-status URL When multiple commit statuses exist (e.g. an Adversary probe + the real run), the first status in the array may not be the cc-ci run. Filter by context 'cc-ci/testme' to get the correct Drone build URL.	2026-05-31 14:00:02 +00:00
autonomic-bot	0df57c6d0c	fix(open-recipe-pr): replace python3 with jq (cc-ci has jq, not python3)	2026-05-31 13:35:07 +00:00
autonomic-bot	1c2be64124	Phase 5 §4: install weekly upgrade cron at completion+1h and verify first kickoff Operator: when the final phase completes, install the weekly cron anchored to actual completion — first run ~1h after the build finishes, weekly from then on (supersedes the fixed "Sat 03:00 UTC" placeholder). - plan-phase5 §4: orchestrator computes T0=now+1h, installs a weekly job at T0's DOW+HH:MM running launch-upgrader.sh start; cron env needs claude on PATH + tmux + claude.ai login (mirror cc-ci-loops.service). VERIFY the first kickoff: cheap --dry-run pre-check, then confirm the real T0 fire launched the cc-ci-upgrader agent (status RUNNING, ran /upgrade-all, summary produced); record schedule + verified kickoff in DECISIONS.md. - upgrade-all skill Cron section + cron memory updated to the completion-anchored schedule + first-kickoff verification. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 21:21:20 +01:00
autonomic-bot	bf71420106	Add cc-ci-upgrader agent: observable one-shot weekly upgrade-run agent The weekly upgrade run now executes inside a dedicated, remote-control agent (cc-ci-upgrader) — viewable/steerable at claude.ai/code like the Builder — rather than buried in headless cron output. - launch-upgrader.sh: spins up the cc-ci-upgrader tmux session under --remote-control with a kickoff that runs /upgrade-all (DEFAULT mode) to completion. On finish the agent STOPS and stays idle (does NOT self-terminate) so the run + summary stay reviewable in the web UI. `start` = use-or-create: leaves an in-flight (busy) run alone, else clears a finished/idle/wedged session and runs fresh; `fresh` always restarts. UPGRADER_ARGS passes flags (e.g. --dry-run); never --with-tests. - launch.sh: orchestrator_alive() now also skips the cc-ci-upgrader remote-control name, so the upgrader job isn't mistaken for the orchestrator. - upgrade-all skill: documents it runs as the cc-ci-upgrader agent; the weekly cron invokes `launch-upgrader.sh start` (not /upgrade-all inline). - Phase 5: V8a verifies the agent lifecycle (launch → run to completion → stay idle/viewable → next start clears it); V9 stops the verification session. - cron memory: weekly task = launch-upgrader.sh start at 0 3 * * 6 UTC. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 21:12:47 +01:00
autonomic-bot	4a1da1dd60	recipe-upgrade: !testme-on-PR verification + make test PRs opt-in (--with-tests) Per operator: - Verify via `!testme` posted ON the recipe PR (the real CI path) so results are viewable in the PR; iterate up to 3 !testme runs (fix a real regression + re-test). New helper testme-on-pr.sh posts !testme and polls the PR head commit status for the verdict (POST=0 to keep polling without re-triggering). - Test updates are now OPT-IN via `--with-tests`. DEFAULT: recipe PR only using existing tests; if a test fails and is genuinely stale, leave an explanatory COMMENT on the PR (upgrade looks correct; re-run --with-tests to update tests) and do NOT touch any test. --with-tests keeps the verified cc-ci test-update PR path (verified via the branch-checkout harness run, since !testme uses prod tests). - upgrade-all (weekly cron) calls the DEFAULT — never auto-edits tests unattended; surfaces "tests look stale" PRs in the summary for the operator to opt in per-recipe. - New RESULT: SUCCESS-PENDING-TESTS for the recipe-green-but-test-stale default case. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 20:18:59 +01:00
autonomic-bot	62b7af7a97	recipe-upgrade: reconcile mirror to upstream main + close merged/superseded PRs Per operator: an open mirror PR must mean "genuinely still open against true current upstream main". On every run the recipe-upgrade flow now: - force-syncs the recipe-maintainers/<recipe> mirror `main` to be IDENTICAL to upstream main (origin/main of the abra checkout = coopcloud); - closes any open mirror PR whose changes are already in upstream main (merged upstream, no-op merge detected via `git merge-tree` vs main's tree) — even when the recipe is up to date (new `--reconcile-only` mode, run in step 1); - when opening a new upgrade PR, closes any other still-open PR for that recipe (superseded) and opens the new one IN ITS PLACE; same-version re-runs just update the existing same-branch PR. open-recipe-pr.sh gains the --reconcile-only mode + the close logic (with an auto-close comment naming the reason). upgrade-all reconciles every candidate's mirror during the survey so merged PRs are closed fleet-wide. Still never merges. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 17:32:34 +01:00
autonomic-bot	a8b4b4c39e	upgrade-all: pin weekly slot (Sat 03:00 UTC) + defer activation until cc-ci is built Operator: don't run the weekly upgrade-all while the build loops are still constructing cc-ci (shared-host contention). Activate the Sat 03:00 UTC (0 3 * * 6) cron only once the build is complete; on-demand until then. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 17:24:40 +01:00
autonomic-bot	db31c08d6a	Add /recipe-upgrade + /upgrade-all skills (cc-ci-gated upgrades, never merge) Per-recipe and fleet-wide upgrade skills modelled on recipe-maintainer's recipe-upgrade-full / recipe-upgrade-cron-all, but gated by the cc-ci CI server and inheriting ci-test-review's create+verify+never-merge discipline. - recipe-upgrade/: plan (release notes, breaking changes) -> implement (abra recipe upgrade + version bump + config, lint) -> open the recipe PR -> VERIFY green on cc-ci (full suite cold against the PR head via verify-pr.sh). If the upgrade is correct but a cc-ci TEST went stale, also update the test, verify it, and open a second PR to recipe-maintainers/cc-ci. Never merges; never weakens a test; prefers a recipe-only PR. Emits a parseable RESULT line. + open-recipe-pr.sh: adapted recipe-create-pr; runs on cc-ci (has the recipe checkout + bot token), creds passed from the orchestrator .testenv; force-syncs the mirror main so the PR diff is exactly the upgrade. - upgrade-all/: weekly fan-out — enumerate enrolled recipes, survey upgrades, run /recipe-upgrade per upgradeable recipe via subagent (sequential default, --parallel / --dry-run), collect into one PR-list summary. Coordination + single-writer + shared-Swarm-teardown guardrails; built for a weekly cron. - ci-test-review/verify-pr.sh: pass SRC (recipe-maintainers/<recipe>) alongside REF so the harness clones the mirror PR head correctly (its real contract). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 17:19:20 +01:00
autonomic-bot	cbe1406bce	ci-test-review: close the loop — author + open + cc-ci-verify fix PRs (never merge) Per operator: the skill should not just propose, it should CREATE the fix PR (recipe repo or cc-ci repo) and VERIFY it green on its own CI server — but not merge. It drives cc-ci like the loops do. - SKILL.md: diagnose+classify (recipe vs CI-server) -> author the fix + open a PR (recipe-create-pr for recipe PRs; Gitea API for cc-ci PRs, dedicated branch in a separate clone, single-writer safe) -> VERIFY on cc-ci (full suite cold against the PR head = the !testme dogfood path) -> report a verified, ready-to-merge PR. Never merges; never weakens a test; flake != bug. General bar = one cold green; repeated-green (REPEAT=3) only for a known-flaky recipe. Adds coordination/single-writer guardrails (shared Swarm is stateful; tear down deploys; never push main or touch the loops' clones). - verify-pr.sh: deterministic recipe-PR gate — RECIPE + REF -> cold full suite on cc-ci, green iff every repeat exits 0. CI-server-PR verification stays bespoke (branch checkout + rebuild + regression sample) per SKILL.md. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 17:04:35 +01:00
autonomic-bot	2530845e50	orchestrator: add /ci-test-review skill (in THIS repo) + drop Phase 3r from loops queue The on-demand AI review layer is now an orchestration-repo skill built directly by the orchestrator, NOT a loops phase in the cc-ci product repo: - .claude/skills/ci-test-review/{SKILL.md,run-all-recipes.sh}: runs the real cc-ci harness across all enrolled recipes (deterministic, AI-free execution), then AI diagnoses each failure and classifies it as needing a recipe PR / a CI-server PR / a stale-test update — or reports "ALL PASSED, recipes + tests up to date". Proposes PRs; never decides pass/fail; never auto-merges. - .gitignore: track .claude/skills/ (shareable) while still ignoring local claude session state (locks, history) under .claude/. - launch.sh: remove Phase 3r from PHASES_SPEC; loops sequence back to 1c 1b 1d 1e 2w 2pc 2 2b 3 4. Deleted plan-phase3r (superseded by the skill). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-29 16:57:26 +01:00
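
Note on 027fdbd161: the pseudo-TTY wrapping described in that commit can be sketched as a small shell helper. This is a minimal sketch, not the code in the skills or in runner/harness/abra.py. The function names `abra_remote_cmd` and `abra_pty` are hypothetical, and it assumes util-linux `script` is installed on the cc-ci host and that `cc-ci` resolves as an ssh host alias.

```shell
#!/usr/bin/env bash
# Sketch only: abra_remote_cmd and abra_pty are hypothetical names,
# not helpers taken from the repository.

# Build the remote command line described in 027fdbd161:
#   script -qec "abra <args> -n" /dev/null
# "$*" joins the arguments with single spaces, so arguments that
# contain spaces or quotes are NOT handled by this sketch.
abra_remote_cmd() {
  printf 'script -qec "abra %s -n" /dev/null' "$*"
}

# Run the wrapped command on the cc-ci host. ssh hands the string to
# the remote shell, where 'script' allocates the pseudo-TTY for abra.
abra_pty() {
  ssh cc-ci "$(abra_remote_cmd "$@")"
}
```

Calling `abra_remote_cmd server ls` only prints the command string, which makes the quoting easy to inspect before `abra_pty server ls` sends it over ssh.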

19 Commits
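
Note on 1c2be64124: the completion-anchored schedule (T0 = now + 1h, then weekly at T0's day of week and HH:MM) comes down to a short date calculation. The sketch below is an illustration under stated assumptions, not the orchestrator's actual code. `weekly_slot_after` is a hypothetical name, and it assumes GNU `date` and a cron daemon that interprets times in UTC.

```shell
#!/usr/bin/env bash
# Sketch only: weekly_slot_after is a hypothetical name. Assumes GNU
# date (for -d "@epoch" and the %-M / %-H unpadded formats) and a cron
# daemon whose schedule is evaluated in UTC.

# Print the "minute hour * * day-of-week" cron fields for one hour
# after the given Unix timestamp (seconds since the epoch).
weekly_slot_after() {
  local t0=$(( $1 + 3600 ))   # T0 = completion time + 1h
  date -u -d "@${t0}" '+%-M %-H * * %w'
}

# Example: anchor the weekly slot to the current time.
weekly_slot_after "$(date +%s)"
```

The printed fields are the schedule half of a crontab line. The command half would be the `launch-upgrader.sh start` invocation named in the commit, whose install path is not given in this log.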