cc-ci-orchestrator

Author	SHA1	Message	Date
autonomic-bot	b63edbbd7f	upstream(lasuite-drive): release-notes sources	2026-06-05 02:49:50 +00:00
autonomic-bot	a3740e1fdf	upstream(lasuite-docs): release-notes sources	2026-06-05 02:43:03 +00:00
autonomic-bot	f5da8ac3ff	upstream(keycloak): release-notes sources	2026-06-05 02:27:00 +00:00
autonomic-bot	287fb51d91	upstream(ghost): release-notes sources	2026-06-05 02:20:11 +00:00
autonomic-bot	d24feb0671	upstream(discourse): release-notes sources	2026-06-05 02:02:19 +00:00
autonomic-bot	85065880a5	upstream(custom-html-tiny): release-notes sources Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-05 01:58:29 +00:00
autonomic-bot	167ce2d881	upstream(custom-html): release-notes sources	2026-06-05 01:32:13 +00:00
autonomic-bot	f0716764db	feat(recipe-upgrade): upstream release-notes registry + recipe-README read (recipe-maintainer parity) Close the two gaps vs recipe-maintainer's recipe-upgrade-plan: - Per-recipe release-notes registry at cc-ci-plan/upstream/<recipe>.md (discover the source repo + releases/changelog URL for each image once, persist+commit, reuse) — fetch release notes FROM those URLs instead of rediscovering ad-hoc each run. Format doc + cryptpad seed included. - Explicitly read the recipe's README for shipped upgrade/migration notes. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-05 01:28:27 +00:00
autonomic-bot	d8ad5a2805	feat(recipe-report): link recipe names in all story sections (security/needs/routine), not just the lead _stories() now auto-links whole-word recipe mentions in story titles + bodies to their mirror repos (same single-pass linkify as the lead); explicit PR/build links are untouched. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 02:21:31 +00:00
autonomic-bot	a6efcec720	feat(recipe-report): link recipe names in the lead to their mirror repos; 3-para concise lead render() auto-links whole-word recipe mentions in the editorial lead to git.autonomic.zone/recipe-maintainers/<recipe> (single regex pass, longest-name-first, no href corruption). Skill: lead is ~3 short paragraphs (~150-180 words) incl. an 'anything strange worth looking into' paragraph. example-spec.json lead updated to the concise target. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 02:17:19 +00:00
autonomic-bot	f20a066f5c	journal: recipe-report v2 newspaper front page (CVE-led editorial) live Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 23:20:48 +00:00
autonomic-bot	6cf59130db	feat(recipe-report): newspaper front-page layout — editorial lead + CVE security bulletin first Masthead + opus 'lead' editorial (overall fleet state + what to focus on), a Security Bulletin of critical-CVE upgrades up top (mined from per-recipe upgrade_notes_md), then needs-attention/routine, and the comprehensive table as 'the full wire' at the end. survey now includes each recipe's upgrade_notes_md (breaking-change/CVE analysis) so opus can lead with security. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 23:13:40 +00:00
autonomic-bot	856df8cb37	journal: /recipe-report + report.ci.commoninternet.net shipped; first opus report live Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 23:06:22 +00:00
autonomic-bot	c7301a9e39	feat(recipe-report): /recipe-report skill + helper + launcher (default opus); wire into upgrade-all - recipe-report.py: survey (run + per-recipe PRs + CI verdicts) / render (spec->HTML) / publish (copy to cc-ci:/var/lib/cc-ci-reports + regen index). - skill .claude/skills/recipe-report: review the weekly run, classify needs-attention vs routine, publish one public HTML page per week + index at report.ci.commoninternet.net. Read-only. - launch-report.py: one-shot cc-ci-report agent, REPORT_MODEL default opus (separate from the sonnet upgrader), REPORT_BACKEND default claude. - upgrade-all SKILL: closing step launches the report agent. Serving (nix/modules/reports.nix) already deployed + live. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 23:02:22 +00:00
autonomic-bot	73aa20e8ab	plan(recipe-report): separate configurable report model (default opus); link CI results, no embedded images Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 22:52:27 +00:00
autonomic-bot	81984c84da	plan: /recipe-report skill + report.ci.commoninternet.net weekly report Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 22:37:56 +00:00
autonomic-bot	d38f80048a	journal: bridge one-comment-per-!testme deployed; note cc-ci deploy-path gap Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 17:30:13 +00:00
autonomic-bot	bfe3a97301	journal: overnight /upgrade-all complete — 10 GREEN, 2 stale-test, 2 failed, 4 skipped Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 11:41:47 +00:00
autonomic-bot	c0852d2302	feat(logs): readable greppable per-agent transcript logs (agent-log.py) The raw 'tmux pipe-pane' logs are TUI-escape soup (the 191MB builder log). agent-log.py renders Claude's own JSONL transcript into a clean one-event- per-line <agent>.clean.log — read-only on a file the agent writes anyway, so zero agent slowdown and zero extra tokens. Resolves each agent's transcript (disambiguating the shared project dir by kickoff signature; tracks restarts). 'follow-all' runs as the cc-ci-cleanlogs session, wired into launch.py start so it comes up with the loops. render/tail subcommands for ad-hoc use. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 04:35:17 +00:00
autonomic-bot	cdbc5bb42f	journal: mirror+regression phases DONE (build sequence complete); overnight /upgrade-all running Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 03:43:46 +00:00
autonomic-bot	04fdefcd39	plan: overnight run — after assistant, run /upgrade-all + morning report Bash runner (cheap polling, no claude budget) that gates on the assistant's PR-consolidation done-marker, waits past the usage-limit reset (~03:30 UTC) and for the loops to idle, runs the weekly /upgrade-all (DEFAULT, never merges), then writes overnight-report-<date>.md and pings the orchestrator to notify. One-off; the Sunday 02:00 timer is unchanged. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 02:10:13 +00:00
autonomic-bot	7789e44252	task: assistant — consolidate open recipe PRs to one per recipe Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 02:02:00 +00:00
autonomic-bot	35f83a4b74	docs: orchestration.md as the root agent map; wake prompt + AGENTS.md point to it One root doc maps every agent (Builder, Adversary, Orchestrator, Assistant, Upgrader) -> its prompt + plan, with the watchdog and git coordination protocol as the subtlety beneath. Fold the orchestrator supervision routine into it (remove orchestrator-supervision.md). The hourly wake prompt and AGENTS.md now just point at orchestration.md. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 01:42:49 +00:00
autonomic-bot	37a422bc31	refactor(wake): thin wake prompt -> points at orchestrator-supervision.md The hourly wake prompt was hardcoding phase 5 / STATUS-5.md and going stale as the build advanced. Make it a one-line pointer to a maintained doc (orchestrator-supervision.md) that looks the CURRENT phase up live via launch.py status — so the wake prompt never needs editing as phases change. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 01:37:32 +00:00
autonomic-bot	7bdeb74449	plan(regression): add per-tier RED canaries (install/upgrade/backup/restore) One deliberately-broken custom-html-tiny fixture per lifecycle tier so the suite proves the server reports RED at EVERY tier (not just one) — each asserts RED at the intended tier with prior tiers PASS, so it's 'catches a failure at this tier', not 'fails somewhere'. Fast (simplest recipe); the fast subset of the suite vs the slow good canaries. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 01:28:23 +00:00
autonomic-bot	2f9d7df78f	ideas: package cc-ci itself as a Co-op Cloud recipe (parked, not implementing) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 00:43:44 +00:00
autonomic-bot	ad2ade842c	plan(mirror): remove the operator deploy gate — loops deploy+verify autonomously The gate existed because a wrong-target nixos-rebuild #cc-ci once dropped the cc-ci server into emergency mode. That footgun is fixed (be4f451 maps #cc-ci -> the Hetzner host config), and deploying cc-ci is the loops' normal operation, so Phase 4 now runs autonomously with verify + rollback as the safety net. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 00:38:59 +00:00
autonomic-bot	fd86baea2a	plan: regression canaries are milestone-cadence (polish/review/release), not per-commit Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 00:30:09 +00:00
autonomic-bot	947e7f55b9	plan: server regression canaries (codified E2E good+bad self-tests) E2E pytest canaries proving the server confirms a healthy app healthy (semantic per-tier assertions, not just exit codes) AND catches a broken one (false-green guard). Good canaries: custom-html-tiny + lasuite-docs; known-bad fixture must report RED. Queued as the loops' next phase after mirror-enroll. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 00:29:01 +00:00
autonomic-bot	2b617ba19f	feat(launch): persist PHASES_SPEC to .phases-spec (status/watchdog/reboot agree) Mirror the .loop-backend pattern: env wins, else the persisted file, else the default build sequence. Without this, a custom single-phase run was invisible to bare 'launch.py status' and would NOT survive a reboot (the service has no PHASES_SPEC env). Now the current phase set is durable. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 00:17:34 +00:00
autonomic-bot	d349656c3b	feat(launch): forward PHASES_SPEC/backend to watchdog; mark plan Phase 4 as operator gate The watchdog is spawned into the existing tmux server and didn't reliably inherit a custom PHASES_SPEC — it would fall back to the default 11-phase spec and mis-detect completion. Forward PHASES_SPEC/PHASE_IDX_FILE/ LOOP_BACKEND/LOOP_MODEL explicitly in the watchdog command so custom single-phase runs (like the mirror-enroll plan) work end-to-end. Also make the mirror-enroll plan's live-host-deploy step an explicit claim-and-wait operator gate for the loops. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 00:15:42 +00:00
autonomic-bot	8007053d94	plan: mirror + enroll ALL recipes before resuming per-recipe debugging Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 00:13:00 +00:00
autonomic-bot	d219b0972c	journal: BUILD COMPLETE + weekly-upgrade cron cutover to NixOS timer (Sun 02:00 UTC) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 23:26:59 +00:00
autonomic-bot	d8f558e987	journal: backend reverted to claude, waker folded into watchdog, boot service fixed Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 21:48:09 +00:00
autonomic-bot	2235110e29	journal: phase-5 progress-monitor events (19:04, 19:08) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 21:46:29 +00:00
autonomic-bot	ca6e68c08d	feat(orchestrator): fold hourly supervision wake into the watchdog The standalone ai-progress-monitor.sh waker pinged a hardcoded orchestrator session every 15m. Move that into the watchdog loop: ORCH_WAKE_INTERVAL (default 3600s) types the supervision prompt into the live orchestrator session, retrying each tick until it lands so a busy or briefly-absent orchestrator is never interrupted and no hour is skipped. Delete the now-redundant waker script; the prompt file is now driven by the watchdog. Reboot-safe by inheritance (the watchdog is started by cc-ci-loops.service). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 21:46:20 +00:00
autonomic-bot	8f7265e948	feat(orchestrator): wake the live monitor session	2026-06-01 18:51:05 +00:00
autonomic-bot	9fe9d49cac	journal: record Hetzner rescue recovery for cc-ci	2026-06-01 13:55:15 +00:00
autonomic-bot	8093a95184	journal: session 2026-06-01 03:34 UTC handoff (opencode gpt-5.4 visible)	2026-06-01 13:03:51 +00:00
autonomic-bot	837fed17d2	fix(orchestrator): attach opencode session from orchestrator repo	2026-06-01 13:03:51 +00:00
autonomic-bot	24bf379b5b	feat(assistant): add opencode launcher and phase 6/7 plans	2026-06-01 12:59:03 +00:00
autonomic-bot	6a6c17f526	fix(launch-orchestrator): opencode uses plain TUI + ping, not run --attach Same fix as the loops: opencode run --attach exits after one turn; plain opencode TUI stays alive in tmux. Send startup prompt via ping_session (Enter) after 8s init wait. Bootstrap points to JOURNAL.md rather than sending the full prompt inline. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 18:30:09 +00:00
autonomic-bot	2aa3fbda8d	journal: session 2026-05-31 18:30 UTC handoff (opencode/deepseek running, phase 5)	2026-05-31 18:27:17 +00:00
autonomic-bot	3412100240	fix(opencode): all issues from first live run resolved 1. API key: opencode doesn't support env: substitution in apiKey — write actual key value to ~/.config/opencode/opencode.jsonc at setup time (file is not committed to git; key sourced from .testenv). 2. Permission system: add permission:"allow" to opencode config (equivalent to --dangerously-skip-permissions) to avoid interactive prompts. 3. Submit key: opencode TUI uses Enter (return) to submit; Ctrl+S not needed. ping_session already uses Enter — keep as is. 4. Startup timing: bump opencode TUI init wait from 4s to 8s so the TUI is fully connected to the server before bootstrap is sent. 5. Backend persistence: LOOP_BACKEND/LOOP_MODEL written to .loop-backend / .loop-model so the watchdog uses them when restarting dead sessions. All tested: both builder and adversary sessions alive, deepseek-v4-pro processing kickoffs via tinfoil inference.tinfoil.sh, no API/permission errors. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 18:21:10 +00:00
autonomic-bot	cd5e645427	fix(opencode): use inference.tinfoil.sh + attach TUI + NO_COLOR Three fixes discovered during first live run: - inference host is inference.tinfoil.sh not api.tinfoil.sh (control plane only serves /v1/models, not /v1/chat/completions) - opencode run exits after one turn; switch to opencode attach for the persistent TUI, then ping_session sends the kickoff prompt - NO_COLOR=1 suppresses the first-run interactive theme picker Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 17:56:06 +00:00
autonomic-bot	bca51071bd	refactor: rewrite launchers as Python; add orchestrator JOURNAL.md Bash scripts are now one-liner wrappers: exec python3 <script>.py "$@" All logic lives in the Python scripts (pure stdlib, no deps). launch.py — loops + watchdog: Full port of launch.sh: phase sequencing, start/stop/status/logs/watchdog, handoff signalling, stall detection, heal_session, heal_orchestrator. Cleaner structure: config block → helpers → phase/kickoff/agent/healing/ handoff/watchdog/main. LOOP_BACKEND + LOOP_MODEL switches throughout. launch-orchestrator.py — orchestrator session: claude path: --resume <id> preserved (conversation survives reboots). opencode path: run --attach --title (no --resume; STARTUP_PROMPT orients the new session; reads JOURNAL.md for context). STARTUP_PROMPT updated to reference JOURNAL.md on startup. launch-upgrader.py — one-shot upgrade job: LOOP_BACKEND / LOOP_MODEL take precedence over UPGRADER_BACKEND / UPGRADER_MODEL. Both claude and opencode paths supported. cc-ci-plan/JOURNAL.md — new orchestrator handoff file: Persistent across conversation resets. Documents the handoff format and carries the current session's summary: migration complete, phase 5 in progress (V3/V7 PASS), phase 4 deferred, open items for next session. AGENTS.md: step 1 on startup = read JOURNAL.md; step 5 = append on handoff. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 17:50:09 +00:00
autonomic-bot	e0e5bf6e64	feat: opencode web at oc.commoninternet.net (one server, named sessions) configuration.nix: - systemd.services.opencode-web: one shared opencode server on 127.0.0.1:4096, EnvironmentFile=/srv/cc-ci/.testenv (TINFOIL_API_KEY), ExecStartPre clears stale /tmp/opencode so restarts never fail on the EEXIST race. - services.nginx: reverse-proxy oc.commoninternet.net → localhost:4096, bound to tailscale IP 100.84.190.30 (tailnet-only, plain HTTP). DNS: A record oc.commoninternet.net → 100.84.190.30 (operator step). launch.sh + launch-upgrader.sh: - Drop per-session ports / OPENCODE_HOST; add OPENCODE_SERVER=http://127.0.0.1:4096. - opencode backend: agents use `opencode run --attach $OPENCODE_SERVER --title $session` so each shows up as a named session in the web UI. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 17:37:03 +00:00
autonomic-bot	a87d42f491	feat: opencode/tinfoil backend support in all launchers Adds LOOP_BACKEND=opencode\|claude (+ LOOP_MODEL) to launch.sh and launch-upgrader.sh, enabling the loops/upgrader to run via opencode CLI against the tinfoil.sh API (deepseek-v4-pro etc.) instead of Claude. launch.sh: - LOOP_BACKEND (claude\|opencode), LOOP_MODEL env vars - OPENCODE_BIN, OPENCODE_HOST (tailscale IP), OPENCODE_PORT (per-session) - start_agent: backend switch — claude path unchanged; opencode starts `opencode --hostname <ts-ip> --port <N> run <kickoff>` so the web UI is bound to the tailscale interface (tailnet-only observability) - preflight: validates the right binary per backend - heal_session / heal_orchestrator: extend active-work detection to opencode spinner chars + "Running tool" - help: shows both backend configs launch-upgrader.sh: - UPGRADER_BACKEND / UPGRADER_MODEL (LOOP_BACKEND/LOOP_MODEL override) - start: same backend switch as launch.sh - OPENCODE_PORT=4098 (separate from loops 4096/4097) configuration.nix: note opencode binary location + re-install command. Tinfoil config: ~/.config/opencode/opencode.jsonc — provider "tinfoil" with baseURL=https://api.tinfoil.sh/v1, apiKey=env:TINFOIL_API_KEY (key + TINFOIL_MODEL + TINFOIL_BASE_URL stored in .testenv). opencode v1.15.13 installed at /home/loops/.local/bin/opencode. Usage: LOOP_BACKEND=opencode LOOP_MODEL=tinfoil/deepseek-v4-pro \ RESUME_PHASE=1 cc-ci-plan/launch.sh start Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 17:21:13 +00:00
autonomic-bot	25fd7407fd	launch-upgrader: default model to sonnet (UPGRADER_MODEL) Adds UPGRADER_MODEL env var (default: sonnet) passed as --model to the claude invocation. The cron runs the upgrader on Sonnet so it doesn't consume Opus weekly credits. Override with UPGRADER_MODEL=opus if needed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 13:24:53 +00:00
autonomic-bot	21e7a79f50	orchestrator-hetzner: enable reboot-resilience + record migration Now the workspace is staged on the Hetzner cpx22 (server 134487234, public 91.98.47.73, tailnet cc-ci-orchestrator-1 @ 100.84.190.30): - configuration.nix: enable cc-ci-loops.service (wantedBy multi-user.target) so the loops + watchdog auto-resume on boot; wire reboot-log.sh as ExecStartPre so reboots auto-log to REBOOTS.md (boot_id-gated). - plan-orchestrator-hetzner-migration.md: full migration record. - REBOOTS.md / AGENTS.md: point the orchestrator host at Hetzner; first auto-logged reboot line. - launch-orchestrator.sh: default session id -> the Hetzner orchestrator session. - flake.lock: pin inputs. Verified: nixos-rebuild switch applied; systemctl is-enabled cc-ci-loops.service = enabled; ExecStartPre logged this boot to REBOOTS.md; loops healthy on phase 2. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-31 03:54:17 +00:00

1 2 3

122 Commits