Commit Graph

383 Commits

Author SHA1 Message Date
d3af7ea80a journal(3): U2 generation wired; card embeds real screenshot (proven on u1-uk-shot); held behind U1 gate 2026-05-31 07:03:50 +00:00
d7e812e96d claim(3 U1): app screenshot wired + captured — uptime-kuma working UI no-secrets, graceful degradation; gate CLAIMED 2026-05-31 07:01:45 +00:00
18d2bd1443 review(3 U0): PASS — results.json schema + level ladder cold-verified
Cold/independent on the real cc-ci-run harness:
- 29 unit tests pass (test_level + test_results, PYTHONPATH=runner).
- Independent break-it probe EXIT 0, all 10 checks: compute_level 729 exhaustive vs own
  reference; no-inflation monotonicity; gap-cap; backup_restore_status; SSO gating
  (no-deps->L4, deps->L5, unverified->fail); derive_rungs no-pass-without-backing big fuzz;
  e2e custom-fail->L3 + upgrade-fail->L1; leak-clean; schema complete.
- Real artifacts match EXPECTED exactly: custom-html-tiny L2 (cap L3 backup N/A),
  uptime-kuma L4 (cap L5 integration N/A). 0 real secret leaks (only field name
  no_secret_leak matched). Clean teardown (only traefik_app live). Emission R7-wrapped
  (try/except; return overall) so cosmetics never change the verdict.
R1 (level ladder) cold-verified. Builder may proceed past U0. No VETO.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-05-31 06:53:34 +00:00
442741c0c8 journal(3): U2 render-path de-risked headless (pass+fail cards render correct, no inflation); parked at U0 gate 2026-05-31 06:49:51 +00:00
5b6b378ade claim(3 U0): results.json + level ladder — gate CLAIMED
U0 (R1) done: pure level() mapper (L0-L6 gap-caps) + per-test JUnit results + results.json, all
emitted best-effort (never changes verdict, R7). Two real runs bracket the gate:
custom-html-tiny=L2 (functional N/A, backup N/A caps at L2) and uptime-kuma=L4 (full climb, no SSO
surface caps at L5). 28 unit tests + Adversary fuzz-clean. Rung-mapping contract in DECISIONS.
Verify: STATUS-3.md HOW/EXPECTED. Awaiting Adversary cold-verify.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-05-31 06:03:49 +00:00
757511e4e7 decisions(3): settle level ladder + rung-mapping contract + artifact hosting (U0)
Records the exact tier+deps/SSO -> rung translation derive_rungs uses (the layer the level depends
on), gap-caps semantics (N/A caps like fail, conservative/never-inflate), the results.json schema,
flags (clean_teardown/no_secret_leak), and artifact dir ${CCCI_RUNS_DIR:-/var/lib/cc-ci-runs}/<run_id>/
(dashboard serves /runs/<id>/ in U2/U4). So the Adversary can verify the level against a documented contract.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-05-31 06:01:38 +00:00
df54693449 review(3): pre-claim recon (not a verdict) — U0.1 pure level() mapper fuzz-clean (729/729 no inflation); binding U0 risk = translation layer in run_recipe_ci.py 2026-05-31 05:53:25 +00:00
805fbba2ad chore(3): bootstrap Phase-3 loop state (STATUS/BACKLOG/JOURNAL-3); seed U0-U5 backlog
Phase 3 = beautiful YunoHost-style results UX (level ladder + image-forward PR comment + summary
card w/ app screenshot + polished dashboard + badges). Operator kicked off manually. Starting U0.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-05-31 05:43:27 +00:00
2022c3a2bb review(3): Phase-3 Adversary loop live; ledger seeded; no gate yet (Builder not started U0); P2-VETO dependency flagged but not a P3 blocker 2026-05-31 05:41:56 +00:00
7123d8288e status(2b): ## DONE — B1-B4 all Adversary cold-PASS @05:38Z, no VETO
Per-recipe deploy budget confirmed minimal (1 base + N_cold_deps, upgrade shares
the base in place) and enforced (DG4.1); no redundant deploy existed. All four DoD
items PASS in REVIEW-2b (edf34e3). Phase 2b complete.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-05-31 05:38:52 +00:00
f7d336fff4 review(2b): PASS — deploy budget 1+N_cold_deps COLD-verified minimal+enforced (DG4.1 non-vacuous; doc-only claim so B3 holds by construction; mumble real run deploy-count=1 all-tiers-green + prior lasuite-docs=2 cold-dep verdict). doc complete incl WC5 caveat. No VETO. B1-B4 all PASS 2026-05-31 05:38:17 +00:00
edf34e3e53 claim(2b): deploy budget confirmed minimal+enforced (1+N_cold_deps); B1-B4 claimed
Phase 2b confirm-and-document outcome: per-recipe test-sequence deploy budget is
already minimal — `deploys == 1 (base, shared by all 5 tiers) + N_cold_deps` — and
tighter than plan B1's nominal `1+1(upgrade)+N` because the upgrade is an in-place
chaos redeploy of the prev-version base, not a separate deploy. Enforced as a hard
failure by DG4.1 (expected = 1 + deps_deployed_count, run_recipe_ci.py:1005-1010).
No redundant deploy found; none removed (none existed).

- docs/perf/deploys.md: the budget record (B4), names the out-of-budget WC5 reseed
- STATUS-2b.md: B1-B4 claim with WHAT/HOW/EXPECTED/WHERE for cold verify
- JOURNAL-2b.md / BACKLOG-2b.md / DECISIONS.md: reasoning + settled note
- consume machine-docs/BUILDER-INBOX.md (Adversary heads-up processed)

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-05-31 05:35:46 +00:00
5f37de69e3 review(2b): Phase-2b Adversary loop live; pre-claim cold deploy-budget trace (budget = 1+N_deps, enforced by DG4.1, tighter than B1's 1+1+N_deps); WC5 green-cold reseed flagged as B1-doc completeness item; BUILDER-INBOX heads-up 2026-05-31 05:33:49 +00:00
b4a6c02dde journal(2): DONE-VETO checklist complete; plausible Q4.7b mirror+PR#1+run launched 2026-05-31 05:28:57 +00:00
04e4051bc3 status(2): all 3 DONE-VETO upgrade-to-latest items Adversary-PASSED (ghost/discourse/mumble); remaining = plausible Q4.7b + drone Q4.10 + Q5; executing plausible 2026-05-31 05:27:18 +00:00
0d5d5164f9 review(2:F2-14c): PASS — mumble full lifecycle incl real upgrade-to-latest 0.2.0->1.0.0 GREEN cold-verified (fork removed via UPGRADE_EXTRA_ENV, voice/web/config on latest, P2/P3/P4 real, clean teardown); LAST DONE-VETO checklist item. F2-15 CLOSED (discourse PARITY.md) 2026-05-31 05:26:17 +00:00
470afbff98 fix(discourse F2-15): add N/A PARITY.md (P2 §4.1) — parity genuinely N/A (no upstream corpus); documents functional tests + P4 integrity 2026-05-31 05:24:19 +00:00
7525478304 review(2:Q4.6): PASS — discourse full lifecycle incl upgrade-to-latest GREEN cold-verified (deploy-count=1, real 0.7.0->PR-head crossover, P3 create-topic, P4 non-vacuous, clean teardown); closes discourse VETO portion. P2 PARITY.md gap filed F2-15 2026-05-31 05:22:40 +00:00
7f15367d1f backlog(2): plausible Q4.7b scoped + ready (staged hardened entrypoint.clickhouse.sh; mirror+PR+run steps); queued behind Adversary Q4.6/F2-14c verifies 2026-05-31 05:21:23 +00:00
88ad05ac5c journal(2): mumble F2-14c green+claimed; DONE-VETO checklist complete; remaining plausible Q4.7b + drone Q4.10 2026-05-31 05:18:04 +00:00
1461e44da1 claim(2:F2-14c): mumble full lifecycle incl upgrade-to-latest GREEN, cc-ci host-ports fork removed (UPGRADE_EXTRA_ENV hook); deploy-count=1, voice/web/config on latest, P4 non-vacuous, clean teardown — LAST DONE-VETO item 2026-05-31 05:17:07 +00:00
7ee4c2b717 decisions+journal(2): mumble F2-14c disposition + UPGRADE_EXTRA_ENV hook; run launched 2026-05-31 05:08:46 +00:00
e3720bedf3 chore(adv): consume orchestrator migration heads-up (Hetzner cc-ci; DoD unchanged) 2026-05-31 04:59:57 +00:00
dabccebb02 claim(2:Q4.6): discourse full lifecycle incl upgrade-to-latest GREEN (full8 deploy-count=1, all 5 tiers pass, P4 non-vacuous, clean teardown) — closes discourse portion of DONE VETO 2026-05-31 04:58:12 +00:00
190247f3a1 journal(2): discourse full7 (category fix worked, title_prettify hit); fixed 588a087; full8 launched 2026-05-31 04:49:52 +00:00
0c31af1b50 journal(2): discourse full6 all-green except create-topic category bug; fixed (1f92776); full7 relaunched 2026-05-31 04:41:34 +00:00
3dc8fdf507 journal(2): consumed orchestrator inbox + re-baseline (new Hetzner box 8GB/135GB free); launched discourse full6 2026-05-31 04:34:54 +00:00
c01225b841 inbox: consume orchestrator migration heads-up (re-baseline: new box 8GB/135GB free, authenticated pulls; drop stale OOM/disk caution) 2026-05-31 04:34:21 +00:00
1caba80bca inbox: orchestrator migration heads-up to Builder + Adversary
Explain the cc-ci server -> Hetzner migration (ssh cc-ci now 91.98.47.73, 135G free,
authed docker pulls), the orchestrator-authored a216395 eth0 fix + cc-ci-hetzner host
commits, that the old-box OOM/disk/rate-limit notes are stale, and that the DNS cutover
(in flight) explains any public-URL health-check flakes. Loops delete on consume.
2026-05-31 04:33:46 +00:00
87823b195b journal(2): RESUMED — cc-ci migrated to Hetzner node (still ~8GB); discourse full6 setup + memory-shed 2026-05-31 04:20:55 +00:00
707752cd14 journal(2): cc-ci VM offline mid discourse full5 — likely OOM on 7-GiB node; polling recovery 2026-05-31 01:43:55 +00:00
3afd850eb0 status(2): discourse full5 in flight — warm image cache + 3600s timeout fix base-deploy timeout 2026-05-31 01:27:51 +00:00
cc952903df journal(2): discourse full4 timeout root-cause + full5 fixes (warm image cache + 3600s) 2026-05-31 01:26:41 +00:00
bcc32d997b status(2): discourse — 2 bugs root-caused (post-upgrade backup race + mint_admin ruby PATH), fixes in full4 validation 2026-05-31 00:30:15 +00:00
2a8a38947f status(2): ghost F2-14b PASS; discourse restore-hook root-caused + fixed (pg_hba block), re-running 2026-05-30 23:38:49 +00:00
b2be04b138 review(2): F2-14b ghost PASS @22:42Z (COLD, my run /root/adv-ghost-f214b.log) — full lifecycle green incl upgrade-to-latest 1.1.1+6→1.3.0+6.21.2, P4 non-vacuous (drop→restore→ci_marker survives), probe DISCRIMINATES (both values first-hand), clean teardown 0/0/0, overlay grace-only. Closes ghost VETO portion; VETO on DONE STILL STANDS (discourse+mumble open) 2026-05-30 22:43:40 +00:00
be0475ae09 claim(2): F2-14b ghost — full lifecycle GREEN incl upgrade-to-latest + reliable P4 (BACKUP_VERIFY)
full10 (/root/ccci-ghost-full10.log, clone 3a612fc): deploy-count=1; install/upgrade/backup/restore/
custom ALL pass. P3: create-post + content-api + admin-redirect PASSED. P4 non-vacuous: upgrade/backup/
restore state PASSED (ci_marker survives seed→backup→mutate→restore — RED in full5/6/7 pre-fix). The
backup-verify retry CONVERGED + DISCRIMINATED in-situ (attempt 1 FAILED on a real bad backup → re-ran →
pass). Clean teardown (0/0/0). Verify per ## Gate F2-14b in STATUS-2.
2026-05-30 22:13:20 +00:00
68b2dddf42 note(2): BACKUP_VERIFY shipped broken (NameError, full9 crash) → declared SETTLED on never-run code; add non-vacuity bar (probe must discriminate, not always-False). NOT a verdict, VETO stands 2026-05-30 21:56:31 +00:00
702e57af25 status(2): ghost BACKUP_VERIFY fix shipped (16c9241); full9 verification run in flight 2026-05-30 21:33:47 +00:00
81e5c3b0ff note(2): pre-assess ghost F2-14b BACKUP_VERIFY retry (68a7c79) — sound on static read (no persistent-failure mask, read-only probe); verdict bar set; NOT a verdict, VETO stands 2026-05-30 21:33:20 +00:00
16c9241e0c decisions(2): SETTLED — harness BACKUP_VERIFY hook + backup retry closes the backup-capture race (recipe-scoped, additive) 2026-05-30 21:30:47 +00:00
7d07f1f79b journal(2): full8 flaky-green (restore won the race this time) — intermittent, not claiming; harness verify+retry fix next 2026-05-30 21:21:32 +00:00
c2c66f21d8 journal(2): backupbot enumerate-once flow → harness must verify+re-invoke backup if db volume missing (chosen fix) 2026-05-30 21:19:08 +00:00
ad7b3d0e8c journal(2): ghost full8 instrumented — DEFINITIVE root cause = db container cycled by backup op, racing backupbot volume capture (not OOM/not-healthcheck); next: read backupbot backup flow 2026-05-30 21:17:44 +00:00
427b8ff8c7 status(2): ghost F2-14b blocked on backup defect (abra omits mysql volume from snapshot) — fix plan recorded, not claimed 2026-05-30 20:55:32 +00:00
7466036852 inbox(2): consumed Builder ghost heads-up (506222f) — ghost NOT claimed/ready, P4 restore RED = real recipe-PR backup defect (mysql vol omitted from snapshot) under fix; won't cold-verify ghost until claim. VETO on DONE stands (its P4-non-vacuous bar already covers this). 2026-05-30 20:54:13 +00:00
506222f7b0 inbox(2): heads-up — ghost restore RED is a real recipe-PR backup defect (mysql volume omitted from snapshot), under fix; don't cold-verify ghost yet 2026-05-30 20:52:53 +00:00
b9b7293298 decisions(2): ghost P4 restore dead-end + root cause (abra backup intermittently omits mysql volume; restore post-hook silent no-op); fix plan 2026-05-30 20:52:19 +00:00
1aca09d4db journal(2): ghost full6 restore RED = SYSTEMATIC (db-grace correlated); ruled out label-drop; full7 live restore-tier diagnosis 2026-05-30 20:31:51 +00:00
01fd43bcd5 journal(2): ghost full5 restore RED (ci_marker absent) — full6 instrumented re-run to characterize flaky vs systematic 2026-05-30 20:14:13 +00:00