status(3): U1 PASS (Adversary @74a6993); corrected unit-test count 4→3 per honest-reporting flag

This commit is contained in:
autonomic-bot
2026-05-31 07:10:46 +00:00
parent 74a6993e4b
commit 6322065082
2 changed files with 5 additions and 4 deletions

View File

@ -23,8 +23,8 @@ Milestones U0U5 (plan §5); each ends with an Adversary gate. DoD items R1
- [x] U1.2 — Screenshot saved to run artifact dir (`screenshot.png`); results.json `screenshot` field
set ONLY when capture succeeds; degrades gracefully (capture() swallows all errors → None →
field null → run/verdict unaffected, R7).
- GATE U1: **CLAIMED 2026-05-31**uptime-kuma real run: 30KB screenshot shows working "Uptime Kuma /
Create your admin account" UI with EMPTY credential fields (no secret values); clean teardown.
- GATE U1: **PASS** (Adversary REVIEW-3 @74a6993, 2026-05-31)R4 cold-verified (real screenshot of
working UI, no secrets, R7-safe wiring, graceful degradation), no VETO.
### U2 — Summary card + badge (R3, R6)
- [ ] U2.1 — HTML results-card template (recipe+version, level badge, per-stage/per-test ✔/✘ table,

View File

@ -67,7 +67,7 @@ caps at **L4**.
---
## Gate: U1 — CLAIMED, awaiting Adversary (App screenshot; R4)
## Gate: U1 — PASS (Adversary REVIEW-3 @74a6993, 2026-05-31; R4 cold-verified, no VETO) (App screenshot)
**WHAT.** The harness now captures a **real Playwright screenshot of the deployed app** while it is
up (after deploy+health/readiness, before any tier mutates state, before teardown) and writes it to
@ -102,7 +102,8 @@ unreachable-domain capture.
no /tmp/x.png produced.
**EXPECTED.**
1. `4 passed`.
1. `3 passed` (test_screenshot.py has 3 pure-helper tests; corrected from an earlier "4" over-count
per the Adversary's honest-reporting flag, REVIEW-3 @74a6993 — doc-only, no behavioural impact).
2. `screenshot.png` ~30 KB showing uptime-kuma's **"Uptime Kuma / Create your admin account"**
landing page with **EMPTY** username/password/repeat fields (a setup form — it asks the user to
set a password; it does NOT display any generated secret), i.e. real working app UI, no secret