admin/orchestrator

fix(staging): tolerate sandbox-infra-only FAILs (C9a/C9b) in deploy-staging verdict (ORCH-061) #62

Merged

admin merged 9 commits from feature/ORCH-061-bug-deploy-staging-development into main

2026-06-07 16:30:07 +03:00

Author	SHA1	Message	Date
claude-bot	39769bdf23	tester(ET): auto-commit from tester run_id=300 [CI passed: push 17s, pull_request 17s]	2026-06-07 13:21:17 +00:00
claude-bot	de47737f4f	reviewer(ET): auto-commit from reviewer run_id=299 [CI passed: push 16s, pull_request 15s]	2026-06-07 13:18:47 +00:00
stream	e3f7c1c272	ci: re-trigger after gitea restart (ORCH-061) [CI passed: push 16s, pull_request 17s]	2026-06-07 13:14:14 +00:00
stream	32a7aa8c6b	ci: trigger re-run after host disk cleanup (ORCH-061)	2026-06-07 13:08:38 +00:00
stream	fe8586ed78	ci: re-run after host disk cleanup (ORCH-061)	2026-06-07 13:04:38 +00:00
claude-bot	9070489968	fix(staging): tolerate sandbox-infra-only FAILs (C9a/C9b) in deploy-staging verdict [CI failed: push after 39s, pull_request after 35s]
The self-hosting orchestrator looped on deploy-staging -> development because scripts/staging_check.py exited 1 on ANY failed check. Two infra-only checks (C9a sandbox branch / C9b analyst-job) fail because the SANDBOX bot accounts are not members of the sandbox Plane project, NOT because of a pipeline regression. They nevertheless forced staging_status: FAILED -> rollback -> loop, burning developer retries and tokens.
Direction (b) per ADR-001: classify staging checks as REAL (all pipeline checks, fail-closed) vs SANDBOX_INFRA (narrow allowlist {C9a, C9b}, waivable).
New leaf module src/staging_verdict.py (stdlib-only, never-raise): classify_check + compute_staging_verdict fold per-check results into a tolerant-but-fail-closed verdict: any REAL failure -> FAILED/exit 1 (safety net holds under any flag); only C9a/C9b failed & tolerant -> SUCCESS/exit 0 with waived list; only infra & strict -> FAILED/exit 1; any internal error -> FAILED/exit 1 (never a false green).
staging_check.py now auto-classifies each check (public 3-tuple _items shape kept as an ORCH-048 b6 regression guard), exposes categorized_items(), prints INFRA-WAIVED/VERDICT lines, and exits via the verdict; the new --strict flag forces legacy strictness per run. The kill-switch ORCH_STAGING_INFRA_TOLERANCE_ENABLED (default true), when disabled, restores legacy strict mode globally.
The launcher gains action_stage_no_changes_note so that "no changes to commit" on action stages is logged as expected, not treated as under-delivery.
Contracts unchanged: STAGE_TRANSITIONS, QG_CHECKS registry, staging_status: / deploy_status: frontmatter, hook exit codes (0/1/2), check_staging_status; no DB migration.
Docs: README, STAGING_CHECK.md, deployer.md, .env.example, CHANGELOG.
Refs: ORCH-061
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-07 12:39:00 +00:00
claude-bot	1d1208c136	architect(ET): auto-commit from architect run_id=297 [CI passed: push 18s]	2026-06-07 12:22:46 +00:00
claude-bot	3ab2690a68	analyst(ET): auto-commit from analyst run_id=296 [CI passed: push 16s]	2026-06-07 12:10:46 +00:00
Slava	3806522041	docs: init ORCH-061 business request [CI passed: push 17s]	2026-06-07 15:05:55 +03:00
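
The verdict rules described in commit 9070489968 can be illustrated with a minimal sketch. This is not the contents of src/staging_verdict.py, which are not shown on this page. The names classify_check, compute_staging_verdict, the {C9a, C9b} allowlist and the ORCH_STAGING_INFRA_TOLERANCE_ENABLED variable come from the commit message; the function signatures, the (check_id, passed) input shape, the return tuple, the tolerance_enabled helper and the set of strings treated as "disabled" are assumptions made for illustration.

```python
import os
from enum import Enum


class Category(Enum):
    REAL = "REAL"
    SANDBOX_INFRA = "SANDBOX_INFRA"


# Narrow allowlist named in the commit message.
SANDBOX_INFRA_CHECKS = frozenset({"C9a", "C9b"})


def classify_check(check_id):
    """Return SANDBOX_INFRA only for allowlisted ids; everything else is REAL."""
    if isinstance(check_id, str) and check_id in SANDBOX_INFRA_CHECKS:
        return Category.SANDBOX_INFRA
    return Category.REAL  # fail-closed default for unknown or malformed ids


def tolerance_enabled(env=None):
    """Hypothetical helper: read the kill-switch, defaulting to true.

    The accepted 'off' spellings are an assumption, not taken from the source.
    """
    env = os.environ if env is None else env
    raw = str(env.get("ORCH_STAGING_INFRA_TOLERANCE_ENABLED", "true"))
    return raw.strip().lower() not in {"0", "false", "no", "off"}


def compute_staging_verdict(results, strict=False, tolerant=True):
    """Fold (check_id, passed) pairs into (status, exit_code, waived).

    Assumed input and output shapes; never raises.
    """
    try:
        failed = [cid for cid, passed in results if not passed]
        real = [c for c in failed if classify_check(c) is Category.REAL]
        infra = [c for c in failed if classify_check(c) is Category.SANDBOX_INFRA]
        if real:
            # Any REAL failure fails the run regardless of flags.
            return ("FAILED", 1, [])
        if infra and (strict or not tolerant):
            # Infra-only failures still fail in strict or legacy mode.
            return ("FAILED", 1, [])
        # Nothing failed, or only allowlisted infra checks failed and are waived.
        return ("SUCCESS", 0, infra)
    except Exception:
        # Any internal error yields FAILED, never a false green.
        return ("FAILED", 1, [])
```

Under these assumptions, a caller would pass strict=True for a --strict run and tolerant=tolerance_enabled() otherwise, then exit with the returned code. Defaulting unknown check ids to REAL keeps the allowlist the only path to a waiver.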