orchestrator/tests at 32a7aa8c6b6e7749687f86e5cf06ea493212a17d - orchestrator - Gitea: Git with a cup of tea

admin/orchestrator

Files

History

claude-bot 9070489968

CI / test (push) Failing after 39s

Details

CI / test (pull_request) Failing after 35s

Details

fix(staging): tolerate sandbox-infra-only FAILs (C9a/C9b) in deploy-staging verdict

The self-hosting orchestrator looped on deploy-staging -> development because
scripts/staging_check.py exited 1 on ANY failed check, so two infra-only checks
(C9a sandbox branch / C9b analyst-job — caused by SANDBOX bot accounts not being
members of the sandbox Plane project, NOT a pipeline regress) forced
staging_status: FAILED -> rollback -> loop, burning developer retries and tokens.

Direction (б) per ADR-001: classify staging checks as REAL (all pipeline checks,
fail-closed) vs SANDBOX_INFRA (narrow allowlist {C9a, C9b}, waivable). New leaf
module src/staging_verdict.py (stdlib-only, never-raise): classify_check +
compute_staging_verdict fold per-check results into a tolerant-but-fail-closed
verdict — any REAL failure -> FAILED/exit1 (safety net holds under any flag);
only C9a/C9b failed & tolerant -> SUCCESS/exit0 with waived list; only infra &
strict -> FAILED/exit1; any internal error -> FAILED/exit1 (never a false green).

staging_check.py now auto-classifies each check (public 3-tuple _items shape kept
as an ORCH-048 b6 regression guard), exposes categorized_items(), prints
INFRA-WAIVED/VERDICT lines, and exits via the verdict; new --strict flag forces
legacy strictness per-run. Kill-switch ORCH_STAGING_INFRA_TOLERANCE_ENABLED
(default true) restores legacy strict mode globally. launcher gains
action_stage_no_changes_note so "no changes to commit" on action stages is logged
as expected, not treated as under-delivery.

Contracts unchanged: STAGE_TRANSITIONS, QG_CHECKS registry, staging_status:/
deploy_status: frontmatter, hook exit-code (0/1/2), check_staging_status; no DB
migration. Docs: README, STAGING_CHECK.md, deployer.md, .env.example, CHANGELOG.

Refs: ORCH-061

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

2026-06-07 12:39:00 +00:00

..

__init__.py

feat: orchestrator MVP — webhooks, agent launcher, QG checks

2026-05-19 15:57:00 +03:00

conftest.py

feat(reconciler): sweeper потерянных webhook (реконсиляция застрявших стадий)

2026-06-06 20:55:25 +00:00

test_analysis_approve_flow_links.py

feat(notifications): direct BRD + Plane links in approve ping (ORCH-017)

2026-06-05 17:58:00 +00:00

test_analyst_comment_regression.py

feat(plane): unified status-comment format with duration line (ORCH-016) (#34 )

2026-06-05 17:50:47 +03:00

test_analyst_comment.py

feat(config): external gitea_public_url for clickable doc links

2026-06-03 22:58:18 +03:00

test_analyst_status_only_regression.py

feat(plane): unified status-comment format with duration line (ORCH-016) (#34 )

2026-06-05 17:50:47 +03:00

test_config.py

fix(staging): tolerate sandbox-infra-only FAILs (C9a/C9b) in deploy-staging verdict

2026-06-07 12:39:00 +00:00

test_deploy_approve.py

developer(ET): auto-commit from developer run_id=264

2026-06-07 07:46:19 +00:00

test_deploy_build_once.py

developer(ET): auto-commit from developer run_id=264

2026-06-07 07:46:19 +00:00

test_deploy_hook_mapping.py

developer(ET): auto-commit from developer run_id=264

2026-06-07 07:46:19 +00:00

test_deploy_hook_provenance.py

fix(ORCH-058): parametrize staging_check in --build-staging + explicit staging target

2026-06-07 09:24:38 +00:00

test_deploy_hook_rollback_sim.py

developer(ET): auto-commit from developer run_id=192

2026-06-06 21:07:35 +00:00

test_deploy_notifications.py

developer(ET): auto-commit from developer run_id=192

2026-06-06 21:07:35 +00:00

test_deploy_rollback.py

fix(deploy): clear stale self-deploy markers on rollback; document env

2026-06-06 21:07:35 +00:00

test_deploy_routing.py

developer(ET): auto-commit from developer run_id=192

2026-06-06 21:07:35 +00:00

test_deploy_terminal_sync.py

developer(ET): auto-commit from developer run_id=192

2026-06-06 21:07:35 +00:00

test_fmt_duration.py

feat(plane): unified status-comment format with duration line (ORCH-016) (#34 )

2026-06-05 17:50:47 +03:00

test_git_worktree.py

feat(worktree): git worktree per task to isolate shared /repos (ORCH-2 / S-4)

2026-06-02 21:12:06 +03:00

test_gitea_sha_resolve.py

feat(reconciler): sweeper потерянных webhook (реконсиляция застрявших стадий)

2026-06-06 20:55:25 +00:00

test_image_freshness.py

developer(ET): auto-commit from developer run_id=264

2026-06-07 07:46:19 +00:00

test_launcher.py

fix(staging): tolerate sandbox-infra-only FAILs (C9a/C9b) in deploy-staging verdict

2026-06-07 12:39:00 +00:00

test_log_rotation.py

feat(launcher): prune old run logs (L-2)

2026-06-03 09:53:55 +03:00

test_m6_sequence.py

fix(tests): per-project Plane states in webhook tests + close CI hole (ORCH-39) (#35 )

2026-06-05 17:36:40 +03:00

test_merge_gate_race.py

feat(merge-gate): auto-rebase onto current main + re-test + serialise merges

2026-06-06 17:32:50 +00:00

test_merge_gate.py

feat(merge-gate): auto-rebase onto current main + re-test + serialise merges

2026-06-06 17:32:50 +00:00

test_notify_approve_links.py

feat(notifications): direct BRD + Plane links in approve ping (ORCH-017)

2026-06-05 17:58:00 +00:00

test_notify_done_regression.py

feat(plane): unified status-comment format with duration line (ORCH-016) (#34 )

2026-06-05 17:50:47 +03:00

test_orch10_states.py

fix(plane): resolve issue states per-project instead of hardcoded enduro UUIDs (ORCH-10)

2026-06-05 14:23:31 +03:00

test_orch040_compose.py

fix(infra): run orchestrator containers as host uid 1000:1000 (not root)

2026-06-06 15:02:33 +00:00

test_pipeline_start_bugs.py

fix(pipeline): fetch issue name from Plane API on status-trigger start

2026-06-03 22:42:53 +03:00

test_plane_author.py

feat(plane): per-agent bot authorship for comments

2026-06-03 10:53:25 +03:00

test_plane_webhook.py

fix(tests): per-project Plane states in webhook tests + close CI hole (ORCH-39) (#35 )

2026-06-05 17:36:40 +03:00

test_post_usage_comments_integration.py

feat(plane): unified status-comment format with duration line (ORCH-016) (#34 )

2026-06-05 17:50:47 +03:00

test_projects.py

test(projects,webhook): cover registry resolvers + project filter

2026-06-02 22:30:51 +03:00

test_qg_checks.py

fix(staging): tolerate sandbox-infra-only FAILs (C9a/C9b) in deploy-staging verdict

2026-06-07 12:39:00 +00:00

test_qg_merge_gate.py

feat(merge-gate): auto-rebase onto current main + re-test + serialise merges

2026-06-06 17:32:50 +00:00

test_qg_registry_snapshot.py

developer(ET): auto-commit from developer run_id=264

2026-06-07 07:46:19 +00:00

test_qg.py

fix(staging): tolerate sandbox-infra-only FAILs (C9a/C9b) in deploy-staging verdict

2026-06-07 12:39:00 +00:00

test_queue.py

test(resilience): 34 tests for preflight/classifier/backoff/breaker (ORCH-1)

2026-06-03 00:12:17 +03:00

test_reconciler_plane.py

feat(reconciler): sweeper потерянных webhook (реконсиляция застрявших стадий)

2026-06-06 20:55:25 +00:00

test_reconciler.py

fix(reconciler): skip escalated / Blocked / Needs-Input tasks in F-1

2026-06-07 11:50:02 +00:00

test_resilience.py

test(resilience): 34 tests for preflight/classifier/backoff/breaker (ORCH-1)

2026-06-03 00:12:17 +03:00

test_resolve_agent_effort.py

feat(agents): configurable LLM model + effort per-agent and per-project (ORCH-41) (#36 )

2026-06-05 19:45:19 +03:00

test_resolve_agent_model.py

feat(agents): configurable LLM model + effort per-agent and per-project (ORCH-41) (#36 )

2026-06-05 19:45:19 +03:00

test_review_parse.py

feat(stage-engine): embed verbatim reviewer/tester findings in rollback task_desc

2026-06-06 04:42:11 +00:00

test_stage_engine.py

fix(staging): tolerate sandbox-infra-only FAILs (C9a/C9b) in deploy-staging verdict

2026-06-07 12:39:00 +00:00

test_stage_visibility.py

feat(plane): stage visibility on board + verdict status UUIDs

2026-06-03 18:18:17 +03:00

test_stages.py

developer(ET): auto-commit from developer run_id=192

2026-06-06 21:07:35 +00:00

test_staging_check_b6.py

fix(staging): tolerate sandbox-infra-only FAILs (C9a/C9b) in deploy-staging verdict

2026-06-07 12:39:00 +00:00

test_staging_precondition.py

developer(ET): auto-commit from developer run_id=192

2026-06-06 21:07:35 +00:00

test_status_comment_authorship.py

feat(plane): unified status-comment format with duration line (ORCH-016) (#34 )

2026-06-05 17:50:47 +03:00

test_status_comment_dedup_regression.py

feat(plane): unified status-comment format with duration line (ORCH-016) (#34 )

2026-06-05 17:50:47 +03:00

test_status_comment_duration_db_fallback.py

feat(plane): unified status-comment format with duration line (ORCH-016) (#34 )

2026-06-05 17:50:47 +03:00

test_status_comment_format.py

feat(plane): unified status-comment format with duration line (ORCH-016) (#34 )

2026-06-05 17:50:47 +03:00

test_status_only_verdict.py

feat(webhook): pull reject reason from latest comment

2026-06-03 22:18:24 +03:00

test_status_trigger.py

feat(webhook): pull reject reason from latest comment

2026-06-03 22:18:24 +03:00

test_taskmd_description.py

fix(pipeline): fetch issue name from Plane API on status-trigger start

2026-06-03 22:42:53 +03:00

test_telegram_tracker.py

feat(notifications): add bump mode + russify Telegram live-tracker

2026-06-06 10:13:49 +00:00

test_tracker_bump.py

feat(notifications): add bump mode + russify Telegram live-tracker

2026-06-06 10:13:49 +00:00

test_usage.py

fix(observability): merge-gate on deploy, full token input, Plane Done, artifact links

2026-06-04 11:17:58 +03:00

test_verdict_status.py

feat(webhook): pull reject reason from latest comment

2026-06-03 22:18:24 +03:00

test_webhook_dedup.py

feat(webhook): start pipeline on In Progress status (not on create)

2026-06-03 18:18:26 +03:00

test_webhooks.py

test: migrate sequential_ids test to In Progress contract

2026-06-04 22:38:09 +03:00