orchestrator

Author	SHA1	Message	Date
dev-agent	4e4cc6c724	fix(qg): find 14-deploy-log.md in origin/main when absent in feature worktree ET-013: deployer writes 14-deploy-log.md and merges deploy artifacts into main via a separate PR, so the log lands in origin/main, not the feature branch worktree that check_deploy_status reads via _repo_path(repo, branch). Result: every successful deploy was falsely failed (Deploy log not found) and rolled back deploy->development. Fix: when the log is absent in the worktree, fall back to reading it from origin/main on the shared clone (git fetch origin main + git show origin/main:docs/work-items/<WI>/14-deploy-log.md). Lookup order: worktree -> origin/main -> not found. Fetch/show failures degrade to not found (never raise). Does not touch the merge-gate in gitea.py. Tests: origin/main SUCCESS->PASS (ET-013 case), origin/main FAILED->FAILED, absent everywhere->not found, fetch failure->degrades no exception, worktree log short-circuits main lookup.	2026-06-04 13:35:35 +03:00
dev-bot	ec9aa74492	fix(tracker): no duplicate Telegram messages on not-modified/transient edits edit_telegram now returns a distinguishable outcome (ok\|not_modified\|gone\| failed) instead of a bare bool. update_task_tracker only sends a NEW message when the original is truly gone; not_modified and transient failures no longer spawn duplicate trackers or orphan the live one. render_task_tracker shows "попытка N" on an actively re-run stage (>=2 agent runs) so the text changes between review<->development cycles. Finished (✅) lines are unchanged. Tests: edit_telegram classification (ok/not_modified/gone/failed via mocked httpx), update_task_tracker (not_modified/failed -> no send, gone -> send+id), render attempt marker.	2026-06-04 13:20:40 +03:00
dev-bot	9a0298de9d	feat(telegram): live editable task tracker (Variant B+), replace 15-message spam Replace the ~15 separate Telegram messages per task (agent start/finish, stage transition, QG-pending, tech noise) with ONE live tracker message edited in place (editMessageText) on every stage transition. Only attention-worthy events are still sent as SEPARATE, notifying messages: approve-gate, deploy-fail, agent-fail, task error. - db.py: idempotent ALTERs — tasks.tracker_message_id, tasks.title, tasks.brd_review_started_at/ended_at, agent_runs.model. Helpers for tracker message_id + BRD-review clock. - usage.py: short_model_name() (strip provider/claude- prefix); parse model from result-JSON modelUsage; record_usage persists model. - notifications.py: render_task_tracker(task_id) (stateless render from agent_runs), update_task_tracker (sendMessage->store id->editMessageText with fallback to a new message, silent), edit_telegram(). Per-stage line in↓/out↑·cost·model, ⏸️ Ревью БРД (human time), 💰 totals, finish block (⏱️ wall/agents/yours, 🔗 PR · 📦). notify_* are now tracker-only/log-only except the four alerts. - stage_engine.py: stamp brd_review_ended on analysis->architecture advance. - webhooks/plane.py: persist task title on creation. - tests/test_telegram_tracker.py: render, short_model_name, send/edit/fallback, separate-vs-silent alert behavior.	2026-06-04 11:42:46 +03:00
Dev Agent	61e26a8930	fix(observability): merge-gate on deploy, full token input, Plane Done, artifact links 1. BUG 8 (second door): merge webhook no longer fake-completes a task at the deploy stage; done is gated by the deployer verdict (check_deploy_status). Other stages keep merge->done. 2. Token accounting: parse+persist cache_creation_input_tokens (new idempotent agent_runs column). usage_comment / task_summary now show the FULL input (input + cache_read + cache_creation) with a cached breakdown. cost_usd untouched. 3. deploy->done success now forces the Plane issue to terminal Done state. 4. All agents (architect/developer/reviewer/tester/deployer) attach artifact links to their finish comment via gitea_public_url. Tests added for each fix; pytest 244 passed / 9 failed (off-limits HMAC group).	2026-06-04 11:17:58 +03:00
dev-agent	e4a9c48395	fix(deploy): gate deploy->done on deployer verdict, not LLM exit code	2026-06-04 02:43:01 +03:00
Dev Agent	3a285de11d	fix(ci): bounce task back to developer on red CI (capped retries)	2026-06-04 01:39:40 +03:00
Dev Agent	e15d339b14	fix(qg): use check_ci_green instead of local tests on development stage	2026-06-04 01:22:43 +03:00
orchestrator-dev	90c9ffe839	fix(qg): run pytest directly instead of make in check_tests_local	2026-06-04 00:43:04 +03:00
Dev Agent	0b8013cb06	fix(stage): approved verdict advances analysis->architecture instead of re-running gate	2026-06-03 23:30:08 +03:00
Dev Agent	ca63bc26bb	feat(config): external gitea_public_url for clickable doc links	2026-06-03 22:58:18 +03:00
dev-agent	a9cdb17614	feat(plane): analyst comment asks for Approved status + links docs The analyst ready-comment used the obsolete :approved: wording (comment-based approve was removed in PR #12). Rewrite it for the status-only model: ask the stakeholder to move the issue to Approved (reject = reason comment + Rejected), and add clickable Gitea links to the analyst docs that actually exist in the worktree.	2026-06-03 22:42:53 +03:00
dev-agent	96c5e6b2f9	fix(pipeline): fetch issue name from Plane API on status-trigger start issue.updated ships only the changed fields, so name was absent and the branch slug became feature/<id>-untitled. Add fetch_issue_fields (single issue-detail GET returning name+description, reusing the endpoint/token of fetch_issue_description) and pull the name above the slug build. Empty name still falls back to untitled.	2026-06-03 22:42:53 +03:00
Dev Agent	857bad314c	feat(webhook): pull reject reason from latest comment handle_verdict(rejected): the reason is now pulled from the issue latest Plane comment (_latest_comment_reason: GET comments, newest by created_at, HTML stripped) instead of a fixed stub. Slava writes the reason in a comment before flipping the status to Rejected. Falls back to a fixed note when there is no comment / the API call fails. tests: add test_status_only_verdict.py (test_inreview_comment_does_not_revert [bug 3 root], test_any_comment_no_pipeline_action, test_approved_status_advances_without_inprogress_reset, test_rejected_status_pulls_reason_from_comment) and test_inprogress_from_needs_input_relaunches_analyst in test_status_trigger.py. Rewrote the comment-based tests (test_verdict_status, test_plane_approved/ rejected in test_webhooks) under the status-only model: comments are no-ops, verdicts come from status changes.	2026-06-03 22:18:24 +03:00
Dev Agent	c69e11348b	test(pipeline): cover status-start description fetch and work_item_id uniqueness - test_status_start_fetches_description: empty payload description -> pulled from Plane API (mocked) -> QG-0 passes, analyst enqueued. - test_status_start_empty_api_still_blocks: empty API -> honest QG-0 fail. - test_work_item_id_uniqueness: ET-006 taken -> next free id, per-repo isolation. - test_collision_reassigns_in_start_pipeline: end-to-end collision reassignment. - test_worktree_per_task: two tasks never share a worktree path.	2026-06-03 21:12:59 +03:00
Dev Agent	7fd6529a35	test(conftest): mute Telegram in all tests to stop prod leakage A pytest run on prod was sending REAL Telegram messages to Slava: some tests (e.g. test_webhook_dedup advancing a stage) reach notify_stage_change -> send_telegram, which read the live .env token/chat_id and actually POSTed. Add an autouse fixture stubbing send_telegram to a no-op for every test. Patch the SOURCE src.notifications.send_telegram (covers all notify_* helpers and the many modules that do a local from .notifications import send_telegram inside functions) AND src.stage_engine.send_telegram (module-level binding, would not be intercepted by the source patch alone). webhooks/plane, launcher, queue_worker are patched defensively with raising=False. Verified: full suite run with FAKE telegram creds + an un-swallowable httpx.post trip-wire (BaseException, so send_telegram except Exception can not hide it) shows ZERO calls to api.telegram.org. Without the fixture the trip-wire fires, proving the guard is real.	2026-06-03 18:23:09 +03:00
Dev Agent	9a702a0216	feat(metrics): per-agent token/cost accounting Feature 4. claude is now launched with --output-format json; the run-log trailing result JSON is parsed (defensively, never fatal) for usage + total_cost_usd. New idempotent ALTERs add input_tokens/output_tokens/cache_read_tokens/cost_usd to agent_runs; the launcher monitor records usage per run, posts a per-agent finish comment under that agent bot (e.g. Developer gotov · 45.2k in / 12.1k out · $0.21), and the deployer posts an end-of-task summary (SUM over agent_runs GROUP BY agent) on done. New src/usage.py holds parse/format/record/summary helpers; test_usage.py covers parsing a real CLI JSON blob, NULL-on-garbage, recording, formatting, and the per-task aggregate.	2026-06-03 18:18:46 +03:00
Dev Agent	38a741d24e	feat(webhook): verdict via Approved/Rejected statuses (variant B) Feature 2. The issue updated dispatch (shipped with the status-trigger handler) also routes Approved -> _try_advance_stage (== :approved: comment) and Rejected -> _rollback_stage (== :rejected: comment). The :rejected: comment branch was refactored into the shared _rollback_stage so both mechanisms behave identically; a status reject passes Reason: (rejected via status, see latest comment) since no inline reason arrives with a status change. Comments stay fully working. This commit adds test_verdict_status.py proving both status and comment paths funnel into the same advance/rollback logic.	2026-06-03 18:18:36 +03:00
Dev Agent	09b1c5e1b9	feat(webhook): start pipeline on In Progress status (not on create) Feature 1. work_item.created no longer starts the pipeline (soft QG-0 log only); the issue stays in the backlog until moved to In Progress. The pipeline-start body is extracted into start_pipeline(); a new issue updated handler routes a state change to In Progress -> handle_status_start, which is idempotent: an existing task for the plane_id is NOT re-created or restarted (protects handle_comment, which also flips issues to In Progress). Real Plane payload: event=issue, action=updated, data.state.id. Existing m6/plane_webhook/dedup tests updated to drive the new trigger; new test_status_trigger.py covers created-no-op / start / idempotent.	2026-06-03 18:18:26 +03:00
Dev Agent	a4668c0303	feat(plane): stage visibility on board + verdict status UUIDs Feature 3 + Feature 2 infra. Extend the global PLANE_STATES with the 6 new enduro status UUIDs (architecture/development/review/testing + approved/rejected), remap STAGE_TO_STATE so the 4 mid-pipeline stages move the issue across its own board column instead of all sitting in In Progress, and add the set_issue_stage_state() helper. Needs Input / In Review / Blocked keep their own explicit setters and stay higher priority. TODO(ORCH-10): statuses are per-project; resolve per project when more projects are onboarded.	2026-06-03 18:18:17 +03:00
Dev Agent	d305521067	feat(plane): per-agent bot authorship for comments add_comment now accepts an optional author (agent role) and POSTs under the matching Plane bot token via _headers_for(), so Plane shows the real author (Analyst/Architect/Developer/Reviewer/Tester/Deployer/Stream) instead of a single shared account. Unknown/empty roles or missing tokens fall back to the shared orchestrator token (autonomy preserved). GET/PATCH (find_issue_id, set_state) are unchanged and stay on the shared token. Call sites in stage_engine, launcher, webhooks/plane and the plane_sync notify helpers now pass author by stage role; stage transitions use stream. Adds tests/test_plane_author.py.	2026-06-03 10:53:25 +03:00
Dev Agent	1d978caea7	feat(webhook): derive work_item_id from Plane sequence_id (M-6)	2026-06-03 10:02:15 +03:00
Dev Agent	0653c2437f	feat(launcher): prune old run logs (L-2)	2026-06-03 09:53:55 +03:00
Dev Agent	4ac449ff63	test(webhook): cover delivery dedup + migration safety (M-7)	2026-06-03 09:18:02 +03:00
Dev Agent	6abdc220d2	test(stage): cover unified stage_engine + launcher/plane delegation 18 tests: happy-path advance per stage with correct agent (ORCH-4 fix), QG-fail no-advance, reviewer REQUEST_CHANGES rollback+retry/alert, tester FAIL rollback+retry/block, architect conflict rollback to analysis, analyst approved-flow no-advance, and launcher+plane both delegating to the engine.	2026-06-03 08:56:25 +03:00
Dev Agent	c167c6930d	test(launcher): watchdog graceful kill ordering + timeout config + M-4 removal Cover M-2: SIGTERM-before-SIGKILL ordering, graceful exit within grace skips SIGKILL, ProcessLookupError before SIGTERM is tolerated (no _record_kill), and _resolve_timeout per-agent override / default / malformed-JSON fallback. Cover M-4: _auto_merge_pr removed, _ensure_pr retained.	2026-06-03 08:28:09 +03:00
Dev Agent	a613fd8180	test(resilience): 34 tests for preflight/classifier/backoff/breaker (ORCH-1) Covers preflight FAIL->queued + cache, transient/permanent classifier + Retry-After, exp backoff + available_at gating, launcher transient vs permanent finalize, circuit breaker open/half-open/closed. test_queue worker tests stub preflight OK. Popen never spawned.	2026-06-03 00:12:17 +03:00
Dev Agent	2283b8898b	test(queue): 19 tests for job queue lifecycle/atomicity/retry/worker (ORCH-1) Covers enqueue->claim->mark, atomic claim (no double dispatch, 8-thread race), retry fail->queued->failed, requeue_running_jobs, observability, worker max_concurrency. Popen fully mocked (no real agent spawned).	2026-06-02 23:58:44 +03:00
Dev Agent	c1f35a2047	test(projects,webhook): cover registry resolvers + project filter ORCH-6: test_projects.py covers resolvers and ORCH_PROJECTS_JSON parsing (valid/malformed/fallback). test_plane_webhook.py covers the webhook project filter via TestClient (unknown->ignored, orchestrator->orchestrator repo, enduro->enduro-trails, independent ORCH/ET prefixes); launcher mocked. test_webhooks.py: register proj-1 so existing ET fixtures pass.	2026-06-02 22:30:51 +03:00
Dev Agent	1ebe8afc23	feat(worktree): git worktree per task to isolate shared /repos (ORCH-2 / S-4) - add src/git_worktree.py: ensure/remove/get_worktree_path - config: worktrees_dir=/repos/_wt - launcher: agent runs in per-branch worktree; task-file + commit/push in worktree; no shared checkout - qg/checks: read artifacts + run make test from worktree (branch arg, backward-compatible) - webhooks/plane: pass branch into QG dispatch; review fallback from worktree - webhooks/gitea: keep read-only branch --contains in main clone (documented) - tests: test_git_worktree.py (isolation) + update test_launcher write-task-file - docs: ARCHITECTURE worktree section + BUGFIXES_2026-06-02_ORCH2 Preserves B-1/B-2/S-1/S-5 fixes (paths now point at worktree).	2026-06-02 21:12:06 +03:00
Dev Agent	67b9f814b5	test(launcher): cover _write_task_file and reviewer verdict parsing (L-5)	2026-06-02 20:12:29 +03:00
claude-bot	51f7364532	feat: integrate Analyst into Plane/Orchestrator pipeline - Add git fetch+checkout in agent launch cmd (ensures correct branch) - Add git fetch+checkout in _monitor_agent before commit/push - Post start comment in Plane when analyst launches - Post :approved: request comment after analyst completes successfully - Branch lookup moved before cmd construction for reuse	2026-05-31 20:15:01 +03:00
Dev Agent	b545665e2d	feat: full pipeline fixes - CI status branch lookup, review webhook routing, auto-advance, plane sync - handle_ci_status: fallback git branch -r --contains when branches[] empty - webhook router: handle pull_request_approved event type - handle_pr: map review.type to review.state for new Gitea format - launcher: auto-advance stage after agent completion (_try_advance_stage) - plane_sync: notify Plane on stage changes - stages.py: stage machine with QG definitions - notifications.py: stage change notifications - safe.directory fix for container git operations	2026-05-22 01:57:02 +03:00
Dev Agent	3116ae67bb	chore: clean up .gitignore, remove cached files from tracking	2026-05-19 15:58:45 +03:00
Dev Agent	95072e000f	fix: tests — add setup_db fixture for init_db in test env	2026-05-19 15:58:37 +03:00
Dev Agent	daf8cdad9e	feat: orchestrator MVP — webhooks, agent launcher, QG checks	2026-05-19 15:57:00 +03:00

35 Commits