Morning Review — 2026-04-17
Recent Commits (last 24h)
5 commits merged — focused on data integrity and concurrency safety:
| Commit | Issue | Description |
|---|---|---|
d4b36da2 | #2736 | Stuck-task recovery — stop swallowing resolve_task_id errors; fixes stale routing fields. |
354f05c4 | #2735 | Review subscriber — distinguish DB errors from stale status. |
c8d63bb2 | #2734 | Worktree scan — stay resilient when dir entry reads fail. |
d90c2854 | #2730 | Router lock — avoid holding read lock across dispatch awaits. |
567678b1 | #2728 | Backend merge — merge external tasks when store is internal-only. |
Operational Health
Service
- Version mismatch recurred: CLI
0.69.25, Service0.69.27. This is the third consecutive day of mismatch despite yesterday's evening retro claiming the problem was "resolved."- Apr 15 morning: 0.69.15 vs 0.69.18
- Apr 16 morning: 0.69.15 vs 0.69.18 (still pending at morning)
- Apr 16 evening: claimed fixed at 0.69.25
- Apr 17 morning: 0.69.25 vs 0.69.27
- Fix:
brew upgrade orch && brew services restart orch - Logs: clean tick cycle (~1.5–1.9s), no persistent errors
- One watchdog stall alert (70s > 60s threshold) during this morning's dispatch — caused by glm:haiku returning 403, LLM budget exceeded, falling back to round-robin. Expected behavior under routing failure, not a real stall.
Agent Health (24h)
| Agent | Model | Success | Failed | Rate limit | Notes |
|---|---|---|---|---|---|
| claude | sonnet | 31 | 12 | 0 | Main workhorse; 12 failures vs 31 successes = 72% |
| minimax | opus | 24 | 1 | 5 | Strong; rate limits on review runs |
| glm | opus | 20 | 2 | 1 | Solid performance after runner-registry fix |
| opencode | minimax-m2.5-free | 18 | 0 | 0 | Clean run |
| codex | gpt-5.3-codex | 17 | 0 | 0 | Good medium-task results |
| opencode | gpt-5-mini | 9 | 0 | 0 | Best github-copilot model |
| claude | sonnet | 12 | 3 | 0 | Another claude/sonnet window (failed) |
| opencode | gpt-5.4 | 0 | 3 | 0 | github-copilot still broken |
| opencode | claude-sonnet-4.6 | 0 | 3 | 0 | github-copilot still broken |
| opencode | gemini-3.1-pro-preview | 0 | 3 | 0 | github-copilot still broken |
| opencode | nemotron-3-super-free | 7 | 1 | 1 | 6 parse errors |
| minimax | opus | 5 | 1 | 5 | Rate limits confirmed |
| codex | gpt-5.2-codex | 0 | 1 | 0 | Model unavailable |
| glm | opus | 2 | 2 | 1 | Mixed |
Notable improvements vs Apr 16:
- claude/sonnet at 72% (was 73% → consistent)
- glm/opus much improved: 20 successes, only 2 failures (was 12 successes / 5 failures)
- codex/gpt-5.3-codex at 100% (was good)
- opencode/gpt-5-mini at 100% (still best copilot)
- nemotron parse errors down to 6 (was 5 in 12h yesterday — rate consistent)
- github-copilot gpt-5.4, claude-sonnet-4.6, gemini: still 0% each
Active Cooldowns
| Key | Remaining | Reason |
|---|---|---|
| kimi | 4d23h | Billing cycle exhausted |
| glm:haiku | 3h58m | Persisted |
Stuck / Blocked Tasks
- No open GitHub issues.
- Only active tasks: this morning review (
internal:145911), morning-briefing (internal:145912), twitter-trending-watch (internal:145913). - No blocked or stuck external tasks.
Retro Follow-ups
| Priority from Apr 16 Evening | Status |
|---|---|
| Fix version mismatch | Re-broken again — CLI 0.69.25 vs Service 0.69.27. Pattern: every day a new push lands between evening and next morning. Needs brew upgrade && restart. |
| Post-fix health signals (#2720, #2723) | Confirmed improved — glm/opus went from 64% to 91%, claude/sonnet stable at 72%. Placeholder-error noise gone. |
| github-copilot investigation | Still unresolved — gpt-5-mini (100%) is fine; gpt-5.4, claude-sonnet-4.6, gemini (all 0%) still failing. |
| nemotron parse errors | Still occurring — 6 parse errors in 24h (vs 5 in 12h yesterday, rate similar). Not worsening. |
| Monitor stream changes (#2717, #2712) | Not confirmed — orch stream --pipe and same-length diffing were deployed but today's review didn't exercise streaming enough to verify. |
Task Activity (12h)
| Event | Count |
|---|---|
| status_change | 631 |
| dispatch | 211 |
| branch_delete | 144 |
| push | 137 |
| routed | 101 |
| review_start | 72 |
| review_decision | 65 |
| pr_create | 61 |
| error | 41 |
| rerouted | 12 |
Throughput is healthy and consistent with yesterday. Error rate (41 / 211 = 19%) is slightly lower than yesterday's 23%, consistent with github-copilot models being handled by cooldowns.
Priorities Today
Fix version mismatch —
brew upgrade orch && brew services restart orch. This is the fourth consecutive day this has been flagged. Service is 2 patch versions behind.Verify stream changes — tomorrow's review should confirm
orch stream --pipeand same-length output diffing work correctly with no regressions.github-copilot routing — gpt-5-mini is the only healthy copilot model. gpt-5.4, claude-sonnet-4.6, and gemini-3.1-pro-preview are all at 0% and should remain excluded via cooldown.
nemotron parse errors — 6 parse errors in 24h. If this rate persists or worsens, inspect raw outputs and file a root-cause issue.
Notes
- The recurring version mismatch is the most actionable item. The service keeps falling behind because
brew upgradeis not being run regularly enough after pushes to main. - No new GitHub issues to file. All observable problems are either covered by existing retro priorities or handled by the generic cooldown/error-classification system.
- The 70s watchdog stall during dispatch was benign — it was caused by the router correctly falling through its pool (glm:haiku returned 403) and exhausting its LLM budget before falling back to round-robin. The stall threshold (60s) is too tight for slow model routing. This is not a bug.
Prepared by Orch automation (internal task internal:145911).