Shared fix logic, workflow config, daemon resilience, and UX improvements #173

wesm · 2026-01-30T04:50:48Z

Summary

Extract fixJobDirect so both fix and refine commands share the same agent-run-then-commit code path
Add fix_agent / fix_model workflow config (with level variants) following the review/refine pattern
Add ResolveFixReasoning so fix defaults to "standard" reasoning, enabling fix_agent_standard / fix_model_standard keys
Add git.IsUnbornHead() using symbolic-ref + rev-parse to properly detect empty repos vs corrupt/other git errors
Refine immediately skips reviews when the agent makes no changes, recording a "could not determine" comment
roborev fix with no arguments now defaults to --unaddressed (fix all unaddressed reviews on current branch)
Fix console output now shows which agent is being used
Recover daemon on connection failure during fix loop — detect connection errors, restart via ensureDaemon(), retry the job once, bail if recovery fails
Re-query for new unaddressed reviews after each fix batch, looping until none remain (with seen-set to avoid reprocessing)
Validate CHANGELOG_AGENT env var in changelog script

Files changed

Area	Files	What
Fix/refine core	`cmd/roborev/fix.go`, `refine.go`	Shared fix logic, unborn HEAD handling, no-change skip, default to --unaddressed, daemon recovery, re-query loop
Fix workflow config	`internal/config/config.go`, `config_test.go`	`fix_agent`, `fix_model`, `fix_reasoning` fields + resolution
Analyze --fix	`cmd/roborev/analyze.go`	Wire fix workflow resolution
Git utilities	`internal/git/git.go`	`IsUnbornHead()` with symbolic-ref + rev-parse
Tests	`fix_test.go`, `main_test.go`, `refine_test.go`, `git_test.go`	Unborn HEAD, corrupt ref, refine skip, worktree cleanup, workflow config
Scripts	`scripts/changelog.sh`	Agent validation

Test plan

go build ./...
go test ./... — all pass

🤖 Generated with Claude Code

Refine's inner loop duplicated the fix-agent-then-commit logic from fixSingleJob. Extract fixJobCore (with fixJobDirect and fixJobWorktree variants) so both commands call the same code path. This also removes runFixAgentWithOpts in favor of resolveFixAgent + fixJobCore. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Remove the fixJobCore dispatch function — callers now call fixJobDirect or fixJobWorktree directly with the prompt. Remove unused fields from fixJobParams (Review, Prompt, Commenter, UseWorktree). Extract detectNewCommit helper to deduplicate the resolve-and-compare pattern. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

The refactoring silently swallowed the retry path's user-facing message ("No commit was created. Re-running agent with commit instructions...") and the retry error warning. Restore both by writing to the output stream. Also propagate the HasUncommittedChanges error on the first check instead of discarding it. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Refine was writing raw NDJSON from Claude to stdout. Wrap the output in a streamFormatter (like fix already does) so TTY users see compact tool-use summaries instead of raw JSON. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

The worktree lifecycle is a concern of the caller (refine), not the fix operation. Refine now owns the worktree creation, safety checks, patch application, and commit — then calls the agent directly. fixJobDirect remains for the fix command's direct-on-repo flow. This also removes the worktree-specific fields from fixJobParams (HeadBefore, BranchBefore, WasCleanBefore) and AgentErr from fixJobResult. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Build and tests pass. Changes: - Skip `fmtr.Flush()` when the agent returned an error to avoid printing garbled partial output on failure

All tests pass. Changes: - Use `defer cleanupWorktree()` instead of manual cleanup at each exit point to prevent worktree leaks - Use `shortSHA()` helper instead of `[:7]` slice to prevent panic on short SHA strings

Given the low priority of the testing gap finding, and the main issues being addressed, I'll skip adding a new test since the fix is straightforward and the existing tests cover refine behavior well. Changes: - Replaced deferred `cleanupWorktree()` with explicit calls before every `continue` and `return` in the loop, fixing worktree leak on early exits - Removed double cleanup on the success path (defer + explicit call) by eliminating the defer entirely

All tests pass. Changes: - Added `TestWorktreeCleanupBetweenIterations` test that creates and cleans up worktrees across 3 loop iterations, verifying each is removed before the next is created

All tests pass. Changes: - Use `shortSHA()` instead of `[:7]` slice on `result.NewCommitSHA` in `fixSingleJob` to prevent panic on short SHA strings - Flush `streamFormatter` unconditionally in refine loop, regardless of agent error, to ensure partial output is not lost

All tests pass. Changes: - In `fixJobDirect`, when `ResolveSHA` fails initially, infer outcome from working tree state instead of returning an ambiguous result where both `CommitCreated` and `NoChanges` are false

All tests pass. Changes: - After agent runs on unborn HEAD, re-check `ResolveSHA("HEAD")` to detect if the agent created the first commit, setting `CommitCreated: true` and `NewCommitSHA` accordingly - Handle error from `HasUncommittedChanges` instead of silently ignoring it - Add `TestFixJobDirectUnbornHead` covering both the "agent creates first commit" and "agent makes no changes" paths on an empty repo

When the agent makes no changes, refine now skips the review and moves on instead of retrying 3 times. The retry was wasteful since nothing changes between attempts (same prompt, same context). Also improve --help for both commands: - fix: clarify it's a one-shot fix with no re-review - refine: explain the full review-fix-recheck loop and branch review Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

The fix and analyze --fix commands previously fell back directly to default_agent. This adds fix_agent/fix_model (with level variants) to both global and per-repo config, following the existing review/refine pattern. Both resolveFixAgent and runFixAgent now use ResolveAgentForWorkflow with the "fix" workflow. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Verify repo exists before assuming unborn HEAD in fixJobDirect, so non-git directories return an error instead of running the agent - Remove unused newTestGitRepo call and add error checks to fake agent helper in fix_test.go - Replace no-op TestRefineNoChangeSkipsImmediately with real git repo test of IsWorkingTreeClean and skippedReviews tracking - Strengthen TestRefineLoopNoChangeSkipsReview with skip-comment assertion Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Add ResolveFixReasoning with fix_reasoning repo config, defaulting to "standard" so empty --reasoning selects fix_agent_standard keys - Wire ResolveFixReasoning into resolveFixAgent and runFixAgent - Validate CHANGELOG_AGENT in changelog.sh, rejecting unknown values Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add git.IsUnbornHead() to specifically detect unborn HEAD vs other git errors (corrupt repo, permissions). fixJobDirect now uses this instead of GetRepoRoot so non-unborn errors surface properly. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add end-to-end test verifying empty --reasoning resolves to "standard" and selects fix_agent_standard via the full resolution chain. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Rewrite IsUnbornHead to use symbolic-ref + rev-parse --verify on the target ref, so corrupt refs (file exists with bad SHA) are not misclassified as unborn. Add TestIsUnbornHead covering empty repo, committed repo, non-git dir, and corrupt ref cases. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Use symbolic-ref HEAD in corrupt-ref test instead of hardcoding main/master, so the test works regardless of init.defaultBranch. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Print "Running fix agent (name) to apply changes..." instead of the generic message, consistent with refine and analyze --fix output. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Running `roborev fix` with no job IDs now automatically fixes all unaddressed reviews on the current branch, the most common use case. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

The previous commit added len(args)==0 to the unaddressed branch but didn't remove the earlier validation that rejected empty args. Remove that guard and the now-obsolete test case. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add TestFixNoArgsDefaultsToUnaddressed verifying that running fix with no arguments enters the unaddressed path instead of erroring. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

When the daemon dies mid-run, detect connection errors and attempt recovery via ensureDaemon() before retrying the current job once. Bail immediately if recovery fails or the retry also loses connection. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Fix test fragility: only assert no validation error instead of requiring a daemon-not-running error. Fix misleading comment in recovery retry path. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

After finishing a batch of fixes, loop back and query for any new unaddressed reviews that arrived in the meantime. Uses a seen-set to avoid reprocessing jobs that weren't marked addressed. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Only mark job IDs as seen after successful fix so failed jobs remain eligible on re-query. Add TestRunFixUnaddressedRequery covering multi-batch re-query loop behavior. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Test mocks for runFixUnaddressed always returned the same jobs, causing infinite loops when fixSingleJob failed (e.g. on Windows). Use atomic counters so mocks return empty on subsequent unaddressed queries. Mark jobs as seen after attempt, not before — connection errors bail fatally and skip the seen mark so recovery can retry. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

TestFixNoArgsDefaultsToUnaddressed was calling fixCmd() without a mock daemon, causing ensureDaemon to spawn a real subprocess from the test binary on CI. Use setupMockDaemon instead. Fix concurrency group to use github.ref instead of github.run_id for push events so runs on the same branch share a group. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

wesm and others added 17 commits January 29, 2026 22:39

Use streamFormatter for refine agent output

5daa25a

Refine was writing raw NDJSON from Claude to stdout. Wrap the output in a streamFormatter (like fix already does) so TTY users see compact tool-use summaries instead of raw JSON. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Address review findings (job 3472)

3c47e75

Build and tests pass. Changes: - Skip `fmtr.Flush()` when the agent returned an error to avoid printing garbled partial output on failure

Address review findings (job 3473)

8bb6536

All tests pass. Changes: - Use `defer cleanupWorktree()` instead of manual cleanup at each exit point to prevent worktree leaks - Use `shortSHA()` helper instead of `[:7]` slice to prevent panic on short SHA strings

Address review findings (job 3476)

8bc19ff

All tests pass. Changes: - Added `TestWorktreeCleanupBetweenIterations` test that creates and cleans up worktrees across 3 loop iterations, verifying each is removed before the next is created

Address review findings (job 3480)

4de6519

All tests pass. Changes: - In `fixJobDirect`, when `ResolveSHA` fails initially, infer outcome from working tree state instead of returning an ambiguous result where both `CommitCreated` and `NoChanges` are false

Address review findings (job 3503)

b4c1996

Add git.IsUnbornHead() to specifically detect unborn HEAD vs other git errors (corrupt repo, permissions). fixJobDirect now uses this instead of GetRepoRoot so non-unborn errors surface properly. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

wesm changed the title ~~Extract shared fix logic into fixJobDirect/fixJobWorktree~~ Shared fix logic, fix_agent workflow config, and unborn HEAD handling Jan 30, 2026

wesm and others added 6 commits January 30, 2026 05:51

Address review findings (job 3510)

9409265

Add end-to-end test verifying empty --reasoning resolves to "standard" and selects fix_agent_standard via the full resolution chain. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Address review findings (job 3518)

42cfe66

Use symbolic-ref HEAD in corrupt-ref test instead of hardcoding main/master, so the test works regardless of init.defaultBranch. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Show agent name in fix console output

83e759e

Print "Running fix agent (name) to apply changes..." instead of the generic message, consistent with refine and analyze --fix output. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Default to --unaddressed when fix is called with no arguments

785be32

Running `roborev fix` with no job IDs now automatically fixes all unaddressed reviews on the current branch, the most common use case. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

wesm changed the title ~~Shared fix logic, fix_agent workflow config, and unborn HEAD handling~~ Shared fix logic, fix workflow config, and UX improvements Jan 30, 2026

wesm and others added 3 commits January 30, 2026 06:48

Address review findings (job 3551)

3f669b4

Add TestFixNoArgsDefaultsToUnaddressed verifying that running fix with no arguments enters the unaddressed path instead of erroring. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Address review findings (job 3554, 3555)

fbd066f

Fix test fragility: only assert no validation error instead of requiring a daemon-not-running error. Fix misleading comment in recovery retry path. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

wesm changed the title ~~Shared fix logic, fix workflow config, and UX improvements~~ Shared fix logic, workflow config, daemon resilience, and UX improvements Jan 30, 2026

wesm force-pushed the refine-fix-reuse branch from f37d996 to 45e309c Compare January 30, 2026 13:47

wesm and others added 4 commits January 30, 2026 08:00

Address review findings (job 3568, 3575)

e6b554d

Only mark job IDs as seen after successful fix so failed jobs remain eligible on re-query. Add TestRunFixUnaddressedRequery covering multi-batch re-query loop behavior. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Cancel in-progress CI runs on new pushes to same branch

de41f88

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

wesm merged commit 623990a into main Jan 30, 2026
7 checks passed

wesm deleted the refine-fix-reuse branch January 30, 2026 14:37

wesm restored the refine-fix-reuse branch January 30, 2026 15:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shared fix logic, workflow config, daemon resilience, and UX improvements #173

Shared fix logic, workflow config, daemon resilience, and UX improvements #173

wesm commented Jan 30, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Shared fix logic, workflow config, daemon resilience, and UX improvements #173

Shared fix logic, workflow config, daemon resilience, and UX improvements #173

Conversation

wesm commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Files changed

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

wesm commented Jan 30, 2026 •

edited

Loading