Skip to content

fix(review+routing): reviewer uses latest user question, routing igno…#185

Merged
PetrAnto merged 1 commit intomainfrom
claude/review-model-sync-gVjuw
Feb 25, 2026
Merged

fix(review+routing): reviewer uses latest user question, routing igno…#185
PetrAnto merged 1 commit intomainfrom
claude/review-model-sync-gVjuw

Conversation

@PetrAnto
Copy link
Owner

…res conversation length

Two bugs found during Phase 7B.6 benchmark:

  1. extractUserQuestion() iterated forward and returned the FIRST user message. In multi-turn conversations the reviewer evaluated the assistant's answer against the wrong question (e.g. "capital of France" instead of "read README.md and summarize"). Fixed by iterating backwards. Also skips 7B.4 file-injection blocks.

  2. Model routing used classifyTaskComplexity(msg, conversationLength) which gates on conversationLength >= 3 → 'complex', preventing simple messages from routing to fast models in longer conversations. Fixed by passing conversationLength=0 for routing decisions so only message content determines complexity.

https://claude.ai/code/session_01K2mQTABDGY7DnnposPdDjw

…res conversation length

Two bugs found during Phase 7B.6 benchmark:

1. extractUserQuestion() iterated forward and returned the FIRST user
   message. In multi-turn conversations the reviewer evaluated the
   assistant's answer against the wrong question (e.g. "capital of
   France" instead of "read README.md and summarize"). Fixed by
   iterating backwards. Also skips 7B.4 file-injection blocks.

2. Model routing used classifyTaskComplexity(msg, conversationLength)
   which gates on conversationLength >= 3 → 'complex', preventing
   simple messages from routing to fast models in longer conversations.
   Fixed by passing conversationLength=0 for routing decisions so only
   message content determines complexity.

https://claude.ai/code/session_01K2mQTABDGY7DnnposPdDjw
@chatgpt-codex-connector
Copy link

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.
To continue using code reviews, you can upgrade your account or add credits to your account and enable them for code reviews in your settings.

@PetrAnto PetrAnto merged commit 9f3cd1d into main Feb 25, 2026
@github-actions
Copy link

E2E Test Recording (workers-ai)

❌ Tests failed

E2E Test Video

@github-actions
Copy link

E2E Test Recording (base)

❌ Tests failed

E2E Test Video

@github-actions
Copy link

E2E Test Recording (discord)

❌ Tests failed

E2E Test Video

@github-actions
Copy link

E2E Test Recording (telegram)

❌ Tests failed

E2E Test Video

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants