Simplified Testing Strategy for Regression Prevention #1379
Open: AndyMik90 wants to merge 24 commits into develop from auto-claude/141-simplify-testing-strategy-to-prevent-actual-regres
+7,780
−302
24 commits
9b0157d (AndyMik90) auto-claude: subtask-1-1 - Add CLI detection tests for Claude/Node/Py…
39d2100 (AndyMik90) auto-claude: subtask-1-2 - Add path handling edge case tests
bb0f68c (AndyMik90) auto-claude: subtask-1-3 - Add token decryption tests for all platfor…
0165dde (AndyMik90) auto-claude: subtask-1-4 - Add frontend platform tests for npm/npx co…
1404c0e (AndyMik90) auto-claude: subtask-2-1 - Create test_agent_flow.py with planner to …
b921de4 (AndyMik90) auto-claude: subtask-2-2 - Add subtask completion detection tests to …
9e31a80 (AndyMik90) auto-claude: subtask-2-3 - Add QA loop tests for fixer interaction an…
6301059 (AndyMik90) auto-claude: subtask-2-4 - Add worktree isolation tests to verify con…
09cb52e (AndyMik90) auto-claude: subtask-3-1 - Expand test_recovery.py with session check…
b8db5f2 (AndyMik90) auto-claude: subtask-3-2 - Expand test_implementation_plan.py with JS…
fd8f870 (AndyMik90) auto-claude: subtask-3-3 - Add tests for edge cases in plan state tra…
f5e6285 (AndyMik90) auto-claude: subtask-4-1 - Create test_review_verdict.py with verdict…
6f18751 (AndyMik90) auto-claude: subtask-4-2 - Expand test_finding_validation.py with evi…
a28462f (AndyMik90) auto-claude: subtask-4-3 - Add deduplication and severity mapping tes…
3da1cc1 (AndyMik90) auto-claude: subtask-5-1 - Create E2E smoke test file with project cr…
186f510 (AndyMik90) auto-claude: subtask-5-2 - Add task creation and execution E2E test
ae831f2 (AndyMik90) auto-claude: subtask-5-3 - Add settings management E2E test
44c17f5 (AndyMik90) auto-claude: subtask-6-1 - Run full backend test suite and verify all…
f8a79da (AndyMik90) auto-claude: subtask-6-2 - Fix flaky test by clearing CLI path env vars
162f150 (AndyMik90) Merge branch 'develop' into auto-claude/141-simplify-testing-strategy…
dfe72e7 (AndyMik90) fix(tests): resolve PR review findings for testing strategy
198f0e6 (AndyMik90) fix(tests): resolve CI test failures
deb587d (AndyMik90) fix(tests): normalize paths in cross-platform tests
5ebfd93 (AndyMik90) Merge branch 'develop' into auto-claude/141-simplify-testing-strategy…
CI failure doesn't block NEEDS_REVISION verdicts as documented
Medium Severity
The `apply_ci_status_override` function's docstring states "Failing CI -> BLOCKED" without any condition, but the code only returns `BLOCKED` when the current verdict is `READY_TO_MERGE` or `MERGE_WITH_CHANGES`. If the verdict is `NEEDS_REVISION` and CI is failing, the function returns `NEEDS_REVISION` instead of `BLOCKED`. The asymmetry in the docstring (failing CI has no qualifier, while pending CI explicitly has one) suggests that failing CI was intended to always result in `BLOCKED`.
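
The mismatch can be illustrated with a minimal sketch. This is not the PR's actual code: the `Verdict` enum, the two function names, and the boolean `ci_failing` parameter are hypothetical stand-ins, and the real `apply_ci_status_override` signature is not shown on this page. Pending-CI handling is left out because the finding does not spell out that rule.

```python
from enum import Enum


class Verdict(Enum):
    # Hypothetical enum; only the member names come from the review finding.
    READY_TO_MERGE = "ready_to_merge"
    MERGE_WITH_CHANGES = "merge_with_changes"
    NEEDS_REVISION = "needs_revision"
    BLOCKED = "blocked"


def override_as_reported(verdict: Verdict, ci_failing: bool) -> Verdict:
    """Behavior the finding describes: failing CI blocks only mergeable verdicts."""
    mergeable = (Verdict.READY_TO_MERGE, Verdict.MERGE_WITH_CHANGES)
    if ci_failing and verdict in mergeable:
        return Verdict.BLOCKED
    # NEEDS_REVISION falls through unchanged even when CI is failing.
    return verdict


def override_as_documented(verdict: Verdict, ci_failing: bool) -> Verdict:
    """Behavior the docstring implies: failing CI blocks unconditionally."""
    if ci_failing:
        return Verdict.BLOCKED
    return verdict
```

Under these assumptions the two variants differ only for `NEEDS_REVISION` with failing CI, which is the case a regression test for this finding would need to pin down.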