Skip to content

Comments

Add full e2e test: define -> implement -> execute workflow#45

Merged
nhorton merged 10 commits intomainfrom
claude/e2e-full-workflow-test
Jan 15, 2026
Merged

Add full e2e test: define -> implement -> execute workflow#45
nhorton merged 10 commits intomainfrom
claude/e2e-full-workflow-test

Conversation

@nhorton
Copy link
Contributor

@nhorton nhorton commented Jan 15, 2026

Summary

Updates the Claude Code e2e test to verify the complete DeepWork workflow:

  1. /deepwork_jobs.define - Create a new job from scratch (fruits job)
  2. /deepwork_jobs.implement - Generate step instruction files
  3. /fruits.identify - Execute the generated identify command
  4. /fruits.classify - Execute the generated classify command

This tests the actual user experience rather than just validating pre-existing fixtures.

Changes

  • Fresh test project setup with no pre-existing job definitions
  • Deterministic prompts for creating the fruits job with exact specifications
  • Validates job.yml, step files, and slash commands are generated correctly
  • Executes generated commands and validates outputs

Test plan

🤖 Generated with Claude Code

nhorton and others added 10 commits January 14, 2026 17:30
Replace Markdown with JSON array - CLA Assistant action
expects JSON format for storing signatures.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
CLA Assistant expects { "signedContributors": [] } format,
not a plain array.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Per CLA Assistant docs: "You do not need to create this file manually.
Our workflow will create the signature file if it does not already exist.
Manually creating this file will cause the workflow to fail."

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Add a deterministic 'fruits' test job and comprehensive CI tests to validate
that deepwork-generated commands work correctly with Claude Code.

Changes:
- Add fruits job fixture (identify + classify steps) for CI testing
- Add integration tests for fruits workflow (8 tests)
- Add e2e tests for Claude Code execution (3 tests, skipped without API key)
- Add GitHub Actions workflow for automated testing:
  - validate-generation: Always runs, tests command generation
  - claude-code-e2e: Runs with ANTHROPIC_API_KEY, tests actual execution

The fruits job is designed to be deterministic:
- Input: comma-separated list of items (e.g., "apple, car, banana")
- Step 1: Identify which items are fruits
- Step 2: Classify fruits by category (citrus, tropical, etc.)
- Add concurrency rules to ensure only one instance runs per PR
- Fix test to use 'deepwork install --platform claude --path test_project'
- Create .claude directory before install for platform detection
- Run commands from repo root with --path flag instead of cd'ing
Update the claude-code-e2e job to test the COMPLETE DeepWork workflow:
1. /deepwork_jobs.define - Create a new job from scratch
2. /deepwork_jobs.implement - Generate step instruction files
3. /fruits.identify - Execute the generated identify command
4. /fruits.classify - Execute the generated classify command

This tests the actual user experience rather than just validating
pre-existing fixtures. The test provides deterministic instructions
for creating a 'fruits' job with identify and classify steps.
@nhorton nhorton merged commit c14e5b1 into main Jan 15, 2026
4 checks passed
@nhorton nhorton deleted the claude/e2e-full-workflow-test branch January 15, 2026 23:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants