Description
Feature Type
SDK integration (Anthropic or Agent SDK)
Problem or Need
The recent SDK alignment brought cc-plugin-eval up to date with the Claude Agent SDK v0.2.25 type definitions by fixing type mismatches and removing workarounds. However, several newly available SDK capabilities remain unused. These represent enhancement opportunities for improving execution control, safety, and observability.
Proposed Solution
Incrementally adopt the following SDK v0.2.25 features as needed:
- `interrupt()` method - Use as graceful timeout alternative to `AbortController` for scenario execution. Provides cleaner termination semantics.
- `forkSession` option - Enable session forking as alternative to `/clear` + `continue` for batch isolation. May improve checkpoint handling.
- `sandbox` settings - Configure execution safety with filesystem/network restrictions. Useful for testing untrusted plugins.
- `Stop` hook - Capture explicit completion signals from scenarios. Enables detection of clean vs forced termination.
- `SessionStart`/`SessionEnd` hooks - Track session lifecycle events for diagnostics. Helps with session batching metrics.
- New message types - Handle `SDKToolProgressMessage`, `SDKToolUseSummaryMessage`, `SDKFilesPersistedEvent`, `SDKTaskNotificationMessage`, `SDKAuthStatusMessage` in transcript capture and evaluation.
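To make the `interrupt()` item more concrete, here is a minimal sketch of a two-phase timeout: request a graceful stop first, and fall back to a hard stop only if the run does not end within a grace period. Everything in it is an assumption for illustration: `InterruptibleRun` is a local stand-in for the SDK's query handle (not its real type), and `runWithGracefulTimeout`, `hardStop`, `softTimeoutMs`, and `graceMs` are hypothetical names. In practice `hardStop` would likely be `() => abortController.abort()`.

```typescript
// Hypothetical stand-in for the SDK query handle: an async stream of
// messages that also exposes interrupt(). This is NOT the SDK's real type.
interface InterruptibleRun<T> extends AsyncIterable<T> {
  interrupt(): Promise<void>;
}

type Termination = "completed" | "interrupted" | "aborted";

interface RunResult<T> {
  messages: T[];
  termination: Termination;
}

// Collects all messages from `run`. After softTimeoutMs it requests a
// graceful stop via interrupt(); if the run is still going graceMs later,
// it calls hardStop (assumed to wrap AbortController.abort()).
async function runWithGracefulTimeout<T>(
  run: InterruptibleRun<T>,
  hardStop: () => void,
  softTimeoutMs: number,
  graceMs: number,
): Promise<RunResult<T>> {
  const messages: T[] = [];
  const state: { termination: Termination } = { termination: "completed" };
  let hardTimer: ReturnType<typeof setTimeout> | undefined;

  const softTimer = setTimeout(() => {
    state.termination = "interrupted";
    // Errors from interrupt() are ignored; the hard timer is the fallback.
    run.interrupt().catch(() => undefined);
    hardTimer = setTimeout(() => {
      state.termination = "aborted";
      hardStop();
    }, graceMs);
  }, softTimeoutMs);

  try {
    for await (const message of run) {
      messages.push(message);
    }
  } catch (error) {
    // A stream error is expected only after a hard stop; rethrow otherwise.
    if (state.termination !== "aborted") throw error;
  } finally {
    clearTimeout(softTimer);
    if (hardTimer !== undefined) clearTimeout(hardTimer);
  }
  return { messages, termination: state.termination };
}
```

The returned `termination` value distinguishes a clean finish from an interrupted or aborted one, which is the same distinction the `Stop` hook item is after; a real integration could combine the two signals.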
Pipeline Stage Affected
Stage 3 - Execution
Component Type (if applicable)
Not component-specific
Alternatives Considered
Current workarounds are functional:
- `AbortController` works for timeouts (just less semantic)
- `/clear` + `continue` handles batch isolation (slight overhead)
- Existing message types cover core evaluation needs
These enhancements are nice-to-have improvements, not critical gaps.
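As a possible low-cost intermediate step for the message types, transcript capture could keep evaluating only the types it already understands while counting everything else, so that new message kinds show up in diagnostics instead of being dropped silently. The sketch below assumes only that each message carries a string `type` and an optional `subtype`; `MessageLike`, `partitionMessages`, and the type strings in the usage example are hypothetical placeholders, not SDK definitions.

```typescript
// Hypothetical minimal shape: only the discriminant fields are assumed.
interface MessageLike {
  type: string;
  subtype?: string;
}

interface CaptureSummary<T extends MessageLike> {
  evaluated: T[];                        // messages passed on to evaluation
  skippedCounts: Record<string, number>; // tally of everything else, by kind
}

// Splits a transcript into messages whose type is in evaluatedTypes and a
// per-kind count of the rest, keyed as "type" or "type/subtype".
function partitionMessages<T extends MessageLike>(
  messages: Iterable<T>,
  evaluatedTypes: ReadonlySet<string>,
): CaptureSummary<T> {
  const evaluated: T[] = [];
  const skippedCounts: Record<string, number> = {};
  for (const message of messages) {
    if (evaluatedTypes.has(message.type)) {
      evaluated.push(message);
      continue;
    }
    const key = message.subtype
      ? `${message.type}/${message.subtype}`
      : message.type;
    skippedCounts[key] = (skippedCounts[key] ?? 0) + 1;
  }
  return { evaluated, skippedCounts };
}

// Usage with placeholder type strings (not real SDK discriminants):
const transcript: MessageLike[] = [
  { type: "assistant" },
  { type: "example_new_type" },
  { type: "example_new_type" },
  { type: "system", subtype: "example_subtype" },
];
const summary = partitionMessages(transcript, new Set(["assistant"]));
console.log(summary.skippedCounts);
```

A non-empty `skippedCounts` would then indicate which of the new message types actually occur in practice, which could help decide which ones are worth handling first.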
How important is this feature to you?
Low - Just a suggestion
Additional Context
These items were identified during the SDK v0.2.25 alignment work. The core alignment (type corrections, bug fixes) is complete. These represent future opportunities to leverage additional SDK capabilities as the framework matures.
Related commit: See recent Agent SDK alignment commits in the repository.
🤖 Created with Claude Code