Skip to content

[Feature]: Leverage Additional SDK v0.2.25 Capabilities #344

@sjnims

Description

@sjnims

Feature Type

SDK integration (Anthropic or Agent SDK)

Problem or Need

The recent SDK alignment brought cc-plugin-eval up to date with the Claude Agent SDK v0.2.25 type definitions by fixing type mismatches and removing workarounds. However, several newly available SDK capabilities remain unused. These represent enhancement opportunities for improving execution control, safety, and observability.

Proposed Solution

Incrementally adopt the following SDK v0.2.25 features as needed:

  • interrupt() method - Use as graceful timeout alternative to AbortController for scenario execution. Provides cleaner termination semantics.
  • forkSession option - Enable session forking as alternative to /clear + continue for batch isolation. May improve checkpoint handling.
  • sandbox settings - Configure execution safety with filesystem/network restrictions. Useful for testing untrusted plugins.
  • Stop hook - Capture explicit completion signals from scenarios. Enables detection of clean vs forced termination.
  • SessionStart/SessionEnd hooks - Track session lifecycle events for diagnostics. Helps with session batching metrics.
  • New message types - Handle SDKToolProgressMessage, SDKToolUseSummaryMessage, SDKFilesPersistedEvent, SDKTaskNotificationMessage, SDKAuthStatusMessage in transcript capture and evaluation.

Pipeline Stage Affected

Stage 3 - Execution

Component Type (if applicable)

Not component-specific

Alternatives Considered

Current workarounds are functional:

  • AbortController works for timeouts (just less semantic)
  • /clear + continue handles batch isolation (slight overhead)
  • Existing message types cover core evaluation needs

These enhancements are nice-to-have improvements, not critical gaps.

How important is this feature to you?

Low - Just a suggestion

Additional Context

These items were identified during the SDK v0.2.25 alignment work. The core alignment (type corrections, bug fixes) is complete. These represent future opportunities to leverage additional SDK capabilities as the framework matures.

Related commit: See recent Agent SDK alignment commits in the repository.


🤖 Created with Claude Code

Sub-issues

Metadata

Metadata

Assignees

Labels

effort:large> 4 hoursenhancementNew feature or requestpriority:lowNice to havesdk:agentClaude Agent SDK usage (Stage 3)stage:executionStage 3: Agent SDK execution with tool capturestatus:analyzedIssue has been analyzed by Claude

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions