-
Notifications
You must be signed in to change notification settings - Fork 0
Description
Parent Issue
Sub-issue of #344 (Leverage Additional SDK v0.2.25 Capabilities)
Follow-up to #349 (SessionStart/SessionEnd hooks)
Feature Type
Cost estimation
Problem or Need
The --dry-run mode estimates costs and scenario counts but doesn't account for session lifecycle overhead. With #349 capturing session timing data, dry-run output could include timing projections that help users estimate total wall-clock time, not just API costs.
Currently, dry-run shows:
- Estimated scenario count
- Estimated cost per scenario
- Total estimated cost
Missing:
- Estimated session startup overhead per batch
- Projected wall-clock time factoring in session lifecycle
- Number of session starts/restarts expected per batch strategy
Proposed Solution
-
Add timing estimates to dry-run output — Based on historical session timing data (if available from previous runs) or conservative defaults, include:
- Estimated session startup time per batch
- Estimated total wall-clock time
- Number of expected session starts (1 per batch in
batched_by_component, 1 per scenario inisolated)
-
Use historical data — If previous run results exist in the state file, extract average
session_timingdurations to improve estimates
Pipeline Stage Affected
General / Multiple stages
Component Type
Not component-specific
Alternatives Considered
- Only show timing data in post-execution reports (via [Feature]: Expose session timing metrics in Stage 4 evaluation reports #357). This is simpler but doesn't help with pre-execution planning.
- Show a simple "sessions needed" count without timing estimates. Lower effort but less useful.
How important is this feature to you?
Low - Just a suggestion
Additional Context
This pairs with #357 (session timing in Stage 4 reports). Together they provide pre-execution estimates and post-execution actuals for session lifecycle overhead.
Risk Assessment
Low risk — Additive output in dry-run mode. No changes to execution behavior.
Files Affected
src/stages/3-execution/dry-run.ts(or equivalent) — Cost estimation output- CLI formatters for dry-run display
🤖 Created with Claude Code