-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
Context
The Entireio CLI implements agent-specific transcript chunking to handle arbitrarily large transcripts:
- Claude Code: JSONL-aware splitting that preserves line boundaries
- Gemini CLI: JSON splitting that maintains structure across chunks
- Configurable chunk sizes for storage constraints
TranscriptChunkerinterface allows each agent to define its own splitting logic
Current State
Our checkpoint system stores full transcripts. For long-running agent sessions (especially multi-hour implementation phases), transcripts can grow very large. Git isn't optimized for large blob storage, and this could impact checkpoint branch performance over time.
Proposal
Add transcript chunking to the checkpoint system:
- Split large transcripts into manageable chunks before storage
- Preserve structural integrity (JSONL line boundaries)
- Support reassembly for
explainandshowcommands - Consider compression as an alternative or complement
- Set configurable size thresholds
This is a scalability concern that becomes more relevant as agents run longer sessions and produce larger transcripts.
Reference
See entireio/cli — TranscriptChunker interface in the agent package.
Authored-by: egg
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels