Description
The current --simple mode flow:
- Generate full explanation via LLM
- Build a new prompt from that output
- Call the LLM again to summarize
This creates two problems:
- Doubles latency
- Doubles token usage and cost
As repository size grows, the first output may become large. Feeding that entire output back into the model increases:
- Token consumption
- Risk of hitting context limits
- Hallucination amplification
This is unnecessary for simple mode.
--simple should not depend on generating the full explanation first.
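To make the cost argument concrete, here is a rough token-accounting sketch. The function names and the numbers in the usage note are hypothetical and purely illustrative; they are not measurements from this project. The point is only that in the two-pass flow the full explanation is billed twice: once as output of the first call and again as input to the second.

```python
def two_pass_tokens(context_tokens: int, full_output_tokens: int,
                    summary_tokens: int, instruction_tokens: int = 0) -> int:
    """Approximate total tokens for 'generate detailed, then compress'."""
    # Call 1: repo context in, full explanation out.
    first_call = context_tokens + full_output_tokens
    # Call 2: the full explanation is fed back in, summary out.
    second_call = instruction_tokens + full_output_tokens + summary_tokens
    return first_call + second_call


def single_pass_tokens(context_tokens: int, summary_tokens: int) -> int:
    """Approximate total tokens when the summary is requested directly."""
    return context_tokens + summary_tokens
```

For example, with an assumed 3000 tokens of context, a 1500-token full explanation, and a 200-token summary, `two_pass_tokens(3000, 1500, 200)` gives 6200 while `single_pass_tokens(3000, 200)` gives 3200. The gap widens as the full explanation grows.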
Proposed change:
When --simple is set, build a simplified prompt directly from:
- Repo metadata
- README
- Optional tree summary
Then call the LLM once.
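A minimal sketch of what the direct path could look like, assuming the LLM client can be passed in as a plain callable. `RepoContext`, `build_simple_prompt`, `explain_simple`, the prompt wording, and the README character cap are all hypothetical names and choices made for illustration; they are not taken from the existing codebase.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class RepoContext:
    """Hypothetical container for the inputs listed above."""
    name: str
    description: str
    readme: str
    tree_summary: Optional[str] = None


def build_simple_prompt(ctx: RepoContext, max_readme_chars: int = 4000) -> str:
    """Build the simplified prompt directly from repo inputs."""
    parts = [
        "Explain this repository in a few plain-language sentences.",
        f"Repository: {ctx.name}",
        f"Description: {ctx.description}",
        "README:",
        # Cap the README so the prompt size stays bounded (assumed limit).
        ctx.readme[:max_readme_chars],
    ]
    # The tree summary is optional and only included when present.
    if ctx.tree_summary:
        parts += ["Tree summary:", ctx.tree_summary]
    return "\n".join(parts)


def explain_simple(ctx: RepoContext, call_llm: Callable[[str], str]) -> str:
    """Single LLM call; no dependency on the full explanation."""
    return call_llm(build_simple_prompt(ctx))
```

Passing the client in as `call_llm` keeps the sketch independent of any particular provider and makes it easy to check in a test that exactly one call is made.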
Benefits:
- Single LLM call
- Lower cost
- Lower latency
- Reduced token overflow risk
- Cleaner execution path
This keeps --simple lightweight, which aligns with user expectations.
Right now it behaves like “generate detailed, then compress,” which is inefficient by design.