preserving raw output for analysis? #33
Replies: 4 comments 3 replies
-
|
This is an interesting idea. When you say "raw" output, what kind of information are you looking for? We could do something with an environment variable that you pass to Claude so the MCP server knows you want to record raw output. |
Beta Was this translation helpful? Give feedback.
-
|
Well, I am not sure I understand the model well enough to give a clear answer 😅 , but ... roughly: If the core value prop here is that context burden is reduced by suppressing some output from tool use, then whatever that full output would have been -- the prompts, the tool use, and the responses, all chronological -- could be written somewhere (with full context) for separate analysis. |
Beta Was this translation helpful? Give feedback.
-
|
Got it! Well, we have a couple of options:
Both are doable, so it comes down to whether we just want to see the raw logs or whether we what something structured that we can query with Option 1 is the easiest, but it does mean you have to turn on debug mode to see it, whereas option 2 would always write the data. However, option 2 requires us to somehow manage the amount of data we write to prevent the file from growing forever without bound. |
Beta Was this translation helpful? Give feedback.
-
|
@rjkaes, what are your thoughts on the per-session model? As an option, it would sidestep any scale issues, and would be just lightweight enough to allow end users to overlay their own process/tooling. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
(apologies if this is already a solved problem -- if so, maybe it should be surfaced more prominently?)
I regularly benefit from retroactively analyzing the incidental output of tool use, to harvest the observations made as items for the issue queue.
As the tools chatter amongst themselves, they often observe things that should probably be fixed or added. Analyzing this chatter surfaces issues that are often quite important -- that I wouldn't even know to look for.
So it might be useful to save all of that raw output, so that it could be separately analyzed in a single pass in the same manner. Something like a
--log-raw-outputoption, that I could create a shorthand to trigger analysis against.Beta Was this translation helpful? Give feedback.
All reactions