Strip XML tags, runtime model metadata, run detail UI by penso · Pull Request #90 · moltis-org/moltis

penso · 2026-02-11T17:30:35Z

Summary

XML tag stripping: New response_sanitizer module strips internal XML tags (thinking, reasoning, scratchpad, etc.), pipe tokens (<|eot_id|>, <|im_end|>, etc.), and reasoning pattern blocks from LLM responses. Also recovers structured tool calls from XML blocks as a fallback. Integrated in both streaming and non-streaming paths in the agent runner and gateway chat handler.
Runtime model metadata: ModelMetadata struct and model_metadata() trait method on LlmProvider with a default implementation. OpenAI provider overrides it to fetch context window from /models/{model} API with OnceCell caching and static fallback on error. Auto-compaction now uses runtime metadata for accurate context window sizing.
Run detail UI: Backend read_by_run_id store method, sessions.run_detail RPC endpoint, and a Preact RunDetail component with Overview/Actions/Messages tabs. Expandable button appears on assistant messages that have a run_id. Tool results now propagate run_id for proper grouping.

Validation

Completed

just format-check passes
cargo clippy -p moltis-agents -p moltis-sessions -p moltis-gateway -- -D warnings clean
cargo test — all tests pass (including 20 new sanitizer tests, 5 new metadata tests, 2 new store tests)
biome check --write applied to JS files

Remaining

./scripts/local-validate.sh <PR> — to be run after PR creation
E2E tests: cd crates/gateway/ui && npx playwright test
Manual QA: send a message and verify clean response, check logs for metadata fetch, expand run detail in chat

Manual QA

Pending — will be performed after local validation completes.

Add response_sanitizer module that strips internal reasoning tags (thinking, reflection, scratchpad, etc.), special control tokens (eot_id, im_end, etc.), and recovers structured tool calls from XML blocks in LLM output. Integrated at both the agent runner level and the gateway streaming path.

Add ModelMetadata struct and model_metadata() trait method to LlmProvider with a default implementation that returns the static context_window() value. Override in OpenAiProvider to fetch context length from the /models API endpoint with OnceCell caching. Use runtime metadata for auto-compaction threshold in the gateway.

Add sessions.run_detail RPC method that returns messages for a specific run_id, plus a RunDetail Preact component with Overview, Actions, and Messages tabs. The component is mounted on assistant messages that have a run_id. Tool results now carry run_id for linking. Includes backend tests and E2E specs.

Apply rustfmt, biome, and clippy fixes across the three feature commits. Add changelog entries for XML tag stripping, runtime model metadata, and run detail UI features.

codspeed-hq · 2026-02-11T17:32:28Z

Merging this PR will improve performance by ×2.6

⚡ 1 improved benchmark
✅ 33 untouched benchmarks
⏩ 1 skipped benchmark¹

Performance Changes

	Benchmark	`BASE`	`HEAD`	Efficiency
⚡	`session_store_list[10]`	39 µs	15.1 µs	×2.6

_{Comparing xml-stripping (d4d15bc) with main (45cbf7c)}

1 benchmark was skipped, so the baseline result was used instead. If it was deleted from the codebase, click here and archive it to remove it from the performance reports. ↩

codecov · 2026-02-11T17:34:32Z

Codecov Report

❌ Patch coverage is 87.48466% with 102 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
crates/sessions/src/message.rs	21.73%	18 Missing ⚠️
crates/gateway/src/server.rs	0.00%	14 Missing ⚠️
crates/agents/src/runner.rs	45.83%	13 Missing ⚠️
crates/agents/src/model.rs	63.33%	11 Missing ⚠️
crates/gateway/src/methods.rs	35.71%	9 Missing ⚠️
crates/gateway/src/session.rs	0.00%	8 Missing ⚠️
crates/agents/src/response_sanitizer.rs	96.83%	7 Missing ⚠️
crates/gateway/src/auth_webauthn.rs	0.00%	6 Missing ⚠️
crates/agents/src/providers/github_copilot.rs	0.00%	5 Missing ⚠️
crates/gateway/src/chat.rs	66.66%	3 Missing ⚠️
... and 4 more

📢 Thoughts on this report? Let us know!

penso added 4 commits February 11, 2026 07:40

chore: fix formatting, clippy, and update changelog

b1d7687

Apply rustfmt, biome, and clippy fixes across the three feature commits. Add changelog entries for XML tag stripping, runtime model metadata, and run detail UI features.

penso added 4 commits February 11, 2026 11:58

Merge main into xml-stripping

f48b6c8

chore: update multiple areas

eb9577f

Merge branch 'main' into xml-stripping

bcc405c

style: format after merging main

d4d15bc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Strip XML tags, runtime model metadata, run detail UI#90

Strip XML tags, runtime model metadata, run detail UI#90
penso wants to merge 8 commits intomainfrom
xml-stripping

penso commented Feb 11, 2026

Uh oh!

codspeed-hq bot commented Feb 11, 2026 •

edited

Loading

Uh oh!

codecov bot commented Feb 11, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

penso commented Feb 11, 2026

Summary

Validation

Completed

Remaining

Manual QA

Uh oh!

codspeed-hq bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging this PR will improve performance by ×2.6

Performance Changes

Footnotes

Uh oh!

codecov bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

codspeed-hq bot commented Feb 11, 2026 •

edited

Loading

codecov bot commented Feb 11, 2026 •

edited

Loading