Built-in Way to Mock/Force LLM Responses for Testing #226

vmg-dev · 2026-01-14T17:27:56Z

vmg-dev
Jan 14, 2026

Following up on the Discord conversation with Alem about enabling easier testing.

When writing e2e tests, I want deterministic responses so I can make reliable assertions. Real LLM calls make tests flaky, slow, and expensive.

I've rolled my own mock layer using query params, but it's a lot of boilerplate to match the chunk format. Would be great to have something like this supported out of the box.

Ideally it would let you say "for this request, return exactly this response" with controllable streaming behavior. For example, I control how long a stream takes to complete so I can test stream resumption with Playwright. I have a bunch of test fixtures on my server and have them removed in prod builds with dead code removal controlled by a build time constant.

Bonus idea: it would be cool to generate fixtures by recording real LLM conversations through devtools.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Built-in Way to Mock/Force LLM Responses for Testing #226

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Uh oh!

Built-in Way to Mock/Force LLM Responses for Testing #226

Uh oh!

Uh oh!

vmg-dev Jan 14, 2026

Replies: 0 comments

vmg-dev
Jan 14, 2026