Add eval reliability checks and regression coverage #124

piyushkr0509 · 2026-02-10T09:32:59Z

@virattt

- extract eval parsing/reliability logic into a shared core module
- add deterministic reliability gates before LLM judging in eval run
- add dataset selection support for eval runs (default, regression, custom CSV)
- add focused regression eval dataset
- add regression tests for env provider key mapping and progress channel behavior
- add eval core unit tests for CSV parsing + reliability checks
- include xAI provider API-key mapping fix in env config
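
For reviewers, here is a rough sketch of what "deterministic reliability gates before LLM judging" can look like in practice. It is an illustration only, not the code in this PR: `EvalRow`, `GateResult`, `checkReliability`, `Judge`, and `scoreRow` are hypothetical names. The specific checks (non-empty fields) and the choice to score a gated row as 0 are assumptions.

```typescript
// Hypothetical shapes: none of these names are taken from the PR's diff.
interface EvalRow {
  question: string;
  expected: string;
}

interface GateResult {
  ok: boolean;
  reasons: string[];
}

// Deterministic checks that need no model call: the same input always
// yields the same verdict, so failures are reproducible in CI.
function checkReliability(row: EvalRow, answer: string): GateResult {
  const reasons: string[] = [];
  if (row.question.trim() === "") reasons.push("empty question");
  if (row.expected.trim() === "") reasons.push("empty expected answer");
  if (answer.trim() === "") reasons.push("empty model answer");
  return { ok: reasons.length === 0, reasons };
}

// Stand-in for an LLM judge; assumed to return a numeric score.
type Judge = (row: EvalRow, answer: string) => Promise<number>;

// Run the gate first. Only rows that pass are sent to the judge.
// Scoring a gated row as 0 is an assumption made for this sketch.
async function scoreRow(
  row: EvalRow,
  answer: string,
  judge: Judge
): Promise<{ score: number; reasons: string[] }> {
  const gate = checkReliability(row, answer);
  if (!gate.ok) return { score: 0, reasons: gate.reasons };
  return { score: await judge(row, answer), reasons: [] };
}
```

The point of this ordering is that malformed rows or empty answers fail fast with a concrete reason, rather than being passed to a nondeterministic judge whose verdict would be hard to reproduce.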

Add eval reliability gates and regression tests

300e897
