feat(eval): parallelize inference and evaluator execution #3861

matjanos · 2025-11-27T16:33:58Z

Currently, genkit's evaluation system runs inference sequentially for all test cases in bulkRunAction(). For large datasets (e.g., 150+ test cases), this causes extremely slow evaluation times as each flow/model execution must complete before the next one starts.

Parallelized inference using the existing batchSize to run samples
concurrently (capped at 100) while preserving ordering, per-sample error
capture, and progress logging.
Evaluator actions now execute in parallel to match inference concurrency.
eval:flow continues to use --batchSize to control concurrency; eval:run
behavior is unchanged. Example: genkit eval:flow myFlow data.json
--batchSize 5 now runs both inference and evaluation in parallel batches.

Checklist (if applicable):

PR title is following https://www.conventionalcommits.org/en/v1.0.0/
Tested (manually, unit tested, etc.)
Docs updated (updated docs or a docs bug required)

google-cla · 2025-11-27T16:34:03Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

Add batched parallel inference and evaluator execution

7348129

github-project-automation bot added this to Genkit Backlog Nov 27, 2025

github-actions bot added js tooling labels Nov 27, 2025

test(evaluate): add bulkRunAction and runEvaluation tests

3b5f856

matjanos marked this pull request as ready for review November 27, 2025 16:45

docs(cli): note eval batchSize parallelism

daccc79

github-actions bot added the docs Improvements or additions to documentation label Nov 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(eval): parallelize inference and evaluator execution #3861

feat(eval): parallelize inference and evaluator execution #3861

matjanos commented Nov 27, 2025 •

edited

Loading

Uh oh!

google-cla bot commented Nov 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat(eval): parallelize inference and evaluator execution #3861

Are you sure you want to change the base?

feat(eval): parallelize inference and evaluator execution #3861

Conversation

matjanos commented Nov 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

google-cla bot commented Nov 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

matjanos commented Nov 27, 2025 •

edited

Loading