Add support for evals #630

alzeck · 2025-11-27T19:13:29Z

This pull request adds support for the OpenAI Evals API introduced earlier this year.

Test for retrieving an output item are a bit flaky as it requires the evals run to be completed in order for the fetch to work, happy for any suggestion on how to implement this differently if needed.

All Submissions:

Have you followed the guidelines in our Contributing document?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?
Have you added an explanation of what your changes do and why you'd like us to include them?

Copilot

Pull request overview

This pull request adds comprehensive support for the OpenAI Evals API, enabling users to systematically evaluate AI model performance through the ruby-openai gem. The implementation follows the existing codebase patterns with proper module structure, test coverage using VCR cassettes, and detailed documentation.

Key Changes:

Added complete Evals API client implementation with support for evaluations, runs, and output items
Comprehensive RSpec test suite with 13 test cases covering all API endpoints
Extensive README documentation with usage examples for all supported operations

Reviewed changes

Copilot reviewed 35 out of 35 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
lib/openai/evals.rb	New Evals module with nested Runs and OutputItems classes implementing all API endpoints
lib/openai/client.rb	Integration of evals accessor method into main client
lib/openai.rb	Added require statement for the new evals module
spec/openai/client/evals_spec.rb	Complete test suite covering all evals, runs, and output_items operations
README.md	Added comprehensive documentation section with 11 usage examples
spec/fixtures/cassettes/*.yml	38 VCR cassette files for test fixtures

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

README.md

alzeck added 4 commits November 27, 2025 18:04

add support for evals

dc9dde7

add list runs and docs

9d2f21c

reduce lines for evals list

cb6642d

add supported endpoints

b543a4d

Copilot AI review requested due to automatic review settings November 27, 2025 19:13

Copilot started reviewing on behalf of alzeck November 27, 2025 19:13 View session

Copilot finished reviewing on behalf of alzeck November 27, 2025 19:14

Copilot AI reviewed Nov 27, 2025

View reviewed changes

README.md Outdated Show resolved Hide resolved

README.md Outdated Show resolved Hide resolved

update readme

dbabfd5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add support for evals #630

Add support for evals #630

Uh oh!

alzeck commented Nov 27, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Add support for evals #630

Are you sure you want to change the base?

Add support for evals #630

Uh oh!

Conversation

alzeck commented Nov 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

All Submissions:

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

alzeck commented Nov 27, 2025 •

edited

Loading