v0.1.19

av released this 13 Sep 10:31

· 172 commits to main since this release

This project provides a unified framework to test generative language models on a large number of different evaluation tasks.

# [Optional] pre-build the image
harbor build lmeval

Refer to the configuration for Harbor services

# Run evals
harbor lmeval --tasks gsm8k,hellaswag

# Open results folder
harbor lmeval results

Full Changelog: v0.1.18...v0.1.19

Assets 2

Provide feedback