#

rubric-based-evaluation

Here is 1 public repository matching this topic...

PabloCabaleiro / pondera

Pondera is a lightweight, YAML-first framework to evaluate AI models and agents with pluggable runners and an LLM-as-a-judge.

python ai agents model-agnostic ai-evaluation llms llm-evaluation llm-evaluation-framework llm-judge agent-evaluation ai-evaluation-framework rubric-based-evaluation yaml-first

Updated Sep 19, 2025
Python

Improve this page

Add a description, image, and links to the rubric-based-evaluation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rubric-based-evaluation topic, visit your repo's landing page and select "manage topics."