danielepoterti

Follow

💭

bep bop

Daniele Potertì danielepoterti

💭

bep bop

Follow

doing the bep and the bop

13 followers · 23 following

Milan

Achievements

Achievements

Highlights

Pro

Pinned Loading

evals evals Public

Forked from openai/evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python
Crisp-Unimib/ITALIC Crisp-Unimib/ITALIC Public

ITALIC is a benchmark evaluating language models' understanding of Italian culture, commonsense reasoning and linguistic proficiency in a morphologically rich language.

Python 9 1
Crisp-Unimib/SFAL Crisp-Unimib/SFAL Public

Python 1
Crisp-Unimib/Role-Vectors Crisp-Unimib/Role-Vectors Public

Role Vectors are a novel approach to guiding LLM inference behaviour, an alternative to persona-based prompting.

Python 2