BoolV

A method to evaluate the response of lightweight LLMs to TRUE-FALSE questions

Models

This research work has been done thanks to the computer resources of Wikimedia Switzerland.

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
Mistral-7B		Mistral-7B
Phi-3.5-mini		Phi-3.5-mini
llama-3.2-1B		llama-3.2-1B
llama-3.2-3B		llama-3.2-3B
llama-3.2-8B		llama-3.2-8B
output		output
LICENSE		LICENSE
README.md		README.md
calibration-false-first.py		calibration-false-first.py
calibration.py		calibration.py