Skip to content

A method to evaluate the response of lightweight LLMs to TRUE-FALSE questions across languages

License

Notifications You must be signed in to change notification settings

csisc/BoolV-Multi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BoolV

A method to evaluate the response of lightweight LLMs to TRUE-FALSE questions across languages

Supported Languages

The supported natural languages are the ones that have been featured as the best performing ones on Google Translation:

  • English
  • Afrikaans
  • German
  • Portuguese
  • Spanish
  • Polish

Models

Model Hyperparameters
llama-3.2-3b-instruct-q8_0 3.21 B
Phi-3.5-mini-instruct.Q8_0 3.82 B

Dataset

Dependencies

  • llama-cpp-python
  • pathlib
  • pandas
  • math

Funding

This research work has been done thanks to the computer resources of Wikimedia Switzerland.

About

A method to evaluate the response of lightweight LLMs to TRUE-FALSE questions across languages

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published