Skip to content
/ BoolV Public

A method to evaluate the response of lightweight LLMs to TRUE-FALSE questions

License

Notifications You must be signed in to change notification settings

csisc/BoolV

Repository files navigation

BoolV

A method to evaluate the response of lightweight LLMs to TRUE-FALSE questions

Models

Model Hyperparameters
llama-3.2-1b-instruct-q8_0 1.24 B
llama-3.2-3b-instruct-q8_0 3.21 B
Phi-3.5-mini-instruct.Q8_0 3.82 B
Mistral-7B-Instruct-v0.3.Q8_0 7.25 B
llama-3.2-8b-instruct-q8_0 8.03 B

Dataset

Dependencies

  • llama-cpp-python
  • pathlib
  • pandas
  • math
  • jsonlines

Funding

This research work has been done thanks to the computer resources of Wikimedia Switzerland.

About

A method to evaluate the response of lightweight LLMs to TRUE-FALSE questions

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages