MLX port of Andrej Karpathy's nano-llama31. The original project is to Llama 3.1 what nanoGPT is to GPT-2: a minimal, dependency-light implementation of the Llama 3.1 architecture. This MLX version aims to keep that spirit while leveraging Apple's MLX framework.
Like the original, this code focuses on the 8B base model of Llama 3.1.
- Replaced PyTorch with MLX for all tensor operations and neural network modules (a sketch of what this translation looks like follows this list)
- Removed CUDA-specific optimizations (e.g., flash attention), as MLX handles optimizations differently
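To give a flavor of the translation, below is a minimal sketch of what a Llama-style RMSNorm layer looks like when written against MLX's `mlx.nn` API. This is an illustrative assumption, not code taken from this repository:

```python
# Illustrative sketch only -- not the repository's actual code.
import mlx.core as mx
import mlx.nn as nn

class RMSNorm(nn.Module):
    """Llama-style RMS normalization written with MLX primitives."""

    def __init__(self, dims: int, eps: float = 1e-5):
        super().__init__()
        # Plain mx.array attributes are registered as trainable parameters.
        self.weight = mx.ones((dims,))
        self.eps = eps

    def __call__(self, x: mx.array) -> mx.array:
        # Scale by the reciprocal root-mean-square over the feature axis.
        rms = mx.rsqrt(mx.mean(mx.square(x), axis=-1, keepdims=True) + self.eps)
        return self.weight * x * rms
```

MLX also ships a built-in `nn.RMSNorm`, so a port can often swap in the stock layer rather than hand-rolling one.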
- Clone this forked repository:

  ```bash
  git clone https://github.com/your-username/nano-llama31-mlx.git
  cd nano-llama31-mlx
  ```
- Install the required dependencies (a quick sanity check for the MLX install follows this list):

  ```bash
  pip install -r requirements.txt
  ```
- Set up Hugging Face access:

  - Create a Hugging Face account if you don't have one: https://huggingface.co
  - Generate an access token: https://huggingface.co/settings/tokens
  - Set the token as an environment variable:

    ```bash
    export HF_TOKEN='your_token_here'
    ```
- Request access to the Llama 3.1 model (a sketch of downloading the weights follows this list):
  - Go to the Llama 3.1 8B model page on Hugging Face
  - Fill out the form to request access to Llama 3.1
  - Wait for approval (this may take some time)
- Run the inference script:

  ```bash
  python llama31.py
  ```
- Run the test script:

  ```bash
  python test_llama31.py
  ```
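After installing the dependencies, a quick way to confirm that MLX imports and evaluates arrays correctly is a snippet like the one below. This check is only a suggestion, not part of the repository's scripts:

```python
# Hypothetical sanity check: confirm MLX is installed and can evaluate arrays.
import mlx.core as mx

a = mx.ones((2, 2))
b = a + a    # MLX records this lazily; nothing is computed yet
mx.eval(b)   # force evaluation
print(b)     # expected: a 2x2 array of 2s
```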
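Once access to the gated model has been approved, the `HF_TOKEN` environment variable set earlier can be used to fetch the weights with `huggingface_hub`. The sketch below is an assumption about how that download might look; the repo id and the use of `snapshot_download` are not taken from this repository's scripts:

```python
# Hypothetical download sketch -- the repo id and helper call are assumptions,
# not taken from llama31.py.
import os
from huggingface_hub import snapshot_download

path = snapshot_download(
    repo_id="meta-llama/Meta-Llama-3.1-8B",  # gated repo; approval required
    token=os.environ["HF_TOKEN"],            # set via `export HF_TOKEN=...`
)
print(f"Weights downloaded to: {path}")
```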
Contributions are welcome! Please feel free to submit a Pull Request. When contributing, please keep in mind that this is a fork aimed at translating the original work to MLX, so major architectural changes should be carefully considered.
This project is a fork of Andrej Karpathy's nano-llama31. We are deeply grateful to Andrej Karpathy and all contributors to the original project for their absolutely fantastic work, which made this MLX adaptation possible.
This is an unofficial, community-driven fork. For the original and most up-to-date implementation, please refer to Karpathy's repository.