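To install the required dependencies for this project, run the following commands in your terminal. The PyTorch wheel index (cu118, i.e. CUDA 11.8 builds) is passed only to the torch install, because --index-url replaces the default PyPI index and the other packages may not be hosted there:
pip install pylzma numpy ipykernel jupyter
pip install torch --index-url https://download.pytorch.org/whl/cu118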
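After installing, a quick check that the packages can be found may save some debugging later. The snippet below is a minimal sketch that uses only the standard library; the module names are assumptions about how each package is imported (in particular, jupyter is checked through jupyter_core), and missing_modules is a hypothetical helper, not part of this project.

```python
import importlib.util

# Import names assumed for the installed packages (assumption, not from the project);
# the jupyter metapackage is checked through jupyter_core.
REQUIRED_MODULES = ["pylzma", "numpy", "ipykernel", "jupyter_core", "torch"]


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be located in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing: " + ", ".join(missing))
    else:
        print("All dependencies found.")
```

If anything is reported missing, re-running the corresponding pip command in the same environment is the usual first step.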
Access the OpenWebText Corpus here.
Explore the foundational papers that have influenced this project:
- Attention is All You Need introduces the Transformer architecture, revolutionizing sequence modeling.
- A Survey of LLMs provides an extensive overview of the landscape of large language models.
- QLoRA: Efficient Finetuning of Quantized LLMs explores techniques for efficient finetuning of quantized language models.
Note: If you don't have an NVIDIA GPU, the device parameter will default to 'cpu', since the code sets device = 'cuda' if torch.cuda.is_available() else 'cpu'. Running on CPU is supported, but expect slower runtimes.
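The device selection described in the note can be sketched as follows. This is an illustration rather than the project's actual code; the tensor x is a hypothetical example added only to show that data lands on the chosen device.

```python
import torch

# Pick the GPU when a CUDA-capable device is visible to PyTorch, otherwise fall back to CPU.
device = 'cuda' if torch.cuda.is_available() else 'cpu'
print(f"Using device: {device}")

# Hypothetical example tensor: creating it with device=device places it on the selected device.
x = torch.ones(2, 3, device=device)
print(x.device.type)
```

Passing the same device value to every tensor and model you create keeps the code identical on GPU and CPU machines.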