LLM from Scratch

Dependencies (assuming Windows)

To install the required dependencies for this project, run the following command in your terminal:

pip install pylzma numpy ipykernel jupyter torch --index-url https://download.pytorch.org/whl/cu118

OpenWebText Download

Access the OpenWebText Corpus here.

Research Papers

Explore the foundational papers that have influenced this project:

Attention is All You Need introduces the Transformer architecture, revolutionizing sequence modeling.
A Survey of LLMs provides an extensive overview of the landscape of large language models.
QLoRA: Efficient Finetuning of Quantized LLMs explores techniques for efficient finetuning of quantized language models.

Note: If you don't have an NVIDIA GPU, the device parameter will default to 'cpu' since device = 'cuda' if torch.cuda.is_available() else 'cpu'. Running on CPU is supported but expect slower runtimes.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.gitignore		.gitignore
README.md		README.md
bigram.ipynb		bigram.ipynb
bpe-v1.ipynb		bpe-v1.ipynb
bpe_tokenizer.json		bpe_tokenizer.json
chatbot.py		chatbot.py
data-extract-v2.py		data-extract-v2.py
data-extract-v3.py		data-extract-v3.py
data-extract.py		data-extract.py
frankenstein_prometheus.txt		frankenstein_prometheus.txt
gpt-v1.ipynb		gpt-v1.ipynb
requirements.txt		requirements.txt
training.py		training.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM from Scratch

Dependencies (assuming Windows)

OpenWebText Download

Research Papers

About

Releases

Packages

Languages

carson-evans/LLM-From-Scratch

Folders and files

Latest commit

History

Repository files navigation

LLM from Scratch

Dependencies (assuming Windows)

OpenWebText Download

Research Papers

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages