Writing a Transformer (the base for LLMs) in Rust
- Ashish Vaswani et al. (2017). Attention Is All You Need.
- Alec Radford et al. (2019). Language Models are Unsupervised Multitask Learners.
- Harvard NLP Group. The Annotated Transformer.
- 3Blue1Brown, Deep Learning series: "Introduction to Transformers" and "Attention in Transformers".
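The core operation behind every Transformer in the references above is scaled dot-product attention, softmax(QKᵀ/√d_k)·V. As a rough, dependency-free sketch of what this project implements (using plain `Vec<Vec<f64>>` matrices rather than the repo's actual types, which are an assumption here):

```rust
// Sketch of scaled dot-product attention from "Attention Is All You Need":
// softmax(Q K^T / sqrt(d_k)) V, on plain nested-Vec matrices.

fn matmul(a: &[Vec<f64>], b: &[Vec<f64>]) -> Vec<Vec<f64>> {
    let (n, k, m) = (a.len(), b.len(), b[0].len());
    let mut out = vec![vec![0.0; m]; n];
    for i in 0..n {
        for j in 0..m {
            for t in 0..k {
                out[i][j] += a[i][t] * b[t][j];
            }
        }
    }
    out
}

fn transpose(a: &[Vec<f64>]) -> Vec<Vec<f64>> {
    let (n, m) = (a.len(), a[0].len());
    let mut out = vec![vec![0.0; n]; m];
    for i in 0..n {
        for j in 0..m {
            out[j][i] = a[i][j];
        }
    }
    out
}

// Numerically stable row-wise softmax: subtract each row's max before exp.
fn softmax_rows(a: &mut [Vec<f64>]) {
    for row in a.iter_mut() {
        let max = row.iter().cloned().fold(f64::NEG_INFINITY, f64::max);
        let mut sum = 0.0;
        for x in row.iter_mut() {
            *x = (*x - max).exp();
            sum += *x;
        }
        for x in row.iter_mut() {
            *x /= sum;
        }
    }
}

fn attention(q: &[Vec<f64>], k: &[Vec<f64>], v: &[Vec<f64>]) -> Vec<Vec<f64>> {
    let d_k = q[0].len() as f64;
    // Attention scores Q K^T, scaled by 1/sqrt(d_k).
    let mut scores = matmul(q, &transpose(k));
    for row in scores.iter_mut() {
        for x in row.iter_mut() {
            *x /= d_k.sqrt();
        }
    }
    softmax_rows(&mut scores);
    // Attention weights times values.
    matmul(&scores, v)
}

fn main() {
    // Two query tokens attending over two key/value tokens (d_k = 2).
    let q = vec![vec![1.0, 0.0], vec![0.0, 1.0]];
    let k = q.clone();
    let v = vec![vec![1.0, 2.0], vec![3.0, 4.0]];
    for row in attention(&q, &k, &v) {
        println!("{:?}", row);
    }
}
```

A full Transformer block adds multi-head projections, residual connections, layer norm, and a feed-forward network around this primitive; the sketch covers only the attention kernel itself.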
This project is licensed under the MIT License. See the LICENSE file for details.