
Implement Full Transformer Model in Rust #7

Merged: 5 commits merged into main from Transformer/Model on Dec 19, 2024

Conversation

@JakubSchwenkbeck (Owner) commented on Dec 19, 2024:


Resolves #4 (Integrate Layers into an Encoder/Decoder).

Description

This pull request introduces a complete implementation of a Transformer model based on the "Attention Is All You Need" paper. The model includes the following components (a minimal Rust sketch of the forward pass follows the list):

  1. Separate Encoder and Decoder Inputs: handles two distinct input sequences, one for the encoder and one for the decoder.
  2. Embedding Layer: converts tokenized input sequences into dense vector embeddings.
  3. Transformer Encoder: processes the encoder input to produce encoded representations.
  4. Transformer Decoder: uses the encoded representations together with the decoder input to generate context-aware outputs.
  5. Output Projection and Softmax: projects the decoder output into vocabulary space and applies softmax to produce token probabilities.
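For illustration, here is a minimal, self-contained sketch of this forward pass using the `ndarray` crate. The struct and function names (`Embedding`, `encode`, `decode`, `attention`) are hypothetical and not taken from this repository's API; residual connections, layer normalization, feed-forward sublayers, positional encodings, masking, and multi-head splitting are all omitted for brevity.

```rust
// Hypothetical sketch of the end-to-end forward pass described above.
// Names and shapes are illustrative, not this repository's actual API.
use ndarray::{stack, Array2, Axis};

/// Step 2: token IDs -> dense vectors, shape (seq_len, d_model).
struct Embedding {
    weights: Array2<f32>, // (vocab_size, d_model)
}

impl Embedding {
    fn forward(&self, tokens: &[usize]) -> Array2<f32> {
        // Gather one embedding row per token.
        let rows: Vec<_> = tokens.iter().map(|&t| self.weights.row(t)).collect();
        stack(Axis(0), &rows).expect("embedding rows have equal width")
    }
}

/// Row-wise softmax, used both inside attention and for step 5.
fn softmax(logits: &Array2<f32>) -> Array2<f32> {
    let mut out = logits.clone();
    for mut row in out.rows_mut() {
        // Subtract the row max for numerical stability before exponentiating.
        let max = row.fold(f32::NEG_INFINITY, |a, &b| a.max(b));
        row.mapv_inplace(|x| (x - max).exp());
        let sum = row.sum();
        row.mapv_inplace(|x| x / sum);
    }
    out
}

/// Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V.
fn attention(q: &Array2<f32>, k: &Array2<f32>, v: &Array2<f32>) -> Array2<f32> {
    let d_k = k.ncols() as f32;
    softmax(&(q.dot(&k.t()) / d_k.sqrt())).dot(v)
}

/// Step 3: self-attention over the encoder input.
fn encode(x: &Array2<f32>) -> Array2<f32> {
    attention(x, x, x)
}

/// Step 4: decoder self-attention, then cross-attention over the encoder
/// output (queries from the decoder, keys/values from the encoder).
fn decode(y: &Array2<f32>, enc_out: &Array2<f32>) -> Array2<f32> {
    let self_attended = attention(y, y, y);
    attention(&self_attended, enc_out, enc_out)
}

fn main() {
    let (vocab, d_model) = (16, 8);
    // Constant toy weights; a real model uses learned parameters.
    let embed = Embedding {
        weights: Array2::from_elem((vocab, d_model), 0.01),
    };

    // Step 1: two distinct input sequences.
    let enc_tokens = [1usize, 4, 2];
    let dec_tokens = [3usize, 5];

    let enc_out = encode(&embed.forward(&enc_tokens)); // step 3
    let dec_out = decode(&embed.forward(&dec_tokens), &enc_out); // step 4

    // Step 5: project into vocabulary space and normalize.
    let w_out = Array2::from_elem((d_model, vocab), 0.02);
    let probs = softmax(&dec_out.dot(&w_out));
    println!("next-token distribution: {:?}", probs.row(probs.nrows() - 1));
}
```

With the constant toy weights the printed distribution is uniform; the sketch is only meant to show how the five components compose into one encoder-decoder pipeline.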

@JakubSchwenkbeck (Owner, Author) commented:

MERGE ON WORKING TESTS

@JakubSchwenkbeck merged commit 23f9866 into main on Dec 19, 2024.
1 check passed.
@JakubSchwenkbeck deleted the Transformer/Model branch on December 19, 2024 at 21:10.