English to Japanese Translator with PyTorch (Transformer from scratch)
- English to Japanese translator built with PyTorch.
- The neural network architecture is the Transformer.
- The Transformer layers are implemented from scratch in PyTorch (you can find them under layers/transformer/).
- The parallel corpus (dataset) is KFTT.
The Transformer is a neural network model proposed in the paper "Attention Is All You Need".

As the paper's title suggests, the Transformer is built on the attention mechanism; unlike RNNs and LSTMs, it does not rely on recurrent computation during training.

Many of the models that have achieved high accuracy on various NLP tasks in recent years, such as BERT, GPT-3, and XLNet, are built on the Transformer architecture.
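The attention mechanism at the heart of the Transformer is scaled dot-product attention, softmax(QKᵀ/√d_k)V. The repo implements it under layers/transformer/ScaledDotProductAttention.py; the minimal pure-Python sketch below (function names are illustrative, not the repo's actual API) shows the formula itself:

```python
import math

def softmax(xs):
    # numerically stable softmax over a list of floats
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def scaled_dot_product_attention(Q, K, V):
    # Q, K, V: lists of row vectors. Computes softmax(Q K^T / sqrt(d_k)) V.
    d_k = len(K[0])
    out = []
    for q in Q:
        # similarity of this query against every key, scaled by sqrt(d_k)
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in K]
        weights = softmax(scores)
        # weighted sum of the value vectors
        row = [sum(w * v[j] for w, v in zip(weights, V)) for j in range(len(V[0]))]
        out.append(row)
    return out
```

In the real layers this is done as batched matrix multiplications on tensors (with masking for the decoder), but the arithmetic is the same.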
Install dependencies & create a virtual environment in the project by running:
$ poetry install
Set PYTHONPATH:
export PYTHONPATH="$(pwd)"
Download & unzip the parallel corpus (KFTT) by running:
$ poetry run python ./utils/download.py
The directory structure is as follows.
.
├── const
│   └── path.py
├── corpus
│   └── kftt-data-1.0
├── figure
├── layers
│   └── transformer
│       ├── Embedding.py
│       ├── FFN.py
│       ├── MultiHeadAttention.py
│       ├── PositionalEncoding.py
│       ├── ScaledDotProductAttention.py
│       ├── TransformerDecoder.py
│       └── TransformerEncoder.py
├── models
│   ├── Transformer.py
│   └── __init__.py
├── mypy.ini
├── pickles
│   └── nn/
├── poetry.lock
├── poetry.toml
├── pyproject.toml
├── tests
│   ├── conftest.py
│   ├── layers/
│   ├── models/
│   └── utils/
├── train.py
└── utils
    ├── dataset/
    ├── download.py
    ├── evaluation/
    └── text/
You can train the model by running:
$ poetry run python train.py
epoch: 1
--------------------Train--------------------
train loss: 10.104473114013672, bleu score: 0.0,iter: 1/4403
train loss: 9.551202774047852, bleu score: 0.0,iter: 2/4403
train loss: 8.950608253479004, bleu score: 0.0,iter: 3/4403
train loss: 8.688143730163574, bleu score: 0.0,iter: 4/4403
train loss: 8.4220552444458, bleu score: 0.0,iter: 5/4403
train loss: 8.243291854858398, bleu score: 0.0,iter: 6/4403
train loss: 8.187620162963867, bleu score: 0.0,iter: 7/4403
train loss: 7.6360859870910645, bleu score: 0.0,iter: 8/4403
....
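The "bleu score" column in the log measures n-gram overlap between the model's output and the reference translation. The repo's own implementation presumably lives under utils/evaluation/; the sketch below is a deliberately simplified sentence-level BLEU (clipped n-gram precision up to bigrams, geometric mean, brevity penalty, no smoothing) just to illustrate what the metric computes:

```python
import math
from collections import Counter

def ngrams(tokens, n):
    # all contiguous n-grams of a token list
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def simple_bleu(candidate, reference, max_n=2):
    # clipped n-gram precisions for n = 1..max_n
    precisions = []
    for n in range(1, max_n + 1):
        cand = Counter(ngrams(candidate, n))
        ref = Counter(ngrams(reference, n))
        overlap = sum(min(c, ref[g]) for g, c in cand.items())
        total = max(sum(cand.values()), 1)
        precisions.append(overlap / total)
    if min(precisions) == 0:
        return 0.0  # any zero precision collapses the geometric mean
    geo = math.exp(sum(math.log(p) for p in precisions) / max_n)
    # brevity penalty: punish candidates shorter than the reference
    bp = 1.0 if len(candidate) >= len(reference) else math.exp(1 - len(reference) / len(candidate))
    return bp * geo
```

Early in training the model emits near-random tokens, so no n-grams match the reference and the score stays at 0.0, exactly as the log above shows.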
- After each epoch, the model at that point is saved under pickles/nn/.
- When training is finished, loss.png is saved under figure/.
