Skip to content

Latest commit

 

History

History
40 lines (30 loc) · 1.95 KB

CHANGELOG.md

File metadata and controls

40 lines (30 loc) · 1.95 KB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[Unreleased]

Added

  • PyTorch version Transformer & SpeechTransformer model.
  • Instruction for training transformer models on WMT14 EN->DE.
  • Audio extraction for CommonVoice/IWSLT.
  • Support weight pruning.
  • Data sampler and dataset for multilingual machine translation
  • Multilingual Translation task
  • Support int8 quantization for transformer model

Changed

Fixed

  • Compat with TensorFlow v2.4

[0.1.0] - 25th Dec., 2020

Added

  • Basic code structure for Encoder, Decoder, Model, DataPipeline, Tokenizer, Experiment, Metric, and Dataset.
  • (Model) Adds implementation of pre-norm/post-norm Transformer, Speech Transformer, BERT, GPT-2, and Wav2Vec2.0.
  • (Task) Adds implementation of sequence to sequence task and speech to text task (ASR, ST).
  • (DataPipeline, Tokenizer) Adds wrappers for commonly used tokenizers: moses, bpe, jieba, character, sentencepiece, etc.
  • (Dataset) Adds support for reading parallel corpus, speech corpora (libri-trans, MuST-C, and LibriSpeech), and TFRecords.
  • (Experiment) Adds implementation of common training procedure with mixed precision training and various distributed strategies (MirroredStrategy, Horovod, Byteps).
  • (Metric) Adds implementation of BLEU and WER metrics.
  • (Converter) Adds implementation of converting checkpoints from google BERT, OpenAI GPT-2, fairseq Transformer, and fairseq Wav2Vec2.0.
  • Add support for converting checkpoints from publicly
  • Beam search decoding and top-k/p sampling.
  • Supports averaging checkpoints, TFRecord generation, model restoring (see cli/README.md).
  • Step-by-step recipes for training an end-to-end speech translation model (see examples/speech_to_text).