Skip to content

This repository consists of Reproducing the results of the paper "NATURAL_TTS_SYNTHESIS_BY_CONDITIONING_WAVENET_ON_MEL_SPECTROGRAM_PREDICTIONS"

Notifications You must be signed in to change notification settings

CoderBose/ML_Project_Tacotron2

Repository files navigation

REPRODUCING: NATURAL TTS SYNTHESIS BY CONDITIONING WAVENET ON MEL SPECTROGRAM PREDICTIONS

Chandra Shekhar Pandey, Priyanka Bose

This project validates some of the qualitative claims made in the paper

J. Shen et al., "Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions," 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 2018, pp. 4779-4783, doi: 10.1109/ICASSP.2018.8461368.

which describes Tacotron2, a neural network for predicting mel spectograms from text, which can then be used with a vocoder to synthesize speech.

To reproduce this work:

  • Use the Colab_Chameleon_Ubuntu.ipynb notebook to reserve a server on Chameleon, configure it with the required software and libraries, and set up an SSH tunnel between your local system and this server.
  • Then, open the Tacotron2_validation.ipynb notebook on Colab: Tacotron2 validation.
  • In the top-right part of the interface, click on the ▼ next to the word "Connect". Select "Connect to a local runtime." Then paste the URL of your JupyterHub server from the first notebook.
  • You may then run the Tacotron2 notebook on Colab.

About

This repository consists of Reproducing the results of the paper "NATURAL_TTS_SYNTHESIS_BY_CONDITIONING_WAVENET_ON_MEL_SPECTROGRAM_PREDICTIONS"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published