TTS-toolkit

A minimalistic toolkit with recipes for preprocessing and handling audio and text data for Text-to-Speech modeling. Recipes are written in Python or Bash, whichever is more convenient for the task.

Note: This is a work in progress. I tried to keep the recipes generic, but some parts are specific to my project. I will improve these by adapting them to open-source tools.

Comes with recipes for...

Audio processing:

Resample audio (SoX)
PCM modification (changing the bit depth, in this case) (SoX)
Audio split in milliseconds (Pydub)
Audio split in seconds (Wave)
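
To give an idea of what these recipes look like, here is a minimal sketch, not the code shipped in this repo. It assumes SoX is installed and on the PATH and that the input is an uncompressed PCM WAV file. The function names `sox_convert_cmd` and `split_wav_seconds` and the chunk file naming are hypothetical, chosen only for illustration.

```python
import wave
from pathlib import Path


def sox_convert_cmd(src, dst, rate=None, bits=None):
    """Build a SoX command line that resamples and/or changes bit depth.

    Hypothetical helper: it only builds the argument list, it does not run SoX.
    -r sets the output sample rate and -b the output bit depth.
    """
    cmd = ["sox", str(src)]
    if rate is not None:
        cmd += ["-r", str(rate)]
    if bits is not None:
        cmd += ["-b", str(bits)]
    cmd.append(str(dst))
    return cmd


def split_wav_seconds(src, out_dir, chunk_seconds):
    """Split a PCM WAV file into consecutive chunks of chunk_seconds each.

    The last chunk may be shorter. Returns the list of written paths.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with wave.open(str(src), "rb") as reader:
        params = reader.getparams()
        frames_per_chunk = int(reader.getframerate() * chunk_seconds)
        if frames_per_chunk <= 0:
            raise ValueError("chunk_seconds is too small for this sample rate")
        index = 0
        while True:
            frames = reader.readframes(frames_per_chunk)
            if not frames:
                break
            # Assumed naming scheme: <original stem>_<4-digit index>.wav
            dst = out_dir / f"{Path(src).stem}_{index:04d}.wav"
            with wave.open(str(dst), "wb") as writer:
                # Copy channels, sample width and rate; the frame count
                # in the header is corrected when the file is closed.
                writer.setparams(params)
                writer.writeframes(frames)
            written.append(dst)
            index += 1
    return written
```

The command list can be passed to `subprocess.run(cmd, check=True)`. For example, `sox_convert_cmd("in.wav", "out.wav", rate=22050, bits=16)` corresponds to running `sox in.wav -r 22050 -b 16 out.wav`.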

Text processing:

Change text encoding
Text cleanup
Join text files into a single metadata.txt
Text split at punctuation marks
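
The text steps above can be sketched roughly as follows. This is an illustration under assumptions, not the repo's actual code: the helper names, the default encodings, the set of punctuation marks and the `id|text` metadata layout are all hypothetical and should be adapted to your data.

```python
import re
from pathlib import Path


def reencode(src, dst, src_encoding="latin-1", dst_encoding="utf-8"):
    """Rewrite a text file in another encoding (default encodings are assumptions)."""
    text = Path(src).read_text(encoding=src_encoding)
    Path(dst).write_text(text, encoding=dst_encoding)


def clean_text(text):
    """Collapse runs of whitespace (including newlines) and trim the ends."""
    return re.sub(r"\s+", " ", text).strip()


def split_at_punctuation(text, marks=".!?;"):
    """Split after any of the given marks, keeping the mark with its segment."""
    pattern = r"(?<=[" + re.escape(marks) + r"])\s+"
    return [part for part in re.split(pattern, text) if part]


def join_metadata(text_dir, out_file, sep="|"):
    """Join all .txt files in text_dir into one file, one '<stem><sep><text>' line each.

    The 'id|text' layout is an assumed convention, not something this repo prescribes.
    """
    out_path = Path(out_file)
    lines = []
    for path in sorted(Path(text_dir).glob("*.txt")):
        if path.resolve() == out_path.resolve():
            continue  # do not fold a previous metadata file into itself
        lines.append(f"{path.stem}{sep}{clean_text(path.read_text(encoding='utf-8'))}")
    out_path.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return lines
```

A typical order would be to re-encode first, then clean and split, and finally call `join_metadata("texts", "texts/metadata.txt")` on the folder of per-utterance files.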

Phoneme processing:

Transcribe metadata phonetically (save as numpy or regular phonemes)
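
As a rough idea of the two output forms, here is a small sketch that uses a toy lookup table instead of a real pronunciation dictionary or grapheme-to-phoneme tool. Everything in it is hypothetical: the lexicon entries, the `<unk>` symbol and the function names are only illustrative and do not describe how the recipe in this repo transcribes text.

```python
import re

# Toy lexicon for illustration only; a real recipe would load a
# pronunciation dictionary or call a grapheme-to-phoneme tool.
TOY_LEXICON = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}


def transcribe(text, lexicon, unknown="<unk>"):
    """Look up each word and return a flat list of phoneme symbols."""
    phonemes = []
    for word in re.findall(r"[a-z']+", text.lower()):
        phonemes.extend(lexicon.get(word, [unknown]))
    return phonemes


def build_symbol_table(lexicon, unknown="<unk>"):
    """Map every phoneme symbol to an integer id; id 0 is the unknown symbol."""
    symbols = sorted({p for pron in lexicon.values() for p in pron})
    return {symbol: i for i, symbol in enumerate([unknown] + symbols)}


def to_ids(phonemes, table):
    """Convert phoneme symbols to integer ids using the symbol table."""
    return [table[p] for p in phonemes]
```

The symbol list is the "regular phonemes" form and can be written to text as is. The integer list is the form that could be stored as a NumPy array, for example with `numpy.save`, assuming NumPy is installed.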

Dictionary creation and implementation:

To be added.

Getting started

Just clone this repo:

git clone https://github.com/annemnvz/TTS-toolkit.git
cd TTS-toolkit

Then run the recipe you need.

Remember to modify the paths and file names, or adapt the recipe to other tools, if needed.
