🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

deep-learning glow-tts hifigan melgan multi-speaker-tts python pytorch speaker-encoder speaker-encodings speech speech-synthesis tacotron text-to-speech tts tts-model vocoder voice-cloning voice-conversion voice-synthesis

Go to file

Eren Golge 4014e974d5 Remove redun		2018-01-23 15:29:39 +01:00
datasets	Change config to json 3	2018-01-22 08:29:27 -08:00
layers	New files	2018-01-22 06:59:41 -08:00
models	New files	2018-01-22 06:59:41 -08:00
png	Beginning	2018-01-22 01:48:59 -08:00
utils	Change config to json 3	2018-01-22 08:29:27 -08:00
.gitignore	new files	2018-01-22 06:59:21 -08:00
README.md	Change descriptions	2018-01-23 15:28:12 +01:00
__init__.py	Beginning	2018-01-22 01:48:59 -08:00
config.json	Change config to json 3	2018-01-22 08:29:27 -08:00
module.py	Beginning	2018-01-22 01:48:59 -08:00
requirements.txt	Change descriptions	2018-01-23 15:28:12 +01:00
synthesis.py	Beginning	2018-01-22 01:48:59 -08:00
train.py	Change config to json 3	2018-01-22 08:29:27 -08:00

README.md

TTS (Work in Progress...)

Here we have pytorch implementation of:

Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model.
Tacotron2 (TODO): Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

At the end, it should be easy to add new models and try different architectures.

You can find here a brief note about possible TTS architectures and their comparisons.

Requirements

Highly recommended to use miniconda for easier installation.

python 3.6
pytorch > 0.2.0
TODO

Data

TODO

Training the network

TODO