mirror of https://github.com/coqui-ai/TTS.git
deep-learningglow-ttshifiganmelganmulti-speaker-ttspythonpytorchspeaker-encoderspeaker-encodingsspeechspeech-synthesistacotrontext-to-speechttstts-modelvocodervoice-cloningvoice-conversionvoice-synthesis
4014e974d5 | ||
---|---|---|
datasets | ||
layers | ||
models | ||
png | ||
utils | ||
.gitignore | ||
README.md | ||
__init__.py | ||
config.json | ||
module.py | ||
requirements.txt | ||
synthesis.py | ||
train.py |
README.md
TTS (Work in Progress...)
Here we have pytorch implementation of:
- Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model.
- Tacotron2 (TODO): Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
At the end, it should be easy to add new models and try different architectures.
You can find here a brief note about possible TTS architectures and their comparisons.
Requirements
Highly recommended to use miniconda for easier installation.
- python 3.6
- pytorch > 0.2.0
- TODO
Data
TODO
Training the network
TODO