mirror of https://github.com/coqui-ai/TTS.git
Update readme (#1978)
parent
896e46d0e5
commit
dba2c3570a
19
README.md
19
README.md
|
@ -21,6 +21,8 @@
|
|||

|
||||
[](https://tts.readthedocs.io/en/latest/)
|
||||
|
||||
🚀 [**Sign up to free 🐸Coqui.ai API and start cloning your voice**](https://coqui.ai/)
|
||||
|
||||
📰 [**Subscribe to 🐸Coqui.ai Newsletter**](https://coqui.ai/?subscription=true)
|
||||
|
||||
📢 [English Voice Samples](https://erogol.github.io/ddc-samples/) and [SoundCloud playlist](https://soundcloud.com/user-565970875/pocket-article-wavernn-and-tacotron2)
|
||||
|
@ -75,7 +77,7 @@ Underlined "TTS*" and "Judy*" are 🐸TTS models
|
|||
- Modular (but not too much) code base enabling easy implementation of new ideas.
|
||||
|
||||
## Implemented Models
|
||||
### Text-to-Spectrogram
|
||||
### Spectrogram models
|
||||
- Tacotron: [paper](https://arxiv.org/abs/1703.10135)
|
||||
- Tacotron2: [paper](https://arxiv.org/abs/1712.05884)
|
||||
- Glow-TTS: [paper](https://arxiv.org/abs/2005.11129)
|
||||
|
@ -84,6 +86,7 @@ Underlined "TTS*" and "Judy*" are 🐸TTS models
|
|||
- FastPitch: [paper](https://arxiv.org/pdf/2006.06873.pdf)
|
||||
- FastSpeech: [paper](https://arxiv.org/abs/1905.09263)
|
||||
- SC-GlowTTS: [paper](https://arxiv.org/abs/2104.05557)
|
||||
- Capacitron: [paper](https://arxiv.org/abs/1906.03402)
|
||||
|
||||
### End-to-End Models
|
||||
- VITS: [paper](https://arxiv.org/pdf/2106.06103)
|
||||
|
@ -149,12 +152,12 @@ If you are on Windows, 👑@GuyPaddock wrote installation instructions [here](ht
|
|||
```
|
||||
- Get model info (for both tts_models and vocoder_models):
|
||||
- Query by type/name:
|
||||
The model_info_by_name uses the name as it from the --list_models.
|
||||
The model_info_by_name uses the name as it from the --list_models.
|
||||
```
|
||||
$ tts --model_info_by_name "<model_type>/<language>/<dataset>/<model_name>"
|
||||
```
|
||||
For example:
|
||||
|
||||
|
||||
```
|
||||
$ tts --model_info_by_name tts_models/tr/common-voice/glow-tts
|
||||
```
|
||||
|
@ -162,16 +165,16 @@ If you are on Windows, 👑@GuyPaddock wrote installation instructions [here](ht
|
|||
$ tts --model_info_by_name vocoder_models/en/ljspeech/hifigan_v2
|
||||
```
|
||||
- Query by type/idx:
|
||||
The model_query_idx uses the corresponding idx from --list_models.
|
||||
The model_query_idx uses the corresponding idx from --list_models.
|
||||
```
|
||||
$ tts --model_info_by_idx "<model_type>/<model_query_idx>"
|
||||
```
|
||||
For example:
|
||||
|
||||
|
||||
```
|
||||
$ tts --model_info_by_idx tts_models/3
|
||||
$ tts --model_info_by_idx tts_models/3
|
||||
```
|
||||
|
||||
|
||||
- Run TTS with default models:
|
||||
|
||||
```
|
||||
|
@ -241,8 +244,6 @@ If you are on Windows, 👑@GuyPaddock wrote installation instructions [here](ht
|
|||
|- TTS
|
||||
|- bin/ (folder for all the executables.)
|
||||
|- train*.py (train your target model.)
|
||||
|- distribute.py (train your TTS model using Multiple GPUs.)
|
||||
|- compute_statistics.py (compute dataset statistics for normalization.)
|
||||
|- ...
|
||||
|- tts/ (text to speech models)
|
||||
|- layers/ (model layer definitions)
|
||||
|
|
Loading…
Reference in New Issue