mirror of https://github.com/coqui-ai/TTS.git
🐸 fashion
parent
e9e07844b7
commit
0f4f26c66c
75
README.md
75
README.md
|
@ -1,12 +1,9 @@
|
|||
<img src="https://user-images.githubusercontent.com/1402048/104139991-3fd15e00-53af-11eb-8640-3a78a64641dd.png" data-canonical-src="![TTS banner](https://user-images.githubusercontent.com/1402048/104139991-3fd15e00-53af-11eb-8640-3a78a64641dd.png =250x250)
|
||||
" width="256" height="256" align="right" />
|
||||
|
||||
# TTS: Text-to-Speech for all.
|
||||
# <img src="images/coqui-log-green-TTS.png" height="56"/>
|
||||
|
||||
TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality.
|
||||
TTS comes with [pretrained models](https://github.com/mozilla/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects.
|
||||
TTS comes with [pretrained models](https://github.com/coqui-ai/TTS/wiki/Released-Models), tools for measuring dataset quality and already used in **20+ languages** for products and research projects.
|
||||
|
||||
[![CircleCI](<https://circleci.com/gh/mozilla/TTS/tree/dev.svg?style=svg>)]()
|
||||
<!-- [![CircleCI](TODO)]() -->
|
||||
[![License](<https://img.shields.io/badge/License-MPL%202.0-brightgreen.svg>)](https://opensource.org/licenses/MPL-2.0)
|
||||
[![PyPI version](https://badge.fury.io/py/TTS.svg)](https://badge.fury.io/py/TTS)
|
||||
|
||||
|
@ -22,33 +19,32 @@ Please use our dedicated channels for questions and discussion. Help is much mor
|
|||
| Type | Platforms |
|
||||
| ------------------------------- | --------------------------------------- |
|
||||
| 🚨 **Bug Reports** | [GitHub Issue Tracker] |
|
||||
| ❔ **FAQ** | [TTS/Wiki](https://github.com/mozilla/TTS/wiki/FAQ) |
|
||||
| ❔ **FAQ** | [TTS/Wiki](https://github.com/coqui-ai/TTS/wiki/FAQ) |
|
||||
| 🎁 **Feature Requests & Ideas** | [GitHub Issue Tracker] |
|
||||
| 👩💻 **Usage Questions** | [Discourse Forum] |
|
||||
| 🗯 **General Discussion** | [Discourse Forum] and [Matrix Channel] |
|
||||
| 👩💻 **Usage Questions** | [Github Discussions] |
|
||||
| 🗯 **General Discussion** | [Github Discussions] and [Matrix Channel] |
|
||||
|
||||
[github issue tracker]: https://github.com/mozilla/tts/issues
|
||||
[discourse forum]: https://discourse.mozilla.org/c/tts/
|
||||
[github issue tracker]: https://github.com/coqui-ai/tts/issues
|
||||
[github discussions]: https://github.com/coqui-ai/TTS/discussions
|
||||
[matrix channel]: https://matrix.to/#/!KTePhNahjgiVumkqca:matrix.org?via=matrix.org
|
||||
[Tutorials and Examples]: https://github.com/mozilla/TTS/wiki/TTS-Notebooks-and-Tutorials
|
||||
[Tutorials and Examples]: https://github.com/coqui-ai/TTS/wiki/TTS-Notebooks-and-Tutorials
|
||||
|
||||
|
||||
## 🔗 Links and Resources
|
||||
| Type | Links |
|
||||
| ------------------------------- | --------------------------------------- |
|
||||
| 💾 **Installation** | [TTS/README.md](https://github.com/mozilla/TTS/tree/dev#install-tts)|
|
||||
| 👩🏾🏫 **Tutorials and Examples** | [TTS/Wiki](https://github.com/mozilla/TTS/wiki/TTS-Notebooks-and-Tutorials) |
|
||||
| 🚀 **Released Models** | [TTS/Wiki](https://github.com/mozilla/TTS/wiki/Released-Models)|
|
||||
| 💻 **Docker Image** | [Repository by @synesthesiam](https://github.com/synesthesiam/docker-mozillatts)|
|
||||
| 🖥️ **Demo Server** | [TTS/server](https://github.com/mozilla/TTS/tree/master/TTS/server)|
|
||||
| 🤖 **Running TTS on Terminal** | [TTS/README.md](https://github.com/mozilla/TTS#example-synthesizing-speech-on-terminal-using-the-released-models)|
|
||||
| ✨ **How to contribute** |[TTS/README.md](#contribution-guidelines)|
|
||||
| 💾 **Installation** | [TTS/README.md](https://github.com/coqui-ai/TTS/tree/dev#install-tts)|
|
||||
| 👩🏾🏫 **Tutorials and Examples** | [TTS/Wiki](https://github.com/coqui-ai/TTS/wiki/TTS-Notebooks-and-Tutorials) |
|
||||
| 🚀 **Released Models** | [TTS/Wiki](https://github.com/coqui-ai/TTS/wiki/Released-Models)|
|
||||
| 💻 **Docker Image** | [Repository by @synesthesiam](https://github.com/synesthesiam/docker-coqui-aitts)|
|
||||
| 🖥️ **Demo Server** | [TTS/server](https://github.com/coqui-ai/TTS/tree/master/TTS/server)|
|
||||
| 🤖 **Running TTS on Terminal** | [TTS/README.md](https://github.com/coqui-ai/TTS#example-synthesizing-speech-on-terminal-using-the-released-models)|
|
||||
|
||||
## 🥇 TTS Performance
|
||||
<p align="center"><img src="https://discourse-prod-uploads-81679984178418.s3.dualstack.us-west-2.amazonaws.com/optimized/3X/6/4/6428f980e9ec751c248e591460895f7881aec0c6_2_1035x591.png" width="800" /></p>
|
||||
<!-- ## TODO -- 🥇 TTS Performance
|
||||
<p align="center"><img src="" width="800" /></p>
|
||||
|
||||
"Mozilla*" and "Judy*" are our models.
|
||||
[Details...](https://github.com/mozilla/TTS/wiki/Mean-Opinion-Score-Results)
|
||||
"TTS*" and "Judy*" are our models.
|
||||
[Details...](https://github.com/coqui-ai/TTS/wiki/Mean-Opinion-Score-Results) -->
|
||||
|
||||
## Features
|
||||
- High performance Deep Learning models for Text2Speech tasks.
|
||||
|
@ -94,9 +90,9 @@ Please use our dedicated channels for questions and discussion. Help is much mor
|
|||
You can also help us implement more models. Some TTS related work can be found [here](https://github.com/erogol/TTS-papers).
|
||||
|
||||
## Install TTS
|
||||
TTS supports **python >= 3.6, <3.9**.
|
||||
TTS is tested on Ubuntu 18.04 with **python >= 3.6, < 3.9**.
|
||||
|
||||
If you are only interested in [synthesizing speech](https://github.com/mozilla/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released TTS models, installing from PyPI is the easiest option.
|
||||
If you are only interested in [synthesizing speech](https://github.com/coqui-ai/TTS/tree/dev#example-synthesizing-speech-on-terminal-using-the-released-models) with the released TTS models, installing from PyPI is the easiest option.
|
||||
|
||||
```bash
|
||||
pip install TTS
|
||||
|
@ -105,7 +101,7 @@ pip install TTS
|
|||
If you plan to code or train models, clone TTS and install it locally.
|
||||
|
||||
```bash
|
||||
git clone https://github.com/mozilla/TTS
|
||||
git clone https://github.com/coqui-ai/TTS
|
||||
pip install -e .
|
||||
```
|
||||
|
||||
|
@ -233,25 +229,16 @@ In case of any error or intercepted execution, if there is no checkpoint yet und
|
|||
|
||||
You can also enjoy Tensorboard, if you point Tensorboard argument```--logdir``` to the experiment folder.
|
||||
|
||||
## Contribution Guidelines
|
||||
This repository is governed by Mozilla's code of conduct and etiquette guidelines. For more details, please read the [Mozilla Community Participation Guidelines.](https://www.mozilla.org/about/governance/policies/participation/)
|
||||
## Contribution guidelines
|
||||
Please send your Pull Request to ```dev``` branch. Before making a Pull Request, check your changes for basic mistakes and style problems by using a linter. We have cardboardlinter setup in this repository, so for example, if you've made some changes and would like to run the linter on just the changed code, you can use the follow command:
|
||||
|
||||
1. Create a new branch.
|
||||
2. Implement your changes.
|
||||
3. (if applicable) Add [Google Style](https://google.github.io/styleguide/pyguide.html#381-docstrings) docstrings.
|
||||
4. (if applicable) Implement a test case under ```tests``` folder.
|
||||
5. (Optional but Prefered) Run tests.
|
||||
```bash
|
||||
./run_tests.sh
|
||||
```
|
||||
6. Run the linter.
|
||||
```bash
|
||||
pip install pylint cardboardlint
|
||||
cardboardlinter --refspec master
|
||||
```
|
||||
7. Send a PR to ```dev``` branch, explain what the change is about.
|
||||
8. Let us discuss until we make it perfect :).
|
||||
9. We merge it to the ```dev``` branch once things look good.
|
||||
8. Let us discuss until we make it perfect :).
|
||||
9. We merge it to the ```dev``` branch once things look good.
|
||||
|
||||
Feel free to ping us at any step you need help using our communication channels.
|
||||
|
||||
|
@ -263,16 +250,6 @@ If you like to use TTS to try a new idea and like to share your experiments with
|
|||
- Explain your idea and experiment.
|
||||
- Share your results regularly. (Tensorboard log files, audio results, visuals etc.)
|
||||
|
||||
## Major TODOs
|
||||
- [x] Implement the model.
|
||||
- [x] Generate human-like speech on LJSpeech dataset.
|
||||
- [x] Generate human-like speech on a different dataset (Nancy) (TWEB).
|
||||
- [x] Train TTS with r=1 successfully.
|
||||
- [x] Enable process based distributed training. Similar to (https://github.com/fastai/imagenet-fast/).
|
||||
- [x] Adapting Neural Vocoder. TTS works with WaveRNN and ParallelWaveGAN (https://github.com/erogol/WaveRNN and https://github.com/erogol/ParallelWaveGAN)
|
||||
- [x] Multi-speaker embedding.
|
||||
- [x] Model optimization (model export, model pruning etc.)
|
||||
|
||||
### Acknowledgement
|
||||
- https://github.com/keithito/tacotron (Dataset pre-processing)
|
||||
- https://github.com/r9y9/tacotron_pytorch (Initial Tacotron architecture)
|
||||
|
|
Binary file not shown.
After Width: | Height: | Size: 60 KiB |
Loading…
Reference in New Issue