Update docs (#2389)

* Update docs index

* Add MarryTTS docs

* Update docs index

* Add Overflow docs
pull/2390/head
Eren Gölge 2023-03-13 12:42:20 +01:00 committed by GitHub
parent 4ca07514d4
commit c10f9a3699
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
3 changed files with 38 additions and 1 deletions

View File

@ -28,6 +28,7 @@
formatting_your_dataset
what_makes_a_good_dataset
tts_datasets
marytts
.. toctree::
:maxdepth: 2
@ -48,10 +49,10 @@
models/vits.md
models/forward_tts.md
models/tacotron1-2.md
models/overflow.md
.. toctree::
:maxdepth: 2
:caption: `vocoder` Models
```

0
docs/source/marytts.md Normal file
View File

View File

@ -0,0 +1,36 @@
# Overflow TTS
Neural HMMs are a type of neural transducer recently proposed for
sequence-to-sequence modelling in text-to-speech. They combine the best features
of classic statistical speech synthesis and modern neural TTS, requiring less
data and fewer training updates, and are less prone to gibberish output caused
by neural attention failures. In this paper, we combine neural HMM TTS with
normalising flows for describing the highly non-Gaussian distribution of speech
acoustics. The result is a powerful, fully probabilistic model of durations and
acoustics that can be trained using exact maximum likelihood. Compared to
dominant flow-based acoustic models, our approach integrates autoregression for
improved modelling of long-range dependences such as utterance-level prosody.
Experiments show that a system based on our proposal gives more accurate
pronunciations and better subjective speech quality than comparable methods,
whilst retaining the original advantages of neural HMMs. Audio examples and code
are available at https://shivammehta25.github.io/OverFlow/.
## Important resources & papers
- HMM: https://de.wikipedia.org/wiki/Hidden_Markov_Model
- OverflowTTS paper: https://arxiv.org/abs/2211.06892
- Neural HMM: https://arxiv.org/abs/2108.13320
- Audio Samples: https://shivammehta25.github.io/OverFlow/
## OverflowConfig
```{eval-rst}
.. autoclass:: TTS.tts.configs.overflow_config.OverflowConfig
:members:
```
## Overflow Model
```{eval-rst}
.. autoclass:: TTS.tts.models.overflow.Overflow
:members:
```