Commit Graph

2993 Commits (30211512a44bbb62a9fe9e074ef055414960bdc8)

Author SHA1 Message Date
Eren Gölge a878d8fb42 update tests 2021-04-23 18:04:37 +02:00
Eren Gölge 7eb0c60d2e let synthesizer to pass speaker encoder file paths to speaker manager 2021-04-23 18:04:37 +02:00
Eren Gölge f69195739e let speaker manager compute mean x_vector from multiple wav files 2021-04-23 18:04:37 +02:00
Eren Gölge 179722e3a7 new arguments to synthesize.py for loading speaker encoder and speaker wavs 2021-04-23 18:04:37 +02:00
Eren Gölge dfa415a8b8 small refactor in server.py 2021-04-23 18:04:37 +02:00
Eren Gölge c80d21f311 load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager 2021-04-23 18:04:37 +02:00
Eren Gölge ad047c8195 html formatting, enable multi-speaker model on the server with a dropdown menu to select the speaker 2021-04-23 18:04:37 +02:00
Eren Gölge f9f3d04d14 remove moved function 2021-04-23 18:04:37 +02:00
Eren Gölge 10c988ac8c update server.py 2021-04-23 18:04:37 +02:00
Eren Gölge 32e6afc009 add `SpeakerManager` tests 2021-04-23 18:04:37 +02:00
Eren Gölge 6d0f5e0459 use SpeakerManager in Synthesizer 2021-04-23 18:04:37 +02:00
Eren Gölge e97126314c add ```unique``` argument to make_symbols to fix the incompat. issue of the
SC-Glow models
2021-04-23 18:04:37 +02:00
Eren Gölge d08888e603 formating speakers.py 2021-04-23 18:04:37 +02:00
Eren Gölge df422223a3 initial SpeakerManager implementation 2021-04-23 18:04:37 +02:00
Eren Gölge 7a7aeb35f5 fix the glow-tts in setup_model 2021-04-23 18:04:37 +02:00
Eren Gölge d42748082a update argument name external_speaker_embedding_dim -> speaker_embedding_dim
add inference_noise_scale argument to glow-tts
2021-04-23 18:04:37 +02:00
Eren Gölge 2da81f5bb6 add load_chekpoint to speaker encoder 2021-04-23 18:04:37 +02:00
Eren Gölge 1229ccbf07 update argument name in server.py 2021-04-23 18:04:37 +02:00
Eren Gölge af2d36faeb update synthesize.py for multi-speaker setting 2021-04-23 18:04:37 +02:00
Eren Gölge 99dc07a7dd add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set) 2021-04-23 18:04:37 +02:00
Eren Gölge c955a12428 set the default layer size compatible with scglow 2021-04-23 18:04:37 +02:00
Eren Gölge 3ace2440fa fix a mistake from rebase 2021-04-23 18:04:37 +02:00
Eren Gölge aadb2106ec code styling 2021-04-23 18:04:37 +02:00
Eren Gölge af7baa3387 refactoring to allow defining the speaker file externally 2021-04-23 18:04:37 +02:00
kirianguiller f393c08ad4 add usage of new Synthetizer class in the chinese model notebook 2021-04-23 18:04:37 +02:00
kirianguiller 7dccbfdcd5 handle multi speaker and gst in Synthetizer class 2021-04-23 18:04:37 +02:00
Eren Gölge cc4efb437b
Merge pull request #446 from WeberJulian/fix-windows
fix windows support
2021-04-23 18:04:13 +02:00
Edresson d2b6326b8b change optimizer initialization for compatibility with Hifi-GAN official implementation 2021-04-23 07:54:39 -03:00
WeberJulian 4205284f92
Change name of the functions 2021-04-23 10:09:55 +02:00
WeberJulian a26498181b Change back the default value 2021-04-22 16:10:17 +02:00
Julian Weber 355e1f47ab fix dumb mistake 2021-04-22 15:50:29 +02:00
Julian Weber c125b71f36 fix windows support 2021-04-22 15:14:24 +02:00
Eren Gölge a6cd04456e Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2021-04-22 12:39:12 +02:00
Jörg Thalheim f5fd7f78d4 server: also listen to ipv6
The [::] address will listen to both ipv4/ipv6 addresses.
2021-04-22 12:38:55 +02:00
Eren Gölge ef37633cb3 [ci skip] use prenet_dropout by default with Tacotron models 2021-04-22 12:38:55 +02:00
Eren Gölge 0ee3eeefb5
[ci skip] update CONTRIBUTING.md 2021-04-21 13:55:50 +02:00
Eren Gölge 39ceb3f7df
Update README.md 2021-04-21 13:50:35 +02:00
Eren Gölge 757dfb9289 add `SpeakerManager` tests 2021-04-21 13:28:43 +02:00
Eren Gölge e1d960da9e use SpeakerManager in Synthesizer 2021-04-21 13:13:27 +02:00
Eren Gölge 04b6881b66 add ```unique``` argument to make_symbols to fix the incompat. issue of the
SC-Glow models
2021-04-21 13:12:35 +02:00
Eren Gölge 790946faec formating speakers.py 2021-04-21 13:12:11 +02:00
Eren Gölge ab313814de initial SpeakerManager implementation 2021-04-21 13:11:46 +02:00
Eren Gölge 09890c7421 fix the glow-tts in setup_model 2021-04-21 13:10:40 +02:00
Eren Gölge 8764d02eb2 update argument name external_speaker_embedding_dim -> speaker_embedding_dim
add inference_noise_scale argument to glow-tts
2021-04-21 13:09:44 +02:00
Eren Gölge 8b40720977 add load_chekpoint to speaker encoder 2021-04-21 13:09:04 +02:00
Eren Gölge 37cad38c27 update argument name in server.py 2021-04-21 13:08:45 +02:00
Eren Gölge 9bccee9da8 update synthesize.py for multi-speaker setting 2021-04-21 13:08:25 +02:00
Eren Gölge d0786be2a3
remove matrix link 2021-04-19 11:07:40 +02:00
Eren Gölge e31f174cf8
Merge pull request #435 from Mic92/ipv6
server: also listen to ipv6
2021-04-16 20:34:49 +02:00
Eren Gölge d2fa8add1f add ```unique``` param to keep scglow models compatible (they are duplicate symbols ins the character set) 2021-04-16 19:40:13 +02:00