Commit Graph

2986 Commits (6c495c6a6ee4775d11b8a2026bd03a205f588b31)

Author SHA1 Message Date
Eren Gölge f7582107da
Merge pull request #453 from Edresson/dev
Script for spectrogram extraction using teacher forcing and Glow-TTS inference with MAS.
2021-05-06 17:53:28 +02:00
Edresson 070227d2ab change python3 to python in the extract tts espectrograms script to avoid incompatibility 2021-05-06 11:32:42 -03:00
Edresson 65860a954a remove unused vars on test glow tts 2021-05-05 07:15:36 -03:00
Edresson d78f27ea41 bugfix on tacotron unit test 2021-05-05 06:38:01 -03:00
Edresson e3f56b613b update unit test for extract tts spectrograms script 2021-05-04 20:45:07 -03:00
Edresson 501c8e0302 remove unused vars on extract tts spectrograms script 2021-05-04 19:04:13 -03:00
Eren Gölge 0325c58862
Merge pull request #468 from shaun95/patch-1
Update losses.py
2021-05-03 14:45:24 +02:00
Eren Gölge b10333c238 fix #467 2021-05-03 14:38:42 +02:00
Eren Gölge 8cb27267a4 formatting 2021-05-03 14:26:35 +02:00
Eren Gölge 87d674a038 bumpup librosa version to 0.8.0 2021-05-03 14:25:09 +02:00
Eren Gölge 110d03e0db prevenet numba logs in nosetest 2021-05-03 14:21:18 +02:00
shaun 7d0ec62bf1
Update losses.py
The block of code for use_l1_spec_loss is repeated which doubles the amount of L1 loss when enabled.
The weight for L1 loss in hifigan_ljspeech configutation will likely need to be doubled to compensate (l1_spec_loss_weight)
2021-05-02 14:14:24 +02:00
Edresson 3ecd556bbe add unit test for extract tts spectrograms script 2021-05-01 13:41:56 -03:00
Edresson bb82f4ae8b add unit test for GlowTTS inference with MAS 2021-04-29 19:39:09 -03:00
Edresson 446b1da936 create inference function 2021-04-29 18:18:37 -03:00
Eren Gölge b00b1d4680 remove the death link to the docker image 2021-04-29 18:00:47 +02:00
Eren Gölge 2f579d7416 add the requirements in the MANIFFEST 2021-04-29 18:00:24 +02:00
Eren Gölge f02f0338c2 fix .models.json and add testing to check released models availability 2021-04-29 09:32:36 +02:00
Eren Gölge fd95e9b8a4 [ci skip] Add sam models 2021-04-28 21:57:31 +02:00
Eren Gölge ed1de4e0db Merge branch 'pr/agrinh/457-2' into dev 2021-04-28 21:50:44 +02:00
Eren Gölge 79663bc944 update the readme 2021-04-28 21:49:44 +02:00
Agrin Hilmkil 7ea9bc63b0 Add missing pandas dependency 2021-04-28 13:57:29 +02:00
Agrin Hilmkil 351d0ed6ae Remove unnecessary fsspec usage 2021-04-28 11:21:08 +02:00
Agrin Hilmkil bf2b9958be Sort dependencies alphabetically 2021-04-28 11:21:08 +02:00
Agrin Hilmkil 1c8479f703 Remove unnecessary instruction 2021-04-28 11:20:06 +02:00
Agrin Hilmkil 167f86417e Move dev, tf, notebook dependencies to extras 2021-04-28 11:20:06 +02:00
Eren Gölge af2955fa19 bump up version 2021-04-27 18:02:46 +02:00
Eren Gölge 6353e87166 fix test 2021-04-27 15:04:20 +02:00
Eren Gölge 628abfe644 remove test 2021-04-27 14:35:39 +02:00
Eren Gölge 1235e54738 test for synthesize.py 2021-04-27 14:17:38 +02:00
Eren Gölge 19d9f58009 create dummy model on the fly 2021-04-27 13:27:24 +02:00
Eren Gölge 4719414f2e remove imports 2021-04-27 11:25:17 +02:00
Eren Gölge add97cddc1 move function and remove import 2021-04-27 11:22:56 +02:00
Eren Gölge 8f0519d203 bump up numpy version 2021-04-27 11:13:57 +02:00
Eren Gölge 734e6a515c bug fix 2021-04-27 10:27:45 +02:00
Eren Gölge 6bdd81667e place holders for sc-glow and hifigan models 2021-04-26 19:53:12 +02:00
Eren Gölge 2f0716073e enable multi-speaker CoquiTTS models for synthesize.py 2021-04-26 19:36:53 +02:00
Eren Gölge b531fa699c remove conflicy noise 2021-04-26 15:27:52 +02:00
Eren Gölge f37b488876 Merge branch 'speaker-manager' of https://github.com/coqui-ai/TTS into speaker-manager 2021-04-26 15:25:25 +02:00
Eren Gölge b82daa5e86 style and linter fixes 2021-04-26 15:22:24 +02:00
Edresson 20e42a3381 add save audio option 2021-04-23 15:00:00 -03:00
Edresson 8228091f92 add script for extraction of tts spectrograms 2021-04-23 14:17:46 -03:00
Eren Gölge 4cf211348d styling and linting 2021-04-23 18:04:37 +02:00
Eren Gölge a878d8fb42 update tests 2021-04-23 18:04:37 +02:00
Eren Gölge 7eb0c60d2e let synthesizer to pass speaker encoder file paths to speaker manager 2021-04-23 18:04:37 +02:00
Eren Gölge f69195739e let speaker manager compute mean x_vector from multiple wav files 2021-04-23 18:04:37 +02:00
Eren Gölge 179722e3a7 new arguments to synthesize.py for loading speaker encoder and speaker wavs 2021-04-23 18:04:37 +02:00
Eren Gölge dfa415a8b8 small refactor in server.py 2021-04-23 18:04:37 +02:00
Eren Gölge c80d21f311 load speaker_encoder_ap and compute x_vector directly from the input file in speaker manager 2021-04-23 18:04:37 +02:00
Eren Gölge ad047c8195 html formatting, enable multi-speaker model on the server with a dropdown menu to select the speaker 2021-04-23 18:04:37 +02:00