Julian WEBER
78c2d12a91
PitchExtractor
2021-12-20 11:54:10 +00:00
Julian WEBER
9a2f91327c
get_aux_input
2021-12-20 11:54:10 +00:00
Julian WEBER
b3abd01793
Merge dataset
2021-12-20 11:54:10 +00:00
Edresson
10ff90d6d2
Add remove silence VAD script
2021-12-20 11:54:10 +00:00
Edresson
1bd1a0546b
Add audio resample in the speaker consistency loss
2021-12-20 11:54:10 +00:00
Edresson
1c6bcda950
Add freeze vocoder generator and flow-based decoder option
2021-12-20 11:54:10 +00:00
WeberJulian
2b952d8b97
freeze vits parts
2021-12-20 11:54:10 +00:00
WeberJulian
005bba60b0
get_speaker_weighted_sampler
2021-12-20 11:54:10 +00:00
Edresson
9de4539422
Update the VITS model docs
2021-12-20 11:54:10 +00:00
Edresson
eeb8ac07d9
Add voice conversion fine tuning mode
2021-12-20 11:54:10 +00:00
WeberJulian
6ac31e4152
Added a notbook for d-vector multilingual VITS
2021-12-20 11:54:09 +00:00
Edresson
690b37d0ab
Add support to use the speaker encoder as loss function in VITS model
2021-12-20 11:54:09 +00:00
Edresson
9b011b1cb3
Add H/ASP original checkpoint support
2021-12-20 11:54:09 +00:00
Edresson
0bdfd3cb50
Add the ValueError in the restore checkpoint exception to avoid problems with the optimizer restauration when new keys are addition
2021-12-20 11:54:09 +00:00
Edresson
bbdb5c38e6
Add VITS multispeaker train unit test
2021-12-20 11:54:09 +00:00
Edresson
92f7f4f400
Active the multispeaker mode in multilingual training
2021-12-20 11:54:09 +00:00
Edresson
de78556655
Fix the optimizer parameters bug in multilingual and multispeaker training
2021-12-20 11:54:09 +00:00
Edresson
9be5b75da3
Fix bug after merge
2021-12-20 11:54:09 +00:00
Edresson
76251b619a
Fix d-vector multispeaker training bug
2021-12-20 11:54:09 +00:00
Edresson
e68b042493
Add VITS d-vector unit test
2021-12-20 11:54:09 +00:00
Edresson
959cc8f03c
Add VITS multilingual unit test
2021-12-20 11:54:09 +00:00
Edresson
7ef3ddc6ff
Fix unit tests
2021-12-20 11:54:09 +00:00
Edresson
36dcd11453
Fix pylint issues
2021-12-20 11:54:09 +00:00
Edresson
c53693c155
Implement vocoder Fine Tuning like SC-GlowTTS paper
2021-12-20 11:54:09 +00:00
Edresson
f1f016314e
Fix the bug in M-AILABS formatter
2021-12-20 11:54:09 +00:00
Edresson
c334d39acc
Add voice conversion support for the model VITS trained with external speaker embedding
2021-12-20 11:54:09 +00:00
Edresson
e997889ba8
Fix bug in VITS multilingual inference
2021-12-20 11:54:09 +00:00
Edresson
7c0b8ec572
Fix bugs in the non-multilingual VITS inference
2021-12-20 11:54:09 +00:00
Edresson
3fbbebd74d
Fix pylint issues
2021-12-20 11:54:09 +00:00
Edresson
ac9416fb86
Add multilingual inference support
2021-12-20 11:54:09 +00:00
Edresson
dcb2374bc9
Add multilingual training support to the VITS model
2021-12-20 11:54:09 +00:00
Edresson
f996afedb0
Implement multilingual dataloader support
2021-12-20 11:54:09 +00:00
Edresson
5f1c18187f
Fix pylint issues
2021-12-20 11:54:09 +00:00
Edresson
d91c595c5a
Implement training support with d_vecs in the VITS model
2021-12-20 11:54:09 +00:00
Edresson
6a7db67a91
Allow ignore speakers for all multispeaker datasets
2021-12-20 11:54:09 +00:00
Edresson
e0ad838066
Select randomly a speaker from the speaker manager for the test setences
2021-12-20 11:54:09 +00:00
Edresson
eb3e8affe1
Save speakers embeddings/ids before starting training
2021-12-20 11:54:09 +00:00
Eren Gölge
37803467aa
Merge pull request #1021 from loganhart420/dataset_downloaders
...
Add addtional datasets
2021-12-20 10:42:20 +01:00
Eren Gölge
6b8ed18dcc
Merge pull request #1022 from coqui-ai/papercuts
...
Fix some setup papercuts
2021-12-20 10:40:57 +01:00
Reuben Morais
0635e8b931
Try to speed up CI install times
2021-12-17 14:01:25 +01:00
Reuben Morais
0057494a2a
Remove broken link
2021-12-17 11:37:19 +01:00
Reuben Morais
859ac1a54c
Include usage instructions in README
2021-12-17 11:37:19 +01:00
Reuben Morais
78100851f2
Upload sdist and wheel to PyPI
2021-12-17 11:37:19 +01:00
loganhart420
103c010eca
Add addtional datasets
2021-12-16 07:21:27 -05:00
Eren Gölge
7f1a23787e
Merge pull request #914 from coqui-ai/dev
...
v0.4.2
2021-12-08 16:41:44 +01:00
Jörg Thalheim
bce143c738
server: fix compatibility with tts_models/en/ljspeech/fast_pitch ( #893 )
2021-12-07 14:36:29 +01:00
Eren Gölge
babdd84f91
Fix GST inference
...
commit d3e477875a7e46a101fcf95a1794442823750fe2
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Wed Nov 3 10:16:12 2021 +0000
Read .wav for GST conditioning from CL
commit 074e6d0874d3b34fb6a4991fc17d66dccd413fbb
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 14:43:47 2021 +0100
Fix GST during inference in Tacotron2
commit fdece14585ab5a36eed1061a9a838d8e48aa6882
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Wed Nov 3 10:16:12 2021 +0000
Read .wav for GST conditioning from CL
commit cd29e21b8d0a541ee298d2bf5f67223ad60be38f
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 14:43:47 2021 +0100
Fix GST during inference in Tacotron2
commit 908ce39370eadcc9fa8510cdb26c9ead87305427
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 12:49:37 2021 +0100
Make trim_db value negative
commit 1008a2e0f72fa7ca7f0307424f570386f2f16d42
Author: George Rousssos <25833833+george-roussos@users.noreply.github.com>
Date: Fri Oct 29 12:22:24 2021 +0100
Set find_endpoint db threshold in config.json
2021-12-07 13:28:49 +00:00
Eren Gölge
ce45d9e1af
Make style and lint
2021-12-01 10:42:52 +00:00
Eren Gölge
40cb8ac966
Fix #958
2021-12-01 10:33:34 +00:00
Eren Gölge
512ada7548
Fix callbacks against multi-gpu training
2021-12-01 10:32:14 +00:00