Commit Graph

3787 Commits (5cca4aa8aebe689cebd1dbda70ad648b42ee5407)

Author SHA1 Message Date
Edresson 1251d04387 Fix function name 2021-12-09 13:37:50 +00:00
Edresson 9781e4d516 Lint fixs 2021-12-09 13:37:50 +00:00
Edresson 8ae22725e1 Fix the bug in extract tts spectrograms 2021-12-09 13:37:50 +00:00
Edresson d0cb700e7d Set the new_fields in copy_model_files as None by default 2021-12-09 13:37:50 +00:00
Edresson 5ba416e2e8 Remove the call to get_speaker_manager 2021-12-09 13:37:50 +00:00
Edresson cad82a9296 Remove the data from the set_d_vectors_from_file function 2021-12-09 13:37:50 +00:00
Edresson ec31dacbb7 Remove unusable speaker manager function 2021-12-09 13:37:50 +00:00
Edresson 86b2536491 Turn more clear the VITS loss function 2021-12-09 13:37:50 +00:00
Edresson 5fc127bb7a Remove the unusable fine-tuning model 2021-12-09 13:37:50 +00:00
Edresson 89019d49a2 Create a module for the VAD script 2021-12-09 13:36:48 +00:00
WeberJulian 390096fe0f fix d-vector 2021-12-09 13:36:48 +00:00
WeberJulian 868cf6424f Fix small issues 2021-12-09 13:36:48 +00:00
WeberJulian e04577575e Fix use_speaker_embedding logic 2021-12-09 13:36:48 +00:00
WeberJulian 5f40e96010 Fix continue path 2021-12-09 13:36:48 +00:00
WeberJulian 61251bd86c Fix phonemes 2021-12-09 13:36:48 +00:00
WeberJulian b1df118b81 fix imports for load_meta_data 2021-12-09 13:36:48 +00:00
WeberJulian 686c7381e2 fix phonemes per language 2021-12-09 13:36:48 +00:00
WeberJulian 215a74b32e fix linter 2021-12-09 13:36:48 +00:00
WeberJulian 3e9ca4b95d make style 2021-12-09 13:36:48 +00:00
WeberJulian e1bdeacd2e Add torchaudio in requirements.txt 2021-12-09 13:36:08 +00:00
WeberJulian 88d6399e12 fix test sentence synthesis 2021-12-09 13:35:43 +00:00
WeberJulian 20ac31dc71 fix f0_cache_path in dataset 2021-12-09 13:35:12 +00:00
WeberJulian 6ed55ba57e fix test vits 2021-12-09 13:35:12 +00:00
WeberJulian 21b49c3acd fix collate_fn 2021-12-09 13:34:33 +00:00
Julian WEBER ec83ffbd7a PitchExtractor 2021-12-09 13:34:33 +00:00
Julian WEBER 3440c54bbe get_aux_input 2021-12-09 13:34:33 +00:00
Julian WEBER 5c89803968 Merge dataset 2021-12-09 13:33:35 +00:00
Edresson c80cf67d3d Add remove silence VAD script 2021-12-09 13:33:05 +00:00
Edresson 3ac428340d Add audio resample in the speaker consistency loss 2021-12-09 13:32:25 +00:00
Edresson 39aff6685e Add freeze vocoder generator and flow-based decoder option 2021-12-09 13:31:04 +00:00
WeberJulian de41165af4 freeze vits parts 2021-12-09 13:31:04 +00:00
WeberJulian 9d2c445e3d get_speaker_weighted_sampler 2021-12-09 13:31:04 +00:00
Edresson 56480360cf Update the VITS model docs 2021-12-09 13:29:58 +00:00
Edresson cd7639ca70 Add voice conversion fine tuning mode 2021-12-09 13:29:58 +00:00
WeberJulian 2be38aad3f Added a notbook for d-vector multilingual VITS 2021-12-09 13:29:58 +00:00
Edresson 3cd889a9d4 Add support to use the speaker encoder as loss function in VITS model 2021-12-09 13:29:58 +00:00
Edresson a3901032f4 Add H/ASP original checkpoint support 2021-12-09 13:28:16 +00:00
Edresson fee01daa09 Add the ValueError in the restore checkpoint exception to avoid problems with the optimizer restauration when new keys are addition 2021-12-09 13:27:21 +00:00
Edresson ecf327a118 Add VITS multispeaker train unit test 2021-12-09 13:27:21 +00:00
Edresson 2bba769e67 Active the multispeaker mode in multilingual training 2021-12-09 13:27:21 +00:00
Edresson 256197b6aa Fix the optimizer parameters bug in multilingual and multispeaker training 2021-12-09 13:27:21 +00:00
Edresson f4abb19515 Fix bug after merge 2021-12-09 13:26:33 +00:00
Edresson d7042ecfd8 Fix d-vector multispeaker training bug 2021-12-09 13:26:33 +00:00
Edresson 08da902af3 Add VITS d-vector unit test 2021-12-09 13:18:36 +00:00
Edresson 859cf1bfac Add VITS multilingual unit test 2021-12-09 13:18:36 +00:00
Edresson 82611cfcd3 Fix unit tests 2021-12-09 13:18:36 +00:00
Edresson cfa9910f9d Fix pylint issues 2021-12-09 13:16:32 +00:00
Edresson 9071bf326f Implement vocoder Fine Tuning like SC-GlowTTS paper 2021-12-09 13:16:32 +00:00
Edresson 3df5d9a619 Fix the bug in M-AILABS formatter 2021-12-09 13:11:06 +00:00
Edresson d653227e59 Add voice conversion support for the model VITS trained with external speaker embedding 2021-12-09 13:11:06 +00:00