Commit Graph

57 Commits (p3_11)

Author SHA1 Message Date
Eren Gölge dae79b0acd
Remove `/` prefix from the relative path (#2065) 2022-10-10 13:32:27 +02:00
Eren Gölge 9e5a469c64
d-vector handling (#1945)
* Update BaseDatasetConfig

- Add dataset_name
- Chane name to formatter_name

* Update compute_embedding

- Allow entering dataset by args
- Use released model by default
- Use the new key format

* Update loading

* Update recipes

* Update other dep code

* Update tests

* Fixup

* Load multiple embedding files

* Fix argument names in dep code

* Update docs

* Fix argument name

* Fix linter
2022-09-13 14:10:33 +02:00
Eren Gölge 72d85e53c9
Update model file extension (#1422)
* Update model file ext to ```.pth```

* Update docs

* Rename more

* Find model files
2022-03-22 17:55:00 +01:00
Edresson Casanova f81892483d
REBASED: Transform Speaker Encoder in a Generic Encoder and Implement Emotion Encoder training support (#1349)
* Rename Speaker encoder module to encoder

* Add a generic emotion dataset formatter

* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config

* Add class map in emotion config

* Add Base encoder config

* Add evaluation encoder script

* Fix the bug in plot_embeddings

* Enable Weight decay for encoder training

* Add argumnet to disable storage

* Add Perfect Sampler and remove storage

* Add evaluation during encoder training

* Fix lint checks

* Remove useless config parameter

* Active evaluation in speaker encoder test and use multispeaker dataset for this test

* Unit tests fixs

* Remove useless tests for speedup the aux_tests

* Use get_optimizer in Encoder

* Add BaseEncoder Class

* Fix the unitests

* Add Perfect Batch Sampler unit test

* Add compute encoder accuracy in a function
2022-03-11 14:43:40 +01:00
Eren Gölge 75c507c36a Update VITS LJspeech recipe 2022-02-25 10:57:35 +01:00
Eren Gölge 2033e17c44 Add VITS model tests 2021-12-29 16:51:40 +00:00
Eren Gölge fcfd95669a Update model test configs 2021-06-28 17:03:19 +02:00
Edresson bc5307caa0 add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix 2021-05-26 20:35:58 -03:00
Eren Gölge faedea4b60 set use_phonemes False in configs 2021-05-19 00:08:07 +02:00
Eren Gölge 0213e1cbf4 update configs for tts models to match the field typed with the expected
values
2021-05-12 00:57:38 +02:00
Eren Gölge 5aee30443f refactoring tests after Coqpit 2021-05-11 11:30:00 +02:00
Eren Gölge a21c0b5585 config update 2 WIP 2021-05-11 11:28:35 +02:00
Eren Gölge 32e6afc009 add `SpeakerManager` tests 2021-04-23 18:04:37 +02:00
Eren Gölge a7f6045644 Merge branch 'reformat' into hifigan-reformat 2021-04-12 12:00:17 +02:00
Eren Gölge 5fd2333c91 move bash based tests to a separate folder and rename the calling script file. Fix align_tts training test. 2021-04-09 03:07:58 +02:00
Eren Gölge 5de7eb708b fix the config 2021-04-09 00:36:49 +02:00
Eren Gölge 53f54898bc small fixes 2021-04-08 14:22:47 +02:00
Eren Gölge a7fb49843b update test config 2021-04-08 13:18:03 +02:00
Eren Gölge c8953f4da9 l1_spec_loss parameters in melgan test config 2021-04-07 12:36:36 +02:00
Eren Gölge 7a382a5c2b stowed aligntts commit and small refactoring with feed_forward layers 2021-03-30 14:39:16 +02:00
Eren Gölge fd5c24c3b8 add missing config file 2021-03-18 13:33:23 +01:00
Eren Gölge 43672e206c test config for tacotron model 2021-03-18 13:33:00 +01:00
Eren Gölge 65533f33e9 fix #374 2021-03-18 13:33:00 +01:00
gerazov 2451a813a2 refactored keep_all_best 2021-03-08 02:57:11 +01:00
gerazov 62147994d4 added keep_best and keep_after to test configs. 2021-03-08 02:56:36 +01:00
Eren Gölge 6cd642c2e1 add missing phonemes to test_config.json 2021-03-08 02:54:47 +01:00
Eren Gölge 83e50757ae use single process dataloder in tests 2021-03-08 02:54:21 +01:00
Eren Gölge 08581deb61 linter updates 2021-03-08 02:53:02 +01:00
erogol 79c841ccd3 mass refactoring and update 2021-01-11 17:26:58 +01:00
erogol d382d759b3 small fixes and test fixes 2021-01-08 15:48:40 +01:00
erogol fede46e96e pylint and test fixes 2021-01-06 13:19:40 +01:00
erogol 070146e143 add monotonic dynamic convolution attention 2021-01-06 13:18:41 +01:00
erogol 06612ce305 test fixes 2020-12-07 15:57:34 +01:00
erogol 6cc464ead6 fix ton of tesnting bugs 2020-11-12 16:33:29 +01:00
erogol 116e2299b0 adding more tests and refactoring 2020-11-09 13:30:42 +01:00
erogol 73581cd94c renaming train scripts and updating tests 2020-10-29 16:50:07 +01:00
sanjaesc ea9d8755de add wavernn tests + name refactoring 2020-10-27 12:17:38 +01:00
Edresson c1fff5b556 add unit tests for SC-GST 2020-09-29 17:03:25 -03:00
mueller91 45b3c3d1b0 fix: Update common_voice.tsv and test_preprocessors.py to current .tsv format
(common_voice.tsv is the first 6 lines of the dev.tsv)
2020-09-21 11:57:38 +02:00
erogol d4319fe42d linter changes and train_tts_test and train_vocoder_test fixes 2020-09-07 13:46:35 +02:00
SanjaESC 84b7ab6ee6 Added support for Tacotron2 GST + abbility to condition style input with wav or tokens 2020-08-05 19:16:19 +02:00
erogol fe081d4f7c fixing rebase issues 2020-08-05 18:33:22 +02:00
Edresson def7e49f59 travis unit tests fix and add Tacotron and Tacotron 2 GST and MultiSpeaker Tests 2020-08-05 18:00:20 +02:00
SanjaESC eb51d5409a pylint 2020-08-05 17:36:36 +02:00
SanjaESC bdf6944665 fix fft_size key error 2020-08-05 17:36:36 +02:00
erogol f35504f187 rebranding and replacing import statements 2020-08-04 10:51:19 +02:00
erogol 82dd465365 Mass refactoring 2020-07-17 11:16:05 +02:00
erogol ad235f0481 update test server_config and mitigate https://github.com/librosa/librosa/issues/1160 in server package test 2020-07-12 15:37:10 +02:00
erogol 6a8052fd72 new scale stats 2020-07-11 03:57:37 +02:00
erogol 07d2d28ae6 update tests 2020-07-08 10:21:45 +02:00