Commit Graph

1851 Commits (48a40c47304ddf8670b539174497cd0acab0f0e5)

Author SHA1 Message Date
erogol 48a40c4730 remove unused import 2020-10-06 11:32:24 +02:00
erogol a2606fbc22 format utils 2020-10-06 11:02:54 +02:00
Eren Gölge 4873601694
Merge pull request #531 from WeberJulian/french-cleaners
Adding support for french cleaners
2020-09-30 15:30:50 +02:00
Eren Gölge 592bb668fd
Merge pull request #533 from Edresson/dev
add Speaker Conditional GST support
2020-09-30 02:56:37 +02:00
Edresson c1fff5b556 add unit tests for SC-GST 2020-09-29 17:03:25 -03:00
Edresson 99d5a0ac07 add Speaker Conditional GST support 2020-09-29 16:09:27 -03:00
Julian WEBER ea7c2e15c0 Adding french abbreviations 2020-09-29 15:43:39 +02:00
Julian WEBER 54b4031391 Merge remote-tracking branch 'origin/dev' into french-cleaners 2020-09-29 14:24:51 +02:00
Julian WEBER da134eeee4 Subjective improvements 2020-09-29 14:20:52 +02:00
Julian WEBER b2817e9e93 Adding french cleaners 2020-09-29 14:20:24 +02:00
Eren Gölge cf02ace5b7
Merge pull request #530 from mueller91/fix_split_dataset
fix: split_dataset
2020-09-28 12:42:40 +02:00
erogol 154f90bc44 format speaker encoder imports 2020-09-28 11:19:19 +02:00
erogol e097bc6c5d Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-09-28 11:15:32 +02:00
Eren Gölge 8e2dc79c3a
Merge pull request #526 from mueller91/dev
Fix: Check storage params only for speaker encoder
2020-09-28 11:15:23 +02:00
erogol 6a70c63f24 correct glow-tts loss 2020-09-27 03:28:42 +02:00
erogol 665f7ca714 linter fix 2020-09-24 12:57:54 +02:00
mueller91 227b9c8864 fix: split_dataset() runtime reduced from O(N * |items|) to O(N) where N is the size of the eval split (max 500)
I notice a significant speedup on the initial loading of large datasets such as common voice (from minutes to seconds)
2020-09-23 23:27:51 +02:00
mueller91 cfeeef7a7f fix: broken imports and missing files after merging in latest commits from mozilla/dev into mueller91/dev.
speaker_encoder's config.json and visuals.py are missing in the current dev branch of MozillaTTS, and some imports are broken.
2020-09-22 20:10:41 +02:00
mueller91 1fe5eb054f Merge branch 'dev' of https://github.com/mozilla/TTS into dev
 Conflicts:
	TTS/bin/train_encoder.py
	requirements.txt
2020-09-22 19:58:53 +02:00
mueller91 df4caec4b7 add: check_config for speaker_encoder 2020-09-22 19:52:09 +02:00
Eren Gölge 82376298d9
Merge pull request #527 from WeberJulian/dev
fix: fixing the RenamingUnpickler fix
2020-09-22 18:16:40 +02:00
WeberJulian 3c212be5a8 fix: fixing the RenamingUnpickler fix 2020-09-22 17:36:05 +02:00
erogol 6a87b7e9aa acknowledgement for glow tts 2020-09-22 13:10:59 +02:00
erogol cefd9579fc Add glowtts to README 2020-09-22 11:39:48 +02:00
mueller91 0ea7f4e2bd fix: make speaker encoder's storage parameters non-restricted 2020-09-22 10:39:40 +02:00
mueller91 7029452228 fix: make speaker encoder's storage parameters non-restricted 2020-09-22 10:31:42 +02:00
erogol 08394e46b9 travis update for cython 2020-09-22 04:02:59 +02:00
erogol e52a81aa43 cython deps 2020-09-22 04:00:22 +02:00
erogol 10258724d1 linter fixes 2020-09-22 03:54:16 +02:00
erogol a6df617eb1 Merge branch 'glow-tts-amp-time_depth_conv' into dev 2020-09-21 14:23:45 +02:00
erogol 8150d5727e Merge branch 'dev' of https://github.com/mozilla/TTS into dev 2020-09-21 14:21:55 +02:00
erogol e0b9fa887f glow-tts modules added 2020-09-21 14:15:40 +02:00
erogol e4c6386603 change import for normalization layer 2020-09-21 13:09:52 +02:00
Eren Gölge c514628d02
Merge pull request #519 from mueller91/dev
Speaker Encoder: New Datasets + DataLoader optimized
2020-09-21 12:48:42 +02:00
mueller91 9b4aac94a8 fix: linter issues 2020-09-21 12:13:02 +02:00
mueller91 45b3c3d1b0 fix: Update common_voice.tsv and test_preprocessors.py to current .tsv format
(common_voice.tsv is the first 6 lines of the dev.tsv)
2020-09-21 11:57:38 +02:00
erogol c008003506 do not check sample rate when loading the stats file for normalization, to enable interpolation for a vocoder with a different sample rate 2020-09-18 12:52:19 +02:00
mueller 6b0621c794 cleanup 2020-09-17 16:46:43 +02:00
mueller a273b1a210 add: add random noise to dataset 2020-09-17 14:23:40 +02:00
mueller e36a3067e4 add: save wavs instead of feats to storage.
This is done in order to mitigate staleness when caching and loading from data storage
2020-09-17 14:14:30 +02:00
mueller 1511076fde add: Configurable encoder dataset storage to reduce disk I/O
add: Averaged time for data loader to console and Tensorboard output
2020-09-17 12:29:38 +02:00
erogol 3660c57f1e time separable convolution encoder, huber loss for duration predictor 2020-09-17 03:10:58 +02:00
mueller 95d2906307 add: Mozilla Commonvoice, VoxCeleb1+2, LibriTTS to Speaker Encoder Training 2020-09-16 16:49:53 +02:00
mueller c909ca3855 Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|) 2020-09-16 15:55:55 +02:00
mueller d733b90255 Improve runtime of __parse_items() from O(|speakers|*|items|) to O(|items|) 2020-09-16 15:09:02 +02:00
Eren Gölge e732db76f1
Merge pull request #513 from maxbachmann/master
use difflib for string matching
2020-09-15 10:24:01 +02:00
maxbachmann 60ce862113 use difflib for string matching 2020-09-14 23:55:34 +02:00
erogol f1a75468c2 fix arguments 2020-09-12 04:00:25 +02:00
erogol 7c2c4d6f27 pass x_mask to layer norm 2020-09-12 03:41:37 +02:00
erogol 53523eebbe layer norm before GLU 2020-09-12 03:41:37 +02:00