Eren Gölge
d0eb3e4ef2
Add get_tests_data_path
2022-02-25 11:24:13 +01:00
Eren Gölge
235f7d9b02
Extend glow_tts model tests
2022-02-25 11:24:13 +01:00
Eren Gölge
5176ae9e53
Fixes small compat. issues
2022-02-25 11:21:19 +01:00
Eren Gölge
edec27738b
Delete `use_espeak_phonemes` from tests
2022-02-25 11:18:00 +01:00
Eren Gölge
0a47a7eac0
Update tests
2022-02-25 11:12:44 +01:00
Eren Gölge
b341951b78
Update loader tests
2022-02-25 11:12:44 +01:00
Eren Gölge
196ae74273
Update data loader tests
2022-02-25 11:05:06 +01:00
Eren Gölge
75c507c36a
Update VITS LJspeech recipe
2022-02-25 10:57:35 +01:00
Eren Gölge
04202da1ac
Make style
2022-02-25 10:48:03 +01:00
Eren Gölge
961e98a461
Add OOV case to tokenizer tests
2022-02-25 10:48:03 +01:00
Eren Gölge
8c8093ce23
Make style
2022-02-25 10:48:03 +01:00
Eren Gölge
f1ea3ad182
Remove old text processing tests
2022-02-25 10:48:02 +01:00
Eren Gölge
ba3b60c90f
Test TTSTokenizer
2022-02-25 10:48:02 +01:00
Eren Gölge
79a84410f2
Test punctuations
2022-02-25 10:48:02 +01:00
Eren Gölge
99d9bb7a17
Test Phonemizers
2022-02-25 10:48:02 +01:00
Eren Gölge
a1df4f9887
Test character classes
2022-02-25 10:45:24 +01:00
Eren Gölge
a51b031bff
Merge branch 'dev' into dev-fix-glowtts-infer
2022-02-21 12:01:40 +03:00
Edresson Casanova
28a7464975
Fix the bug in split dataset function ( #1251 )
* Fix the bug in split_dataset
* Make eval_split_size configurable
* Change test_loader to use load_tts_samples function
* Change eval_split_portion to eval_split_size and permit setting the absolute number of samples in eval
* Fix samplers unit test
* Add data unit test on GitHub workflow
2022-02-21 11:59:36 +03:00
Edresson Casanova
531821545e
Fix inference test issue
2022-02-19 12:21:32 +00:00
Edresson Casanova
5218d6b7a4
Fix unit tests issue
2022-02-19 12:15:03 +00:00
Edresson Casanova
fc7081fc5e
Add Inference test using TTS API in all models unit tests
2022-02-18 21:06:08 +00:00
Edresson Casanova
5cca4aa8ae
Add FastPitch Speaker embedding train unit test
2022-02-18 20:16:52 +00:00
Edresson Casanova
759f9ac76a
Add Glow-TTS d-vectors training unit test
2022-02-18 20:03:36 +00:00
Edresson Casanova
06cad27e31
Add Glow-TTS multi-speaker unit test
2022-02-18 18:20:47 +00:00
Eren Gölge
127118c637
Update TTS.tts formatters ( #1228 )
* Return Dict from tts formatters
* Make style
2022-02-11 23:03:43 +01:00
Edresson Casanova
0860d73cf8
Remove Tensorflow requirement ( #1225 )
* Remove TF modules
* Remove TF unit tests
* Remove TF vocoder modules
* Remove TF convert scripts
* Remove TF requirement
* Remove the Docs TF instructions
* Remove TF inference support
2022-02-10 16:14:54 +01:00
Eren Gölge
8fd1ee1926
Print urls when BadZipError
2022-01-01 15:26:35 +00:00
Eren Gölge
254c110ec1
Print testing model
2022-01-01 13:57:01 +00:00
Eren Gölge
61874bc0a0
Fix your_tts inference from the listed models
2021-12-31 13:45:05 +00:00
Eren Gölge
36cef5966b
Fix resnet speaker encoder
2021-12-30 15:36:35 +00:00
Eren Gölge
348b5c96a2
Fix speaker encoder test
2021-12-30 15:36:35 +00:00
Eren Gölge
497332bd46
Add custom asserts to tests
2021-12-30 14:08:17 +00:00
Eren Gölge
2033e17c44
Add VITS model tests
2021-12-29 16:51:40 +00:00
Eren Gölge
56378b12f7
Fix speaker encoder init
2021-12-21 12:26:25 +00:00
Eren Gölge
704dddcffa
Make style
2021-12-20 11:54:10 +00:00
WeberJulian
8b3769c957
Fix seed in test_samplers to avoid random fails
2021-12-20 11:54:10 +00:00
WeberJulian
6f01eed672
Add test for language_weighted_sampler
2021-12-20 11:54:10 +00:00
Edresson
a57ddfb4ec
Add remove silence vad script Unit test
2021-12-20 11:54:10 +00:00
Edresson
e068fab6b2
Add find unique phonemes unit tests
2021-12-20 11:54:10 +00:00
WeberJulian
54e33bff61
Make a multilingual test use chars
2021-12-20 11:54:10 +00:00
WeberJulian
09eda31a3f
Fix tests
2021-12-20 11:54:10 +00:00
Edresson
06d89f93a8
Add VITS multilingual d-vectors unit test
2021-12-20 11:54:10 +00:00
Edresson
f394d60695
Fix the bug in multispeaker vits
2021-12-20 11:54:10 +00:00
WeberJulian
1472b6df49
make style
2021-12-20 11:54:10 +00:00
WeberJulian
3b5592abcf
fix test vits
2021-12-20 11:54:10 +00:00
Edresson
bbdb5c38e6
Add VITS multispeaker train unit test
2021-12-20 11:54:09 +00:00
Edresson
92f7f4f400
Activate the multispeaker mode in multilingual training
2021-12-20 11:54:09 +00:00
Edresson
e68b042493
Add VITS d-vector unit test
2021-12-20 11:54:09 +00:00
Edresson
959cc8f03c
Add VITS multilingual unit test
2021-12-20 11:54:09 +00:00
Edresson
3fbbebd74d
Fix pylint issues
2021-12-20 11:54:09 +00:00
Michael Hansen
3bc043faeb
Upgrade to gruut 2.0 ( #882 )
2021-10-31 11:41:55 +01:00
Eren Gölge
2df0752e73
Model zoo tests ( #900 )
* Fix VITS model multi-speaker init
* Remove gdrive support in model manager
* Add model zoo tests
2021-10-29 17:54:16 +02:00
Eren Gölge
25759d6a61
Split tests
2021-10-21 17:30:15 +00:00
Eren Gölge
e62d3c5cf7
Use absolute imports for tts configs and models
2021-10-21 16:29:06 +00:00
Eren Gölge
4dbe7ed0de
Fix all-zero duration case for GlowTTS
2021-10-01 09:24:26 +00:00
Eren Gölge
7edbe04fe0
Fix WaveRNN config and test
2021-09-30 16:20:12 +00:00
Eren Gölge
4cacbf0d45
Fix WaveRNN test
2021-09-30 14:47:56 +00:00
Eren Gölge
2766dd1d6e
Fix #813 - GlowTTS training ( #814 )
* Fix #813
* Update glow_tts recipe
* Fix glow-tts test
* Linter fix
* Run data dep init only in training
2021-09-17 20:06:55 +02:00
Eren Gölge
1e7db32e90
Test FastPitch train
2021-09-11 10:19:47 +00:00
Eren Gölge
26f76fce22
Remove SpeedySpeech from .models.json
2021-09-10 17:47:27 +00:00
Eren Gölge
7ec23e69d4
Skip TF tests on GPU
2021-09-10 17:28:58 +00:00
Eren Gölge
1ebf9ec6bf
Remove speedy_speech implementation
2021-09-10 17:28:20 +00:00
Eren Gölge
7d8f77385a
Use `glow-tts` in synthesis tests
2021-09-10 17:27:33 +00:00
Eren Gölge
d6e29ef98a
Style update
2021-09-10 08:30:33 +00:00
Eren Gölge
3abc3a1d32
Fix GPU init in tests
2021-09-10 08:28:10 +00:00
Eren Gölge
ed4b1d8514
Test `TTS.tts.utils.helpers`
2021-09-10 08:25:21 +00:00
Eren Gölge
8b7e094bde
Implement `forward_tts`
- Generic API for feed-forward TTS models (FastPitch, SpeedySpeech)
- Tests for `forward-tts`
- Edit FastPitchConfig and SpeedySpeechConfig to use `forward_tts`
2021-09-10 08:24:33 +00:00
Eren Gölge
4761853c5c
Fix imports
2021-09-08 13:34:40 +00:00
Eren Gölge
e72c265cd4
Fix linter issues
2021-09-06 15:16:58 +00:00
Eren Gölge
fd287aa438
Update loader tests for dict return
2021-09-06 15:16:58 +00:00
Eren Gölge
2c4bbbf9b9
Use pyworld for pitch
2021-09-06 15:16:58 +00:00
Eren Gölge
076d0cb258
Add tests for certain FastPitch functions
2021-09-06 15:16:58 +00:00
Eren Gölge
d63a6bb690
Set BaseDatasetConfig for tests
2021-09-06 15:16:58 +00:00
Eren Gölge
fba257104d
Compute F0 using librosa
2021-09-06 15:16:58 +00:00
Katsuya Iida
165e5814af
Update Japanese phonemizer ( #758 )
* Update default ja vocoder
* update
* Japanese phonemizer test
* Run make style
Co-authored-by: Eren Gölge <egolge@coqui.ai>
2021-09-01 09:33:15 +02:00
Eren Gölge
f186856e5d
Add option to sort input sequence by audio len
2021-08-30 08:10:35 +00:00
Eren Gölge
c312acac7d
Implement VITS model 🚀
VITS model implementation built on Glow TTS and HiFiGAN layers.
2021-08-09 18:02:36 +00:00
Eren Gölge
003e5579e8
Enable `custom_symbols` in text processing
Models can define their own custom symbols lists with custom `make_symbols()`
2021-08-09 18:02:36 +00:00
Eren Gölge
e4648ffef1
Fix multi-speaker init of Tacotron models & tests
2021-08-09 18:02:36 +00:00
Agrin Hilmkil
ced4cfdbbf
Allow saving / loading checkpoints from cloud paths ( #683 )
* Allow saving / loading checkpoints from cloud paths
Allows saving and loading checkpoints directly from cloud paths like Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.
Note: The user will have to install the relevant dependency for each protocol. Otherwise fsspec will fail and specify which dependency is missing.
* Append suffix _fsspec to save/load function names
* Add a lower bound to the fsspec dependency
Skips the 0 major version.
* Add missing changes from refactor
* Use fsspec for remaining artifacts
* Add test case with path requiring fsspec
* Avoid writing logs to file unless output_path is local
* Document the possibility of using paths supported by fsspec
* Fix style and lint
* Add missing lint fixes
* Add type annotations to new functions
* Use Coqpit method for converting config to dict
* Fix type annotation in semi-new function
* Add return type for load_fsspec
* Fix bug where fs not always created
* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Eren Gölge
75b201c6c1
Merge pull request #673 from coqui-ai/fix_stopnet
Fix stopnet training for Tacotron models
2021-07-24 12:25:38 +02:00
Eren Gölge
fc0c4600bd
Fix stopnet training
2021-07-24 11:39:54 +02:00
Eren Gölge
30eed347b6
Merge pull request #581 from Edresson/dev
Compute speaker embeddings in batch for the LSTM Speaker Encoder and Compute embeddings/ finding chars using config file.
2021-07-23 17:22:51 +02:00
WeberJulian
c79a82ed07
refix linter
2021-07-13 23:12:18 +02:00
Edresson
2e5baffa9c
Merge fix and eval split as argparse
2021-07-13 01:47:32 -03:00
Edresson
4eac1c4651
bug fix on train_encoder and unit tests
2021-07-11 12:00:39 -03:00
Eren Gölge
1e9538aaef
Add more model tests to `test_synthesize`
2021-07-04 11:45:49 +02:00
Eren Gölge
47b3b10d6d
Bump up to v0.1.0 🚀
2021-06-29 13:07:59 +02:00
Eren Gölge
9790eddada
Fix wrong argument name 🛠️
2021-06-28 17:03:47 +02:00
Eren Gölge
626c9d41e6
Update tests for the new trainer API
2021-06-28 17:03:19 +02:00
Eren Gölge
fcfd95669a
Update model test configs
2021-06-28 17:03:19 +02:00
Eren Gölge
00c82c516d
rename to
2021-06-28 17:03:19 +02:00
Eren Gölge
fdfb18d230
downsize melgan test model size
2021-06-28 17:03:19 +02:00
Eren Gölge
25238e0658
fix glow-tts `inference()`
2021-06-28 17:03:19 +02:00
Eren Gölge
82582993cc
use one testing sentence in tts tests
2021-06-28 17:03:19 +02:00
Eren Gölge
419735f440
refactor and fix multi-speaker training in Trainer and Tacotron models
2021-06-28 17:03:19 +02:00
Eren Gölge
269e5a734e
add max_decoder_steps argument to tacotron models
2021-06-28 17:03:19 +02:00
Eren Gölge
304d60197b
reduce multiband melgan test model size
2021-06-28 17:03:19 +02:00
Eren Gölge
db6a97d1a2
rename external speaker embedding arguments as `d_vectors`
2021-06-28 17:03:19 +02:00
Eren Gölge
877bf66b61
reduce size of the metadata.csv used at testing
2021-06-28 17:03:19 +02:00
Eren Gölge
87c61d210a
update test to be less demanding
2021-06-28 17:03:19 +02:00
Eren Gölge
6d6896fd99
reduce fullband-melgan test model size
2021-06-28 17:03:19 +02:00
Eren Gölge
1443d03af1
update test for the new input output API of the tts models
2021-06-28 17:03:19 +02:00
Eren Gölge
ef4ea9e527
update imports for `formatters`
2021-06-28 17:03:19 +02:00
Eren Gölge
6c495c6a6e
fix glow-tts inference and forward functions for handling `cond_input` and refactor its test
2021-06-28 17:03:19 +02:00
Eren Gölge
b500338faa
make style
2021-06-28 17:03:19 +02:00
Eren Gölge
d25f017b42
update `setup_model.py` imports
2021-06-28 17:03:19 +02:00
Eren Gölge
7dff6be871
update tts training tests to use the trainer
2021-06-28 17:03:19 +02:00
Eren Gölge
9134c7dfb6
update `sequence_mask` import globally
2021-06-28 17:03:19 +02:00
Eren Gölge
8def3c87af
trainer-API updates
2021-06-28 17:03:19 +02:00
Eren Gölge
42554cc711
rename MyDataset -> TTSDataset
2021-06-28 17:03:19 +02:00
Edresson Casanova
eb84bb2bc8
Merge branch 'dev' into dev
2021-06-26 15:32:19 -03:00
Eren Gölge
6c7bbcaef0
Use `en-us` for testing phoneme models
2021-06-25 16:52:17 +02:00
Michael Hansen
a41f53fe72
Fix silly error in tests
2021-06-25 14:41:35 +02:00
Michael Hansen
3f172b84d8
Fix linting issues
2021-06-25 14:41:31 +02:00
Michael Hansen
4d8426fa0a
Use eSpeak IPA lexicons by default for phoneme models
2021-06-25 14:41:05 +02:00
Michael Hansen
47191f3ecc
Add tests for gruut phonemization
2021-06-25 14:41:05 +02:00
Edresson
28bec238ca
fix Lint checks
2021-06-18 14:33:50 -03:00
Edresson
83644056e3
fix Lint checks
2021-06-18 14:32:28 -03:00
Eren Gölge
db48c69f0f
reduce fullband melgan model size for testing
2021-06-02 11:44:53 +02:00
Eren Gölge
49c5e5d820
make style japanese PR
2021-06-02 11:44:46 +02:00
Eren Gölge
0c14278c30
reorg test files
2021-06-02 11:40:26 +02:00
Eren Gölge
73b4083c6c
Merge pull request #502 from kaiidams/kaiidams/kokoro
Japanese Tacotron 2 model
2021-06-02 10:20:08 +02:00
Katsuya Iida
1cc18d1972
Move unittest of Japanese phonemizer.
2021-06-01 18:51:34 +09:00
Eren Gölge
bec85ac58d
make style
2021-05-31 16:37:15 +02:00
Eren Gölge
301c516abd
Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev
2021-05-31 15:46:25 +02:00
Edresson
cc192b6843
add resnet speaker encoder train unit test
2021-05-29 22:43:41 -03:00
Eren Gölge
925c08cf95
replace unidecode with anyascii
2021-05-27 14:02:44 +02:00
Edresson
bc5307caa0
add unit tests for SoftmaxAngleProtoLoss and ResnetSpeakerEncoder and bugfix
2021-05-26 20:35:58 -03:00
Edresson
7a9a27282a
fix unit tests
2021-05-26 18:14:06 -03:00
Eren Gölge
f5a9950a2e
phoneme files for testing
2021-05-19 03:16:57 +02:00
Eren Gölge
9b706c5583
enable phonemes in test to match the attention masks
2021-05-19 03:15:54 +02:00
Eren Gölge
8a7c40736c
set use_phonemes false
2021-05-19 01:27:26 +02:00
Eren Gölge
faedea4b60
set use_phonemes False in configs
2021-05-19 00:08:07 +02:00
Eren Gölge
ccfaa6b1d5
add `needs_phonemizer` field to models.json. If set true these models are only compatible with v0.0.13 or below.
2021-05-18 17:57:28 +02:00
Eren Gölge
a14fcf2a13
remove text_processing test
2021-05-18 17:57:28 +02:00
Eren Gölge
12722501bb
styling
2021-05-15 23:48:31 +02:00
Eren Gölge
7e02cff924
reduce pwgan test batch size
2021-05-12 16:10:08 +02:00
Eren Gölge
da49089a72
update melgan training test batch size
2021-05-12 10:12:11 +02:00
Eren Gölge
0213e1cbf4
update configs for tts models to match the field typed with the expected values
2021-05-12 00:57:38 +02:00
Eren Gölge
715b0a65a0
update main.yml for python x64
fix test
2021-05-12 00:57:29 +02:00
Eren Gölge
843d1b3d98
linter fixes
2021-05-11 11:30:00 +02:00
Eren Gölge
19fb1d743d
style update
2021-05-11 11:30:00 +02:00
Eren Gölge
5aee30443f
refactoring tests after Coqpit
2021-05-11 11:30:00 +02:00
Eren Gölge
87384c6008
get_device_id() for tests
2021-05-11 11:29:18 +02:00
Eren Gölge
21dd4d7960
fix load_config imports for Coqpit
2021-05-11 11:29:18 +02:00
Eren Gölge
9f2d2d2081
add speaker encoder train test
2021-05-11 11:29:18 +02:00
Eren Gölge
9ee70af9bb
code styling
2021-05-11 11:29:18 +02:00
Eren Gölge
70fc7a7e71
remove bash running tests
2021-05-11 11:29:18 +02:00
Eren Gölge
1be45eae38
add new tests for vocoder trainings using coqpit
2021-05-11 11:29:18 +02:00