Eren Gölge
5e4bd9bfe8
Merge branch 'cpu-only-docker-image' of https://github.com/coqui-ai/TTS into cpu-only-docker-image
2022-05-12 18:50:06 +02:00
Eren Gölge
085517b79a
Fix Dockerfile
2022-05-12 18:48:35 +02:00
Reuben Morais
7bb45a20ec
Build and publish CPU only Docker image
2022-05-12 18:47:41 +02:00
Eren Gölge
27cf388a79
Update CI tests ( #1572 )
...
* Use direct model URLs in CI
* Fixup
* Fixup
2022-05-12 18:41:01 +02:00
Eren Gölge
4857967063
🐍 Python 3.10.x support and drop Python 3.6 support ( #1565 )
...
* Update requirements
* Update CI for p3.10
* Update numpy requirement
* Drop 🐍 p3.6 support
Numpy also dropped support for p3.6
* Bind cython v0.29.28
* Bind pyworld to v0.2.10
> 0.2.10 is not p3.10.x compatible
* Update Dockerfile
2022-05-12 15:50:25 +02:00
Edresson Casanova
a97eed696a
Fix the bug in eSpeak wrapper for eSpeak version 1.48.15 ( #1560 )
2022-05-12 15:15:18 +02:00
Eren Gölge
e45ae57aef
Merge pull request #1550 from coqui-ai/fix-upsampling-asserts
...
Fix VITS upsampling asserts
2022-05-12 14:51:41 +02:00
Edresson Casanova
175ca06388
Add reinit text encoder and duration predictor parameter ( #1562 )
...
* Add reinit encoder and duration predictor option
* Add .data to prevent any overlooked autograd hook
2022-05-12 09:08:36 -03:00
Edresson Casanova
182711043c
Fix the VITS upsampling asserts
...
Fix style
2022-05-12 09:08:29 -03:00
Taras Sereda
f9d91a55f2
Improve data_path resolvement ( #1567 )
2022-05-12 13:10:35 +02:00
Eren Gölge
d5d590bc36
Fix Dockerfile
2022-05-12 12:55:27 +02:00
Reuben Morais
6484be687c
Build and publish CPU only Docker image
2022-05-11 15:01:00 +02:00
Eren Gölge
2fc38f67d2
Update SpeakerManager init in Synthesizer
2022-05-11 11:32:27 +02:00
Eren Gölge
c3f8c4d5eb
Return default SpeakerManager if no d_vector_file
2022-05-11 11:31:45 +02:00
Eren Gölge
121e9ed685
Pass use_cuda to init_encoder
2022-05-11 11:31:17 +02:00
Eren Gölge
c18bd21b3f
Return durations at VITS inference
2022-05-11 11:30:05 +02:00
Eren Gölge
5021a03de0
Use torch.no_grad for VITS inference
2022-05-11 11:29:36 +02:00
Eren Gölge
3f03e3012c
Fix batch_group_size in VITS
2022-05-07 13:44:44 +02:00
code-review-doctor
fa887ef5f9
Fix issue probably-meant-fstring found at https://codereview.doctor ( #1532 )
2022-05-07 13:33:40 +02:00
Arvind Suresh
a34076af35
Update documentation for multi-gpu training
2022-05-07 13:30:03 +02:00
Eren Gölge
a0a9279e4b
Fix GAN optimizer order
...
commit 212d330929
Author: Edresson Casanova <edresson1@gmail.com>
Date: Fri Apr 29 16:29:44 2022 -0300
Fix unit test
commit 44456b0483
Author: Edresson Casanova <edresson1@gmail.com>
Date: Fri Apr 29 07:28:39 2022 -0300
Fix style
commit d545beadb9
Author: Edresson Casanova <edresson1@gmail.com>
Date: Thu Apr 28 17:08:04 2022 -0300
Change order of HIFI-GAN optimizers to be equal than the original repository
commit 657c5442e5
Author: Edresson Casanova <edresson1@gmail.com>
Date: Thu Apr 28 15:40:16 2022 -0300
Remove audio padding before mel spec extraction
commit 76b274e690
Merge: 379ccd7b
6233f4fc
Author: Edresson Casanova <edresson1@gmail.com>
Date: Wed Apr 27 07:28:48 2022 -0300
Merge pull request #1541 from coqui-ai/comp_emb_fix
Bug fix in compute embedding without eval partition
commit 379ccd7ba6
Author: WeberJulian <julian.weber@hotmail.fr>
Date: Wed Apr 27 10:42:26 2022 +0200
returns y_mask in VITS inference (#1540 )
* returns y_mask
* make style
2022-05-07 13:29:11 +02:00
Edresson Casanova
60034674f9
Remove audio padding before mel spec extraction
2022-05-07 13:12:09 +02:00
WeberJulian
fbdf76b2fc
returns y_mask in VITS inference ( #1540 )
...
* returns y_mask
* make style
2022-05-03 13:49:24 +02:00
Edresson Casanova
6233f4fcd7
Bug fix in compute embedding without eval partition
2022-04-26 13:58:03 -03:00
Edresson Casanova
a41e860a66
Update Coqpit requirement ( #1539 )
2022-04-26 17:39:36 +02:00
Edresson Casanova
8d228ab22a
Trick to Upsampling to High sampling rates using VITS model ( #1456 )
...
* Add upsample VITS support
* Fix the bug in inference
* Fix lint checks
* Add RMS based norm in save_wav method
* Style fix
* Add the period for VITS multi-period discriminator in model_args
* Bug fix in speaker encoder load in inference time
* Add unit tests
* Remove useless detach_z_vocoder parameter
* Add docs for VITS upsampling
* Fix the docs
* Rename TTS_part_sample_rate to encoder_sample_rate
* Add upsampling_init and upsampling_z methods
* Add asserts for encoder_sample_rate part
* Move upsampling tests to test_vits.py
2022-04-26 11:47:46 +02:00
Eren Gölge
c410bc58ef
Bump to v0.6.2
2022-04-20 11:46:26 +02:00
WeberJulian
30bea7d53c
Update manage.py ( #1514 )
2022-04-19 14:27:32 +02:00
Yanlong Wang
b45d5c5c60
Improve docsQA default questions ( #1411 )
2022-04-19 14:24:34 +02:00
Eren Gölge
7133f8f47d
Print Model's license when downloading ( #1512 )
...
* Print model license while downloading
* Make style
* Add a new license link
* Make style
2022-04-19 14:18:49 +02:00
WeberJulian
4953636b14
Add African models ( #1511 )
...
* Add african models
* Set default license for all models
2022-04-19 14:18:30 +02:00
jackiexiao
e8573bfe3e
Update CONTRIBUTING.md ( #1463 )
...
fix header
```
## Call for sharing language models
```
2022-04-15 14:43:46 +02:00
Reuben Morais
c18100d112
Merge branch 'docker-ci' into dev ( Fixes #1498 )
2022-04-15 02:32:51 +02:00
Reuben Morais
27fcb5dabf
Add Dockerfile and build/push CI
2022-04-15 02:17:10 +02:00
Eren Gölge
164c7dd676
Update requirements coqui_trainer -> trainer ( #1478 )
2022-04-08 14:47:09 +02:00
Edresson Casanova
060e0f9368
Add EmbeddingManager and BaseIDManager ( #1374 )
2022-03-31 13:41:16 +02:00
WeberJulian
1b22f03e98
Fix G2P backend of the released models ( #1461 )
...
* Fix enforce phonemizer
* Add new models
* Fix .model.json
2022-03-30 12:47:11 +02:00
WeberJulian
c66a6241fd
Enforce phonemizer definition for synthesis ( #1441 )
...
* Enforce phonemizer definition for synthesis
* Fix train_tts, tokenizer init can now edit config
* Add small change to trigger CI pipeline
* fix wrong output path for one tts_test
* Fix style
* Test config overides by args and tokenizer
* Fix style
2022-03-25 23:15:33 +01:00
Edresson Casanova
37896e1743
Bug fix in freeze encoder ( #1391 )
...
* Fix the bug in freeze encoder
* Remove emb_l definition for non-multilingual training
* Fix unit tests
2022-03-24 18:16:04 +01:00
Edresson Casanova
464dc658ff
Merge pull request #1431 from coqui-ai/silero-vad
...
Replace webrtcvad by silero-vad
2022-03-24 08:29:32 -03:00
Edresson Casanova
3435bc8fca
Fix style tests
2022-03-23 15:05:32 -03:00
Edresson Casanova
0ae1e0248c
Fix the bug for emptly audio files
2022-03-23 14:39:31 -03:00
Edresson Casanova
ea53d6feb3
Replace webrtcvad by silero-vad
2022-03-23 14:39:31 -03:00
Eren Gölge
3af01cfe3b
Update base model wrt 👟 ( #1406 )
2022-03-23 17:24:20 +01:00
WeberJulian
3c7c14607b
Add formatting tests ( #1437 )
...
* Add style checks to `make lint`
* Bump target-version in black config
2022-03-23 17:23:36 +01:00
Eren Gölge
1c3623af33
Fix model manager ( #1436 )
...
* Fix manager
* Make style
2022-03-23 12:57:14 +01:00
Eren Gölge
72d85e53c9
Update model file extension ( #1422 )
...
* Update model file ext to ```.pth```
* Update docs
* Rename more
* Find model files
2022-03-22 17:55:00 +01:00
Edresson Casanova
ccdc2300dc
Add eval_split and eval_split_size in the call of load_tts_samples for all recipes ( #1424 )
2022-03-22 12:54:41 +01:00
Eren Gölge
2e6e8f651d
Update CheckSpectrograms notebook ( #1418 )
2022-03-18 16:48:24 +01:00
Eren Gölge
c7f9ec07c8
Hinge Gruut version to 2.2.3 ( #1419 )
2022-03-18 16:47:50 +01:00