Commit Graph

4543 Commits (add_lang_code)

Author SHA1 Message Date
WeberJulian d6e223f484 woops 2023-11-08 10:17:54 +01:00
WeberJulian ea56ec041f update docs 2023-11-08 10:16:36 +01:00
WeberJulian 5c81500e3e Remove ununsed config and args 2023-11-08 10:14:38 +01:00
WeberJulian 9f106034a1 Add lang code in XTTS doc 2023-11-07 14:54:17 +01:00
Eren Gölge f846a9f300
Update to v0.20.1 2023-11-07 14:17:36 +01:00
Edresson Casanova cbdbc44e0f
Fix XTTS v2.0 training recipe (#3154)
* Fix XTTS v2.0 training recipe

* Update XTTS v2 model hash
2023-11-07 14:16:44 +01:00
Eren Gölge 5e992d8704
Merge pull request #3149 from coqui-ai/fixup_xtts_v2
Bug fixes and add support for multiples speaker references on XTTS inference
2023-11-07 10:36:20 +01:00
Edresson Casanova 5f9ab6cfaa
Fix style
Co-authored-by: Aarni Koskela <akx@iki.fi>
2023-11-06 19:22:34 -03:00
Edresson Casanova 905900afc9 Update XTTS v1.1 recipe 2023-11-06 19:14:50 -03:00
Edresson Casanova 2470599d18 Drop XTTS v1 2023-11-06 19:12:04 -03:00
Edresson Casanova 13243df526 Update XTTS v1.1 files 2023-11-06 19:10:21 -03:00
Edresson Casanova cabff9f323 Update XTTS v2.0 recipe 2023-11-06 17:47:14 -03:00
Edresson Casanova 09fb317e6d Remove unused code 2023-11-06 17:36:32 -03:00
Edresson Casanova b146de4ce8 Bug fix on XTTS v2.0 Trainer 2023-11-06 20:26:01 +01:00
Edresson Casanova f444f296f2 Add multiples references on xtts inference tests 2023-11-06 20:25:06 +01:00
Edresson Casanova 1b6f8d0e46 Update unit tests and recipes 2023-11-06 20:25:06 +01:00
Edresson Casanova 72b2bac0f8 Load reference in 24khz to avoid issued with multiple sr references 2023-11-06 20:25:06 +01:00
Edresson Casanova 00294ffdf6 Update XTTS docs 2023-11-06 20:24:06 +01:00
Edresson Casanova 459ad70dc8 Add support for multiples speaker references on XTTS inference 2023-11-06 20:22:35 +01:00
Edresson Casanova 9942000c50 Update XTTS v2 recipe model files 2023-11-06 20:20:28 +01:00
Eren Gölge f0cb19ecca
Drop diffusion from XTTS (#3150)
* Drop diffusion for XTTS

* Make style

* Drop diffusion deps in code

* Restore thrashed
2023-11-06 20:15:49 +01:00
Eren G??lge 5d418bb84a Update docs 2023-11-06 18:48:41 +01:00
Eren G??lge 9bbf6eb8dd Drop use_ne_hifigan 2023-11-06 18:43:38 +01:00
Eren G??lge 9d54bd7655 Fixup XTTS 2023-11-06 18:13:58 +01:00
Eren Gölge c713a839da
Update VERSION 2023-11-06 15:51:56 +01:00
Eren Gölge 7eedfc67da
Update README.md 2023-11-06 15:37:32 +01:00
Edresson Casanova e45227d9ff
XTTS v2.0 (#3137)
* Implement most similar ref training approach

* Use non-enhanced hifigan for test samples

* Add Perceiver

* Update GPT Trainer for perceiver support

* Update XTTS docs

* Bug fix masking with XTTS perceiver

* Bug fix on gpt forward

* Bug Fix on XTTS v2.0 training

* Add XTTS v2.0 unit tests

* Add XTTS v2.0 inference unit tests

* Bug Fix on diffusion inference

* Add XTTS v2.0 training recipe

* Placeholder model entry

* Add cloning params to config

* Make prompt embedding configurable

* Make cloning configurable

* Cheap fix for a cheaper fix

* Prevent resampling

* Update model entry

* Update docs

* Update requirements

* Code linting

* Add xtts v2 to sep tests

* Bug fix on XTTS get_gpt_cond_latents

* Bug fix on rebase

* Make style

* Bug fix in Japenese tokenizer

* Add num2words to deps

* Remove unused kwarg and added num_beams=1 as default

---------

Co-authored-by: Eren G??lge <egolge@coqui.ai>
2023-11-06 14:58:18 +01:00
Aarni Koskela 38f6f8f0bb
Run `make style` & re-enable it in CI (#3127) 2023-11-06 11:36:37 +01:00
Eren Gölge 6fef4f9067
Bump up to v0.19.1 2023-10-30 10:37:28 +01:00
Eren Gölge eccc94be9b
Merge pull request #2983 from vltmedia/dev
Bug: self.model_name needed to be initialized.
2023-10-28 10:39:25 +02:00
Eren Gölge 2d6bd716ef
Merge pull request #3109 from coqui-ai/tts_3067
fix for issue 3067
2023-10-28 10:37:52 +02:00
Eren Gölge 788959d720
Merge pull request #3103 from coqui-ai/fix_xttsv1.1_again
Second round of issue fixing for XTTS v1.1
2023-10-28 10:33:19 +02:00
WeberJulian 1c98821359 Remove unused load_audio function 2023-10-27 22:27:18 +02:00
Aya Jafari 041b4b6723 fix for issue 3067 2023-10-26 13:06:01 -03:00
WeberJulian d4e08c8d6c Add features to get_conditioning_latents 2023-10-26 14:57:33 +02:00
WeberJulian c1133724a1 Move lang token add to tokenizer 2023-10-26 14:52:13 +02:00
WeberJulian 6fa46d197d Fix get_conditioning_latents when using only ne 2023-10-26 14:51:35 +02:00
Eren Gölge edd3a28723
Bump up to v0.19.0 2023-10-25 13:29:38 +02:00
Eren Gölge 16ba377f61
Merge pull request #3086 from coqui-ai/xtts_trainer
XTTS v1.1 GPT Trainer
2023-10-25 13:28:47 +02:00
Edresson Casanova 01839af926 Bug fix on XTTS masking training 2023-10-24 18:30:14 -03:00
Edresson Casanova 8af3d2dbcd Add a dedicated workflow for XTTS tests 2023-10-24 09:52:44 -03:00
VLT Media 818aa0eb7e
Merge branch 'coqui-ai:dev' into dev 2023-10-23 23:36:33 -04:00
Edresson Casanova de1d521c8a Update XTTS docs 2023-10-23 13:35:15 -03:00
Edresson Casanova 0f96abb5ec Add FT inference example on XTTS docs 2023-10-23 13:23:30 -03:00
Edresson Casanova 67ca70aff4 Fix Delightful TTS layers unit test 2023-10-23 11:47:10 -03:00
Edresson Casanova 37b7945474 Update XTTS train not implemented error to point to the XTTS docs 2023-10-23 11:39:17 -03:00
Edresson Casanova 1ee8096799 Update XTTS docs 2023-10-23 11:13:09 -03:00
Edresson Casanova 6fefc36e5a Update XTTS docs 2023-10-23 11:03:57 -03:00
Edresson Casanova 8853e1c3ec Update XTTS recipe to only download checkpoint if it is needed 2023-10-23 10:45:41 -03:00
Edresson Casanova 653f2e75ef Update xtts trainer recipe 2023-10-23 09:58:16 -03:00