Commit Graph

4605 Commits (xtts_demo)

Author SHA1 Message Date
Eren Gölge 1936330ada
Update xtts.md 2023-12-01 23:52:06 +01:00
Edresson Casanova e9a2c0606a Add gc.collect() 2023-12-01 15:37:09 -03:00
Edresson Casanova 490af290d3 Delete unused variables 2023-12-01 15:21:33 -03:00
Edresson Casanova eb18b27afc
Delete trainer to freeze memory 2023-12-01 14:07:33 -03:00
Edresson Casanova 5dd217a759 Update XTTS finetuner docs 2023-12-01 09:47:09 -03:00
Edresson Casanova 68964fca0d Add XTTS fine-tuner docs 2023-12-01 09:13:34 -03:00
Edresson Casanova 1a60767d83 Add max_audio_length parameter 2023-11-27 12:10:43 -03:00
Edresson Casanova ceb8b05abe Update 2023-11-27 11:16:41 -03:00
Edresson Casanova e6c51e3666 Add intuitive error messages 2023-11-27 10:53:43 -03:00
Edresson Casanova c5cb7eb791 Add erros messages 2023-11-27 10:41:09 -03:00
Edresson Casanova eaa5355c91 Add parameters to be able to set then on colab demo 2023-11-27 10:01:48 -03:00
Edresson Casanova 335b8c37b3 Update gradio demo 2023-11-24 16:31:14 -03:00
Edresson Casanova 70f2cb9c0e Update gradio demo 2023-11-24 15:53:34 -03:00
Edresson Casanova c76fb856d1 Update gradio demo 2023-11-24 15:40:35 -03:00
Edresson Casanova 8967fc7ef2 Update gradio demo 2023-11-24 14:26:26 -03:00
Edresson Casanova af74cd4426 Bug fix on XTTS inference 2023-11-24 12:07:00 -03:00
Edresson Casanova 3fc2880127 Convert stereo to mono 2023-11-24 10:25:24 -03:00
Edresson Casanova fa9bb26ebb Update demo 2023-11-24 10:22:12 -03:00
Edresson Casanova 626d9e16fb Fix demo freezing issue 2023-11-24 08:44:21 -03:00
Edresson Casanova 7cc348ed76 Uses tabs instead of columns 2023-11-23 17:50:41 -03:00
Edresson Casanova cc4f37e1b0 Add training and inference columns 2023-11-23 16:30:49 -03:00
Edresson Casanova 774c4c1743 Add XTTS FT demo data processing pipeline 2023-11-22 18:11:52 -03:00
Eren Gölge 29dede20d3
Merge pull request #3249 from coqui-ai/run_ci_for_v0.20.6
Run CI for v0.20.6
2023-11-17 15:45:26 +01:00
Eren Gölge c011ab7455 Update to v0.20.6 2023-11-17 15:16:32 +01:00
Eren G??lge 52cb1e2f68 Update model hash for v2.0.2 2023-11-17 15:16:32 +01:00
Edresson Casanova 6075fa208c Ensures that only GPT model is in training mode during XTTS GPT training (#3241)
* Ensures that only GPT model is in training mode during training

* Fix parallel wavegan unit test
2023-11-17 15:15:22 +01:00
Eren G??lge a3279f9294 Make style 2023-11-17 15:15:22 +01:00
Eren G??lge f21067a84a Make k_diffusion optional 2023-11-17 15:15:21 +01:00
Eren G??lge 44494daa27 Update CI version 2023-11-17 15:15:21 +01:00
Eren G??lge c864acf2b7 Update versions 2023-11-17 15:15:21 +01:00
Edresson Casanova 11283fce07
Ensures that only GPT model is in training mode during XTTS GPT training (#3241)
* Ensures that only GPT model is in training mode during training

* Fix parallel wavegan unit test
2023-11-17 15:13:46 +01:00
Eren Gölge 14579a4607
Merge pull request #3248 from coqui-ai/slacker_deps
Update versions
2023-11-17 15:13:19 +01:00
Eren G??lge 44880f09ed Make style 2023-11-17 13:43:34 +01:00
Eren G??lge 26efdf6ee7 Make k_diffusion optional 2023-11-17 13:42:33 +01:00
Eren G??lge 08d11e9198 Update CI version 2023-11-17 13:01:32 +01:00
Eren G??lge 63d7145647 Update versions 2023-11-17 12:10:46 +01:00
Eren Gölge 7e4375da2b
Update to v0.20.6 2023-11-16 17:52:13 +01:00
Julian Weber fbc18b8c34
Fix zh bug (#3238) 2023-11-16 17:51:37 +01:00
Julian Weber 675f983550
Add sentence splitting (#3227)
* Add sentence spliting

* update requirements

* update default args v2

* Add spanish

* Fix return gpt_latents

* Update requirements

* Fix requirements
2023-11-16 11:01:11 +01:00
Enno Hermann 3c2d5a9e03
Remove duplicate AudioProcessor code and fix ExtractTTSpectrogram.ipynb (#3230)
* chore: remove unused argument

* refactor(audio.processor): remove duplicate stft+griffin_lim

* chore(audio.processor): remove unused compute_stft_paddings

Same function available in numpy_transforms

* refactor(audio.processor): remove duplicate db_to_amp

* refactor(audio.processor): remove duplicate amp_to_db

* refactor(audio.processor): remove duplicate linear_to_mel

* refactor(audio.processor): remove duplicate mel_to_linear

* refactor(audio.processor): remove duplicate build_mel_basis

* refactor(audio.processor): remove duplicate stft_parameters

* refactor(audio.processor): use pre-/deemphasis from numpy_transforms

* refactor(audio.processor): use rms_volume_norm from numpy_transforms

* chore(audio.processor): remove duplicate assert

Already checked in numpy_transforms.compute_f0

* refactor(audio.processor): use find_endpoint from numpy_transforms

* refactor(audio.processor): use trim_silence from numpy_transforms

* refactor(audio.processor): use volume_norm from numpy_transforms

* refactor(audio.processor): use load_wav from numpy_transforms

* fix(bin.extract_tts_spectrograms): set quantization bits

* fix(ExtractTTSpectrogram.ipynb): adapt to current TTS code

Fixes #2447, #2574

* refactor(audio.processor): remove duplicate quantization methods
2023-11-16 10:57:06 +01:00
Eren Gölge 88630c60e5
Update to v0.20.5 2023-11-15 14:02:51 +01:00
Edresson Casanova 73a5bd08c0
Fix XTTS GPT padding and inference issues (#3216)
* Fix end artifact for fine tuning models

* Bug fix on zh-cn inference

* Remove ununsed code
2023-11-15 14:02:05 +01:00
Ikko Eltociear Ashimine 15f0ac57d6
Update README.md (#3215)
Dicord -> Discord
2023-11-15 13:59:56 +01:00
Julian Weber 04901fb2e4
Add speed control for inference (#3214)
* Add speed control for inference

* Fix XTTS tests

* Add speed control tests
2023-11-14 16:07:17 +01:00
Eren Gölge d96f3885d5
Update to v0.20.4 2023-11-13 17:07:25 +01:00
Eren Gölge ac3df409a6
Merge pull request #3208 from coqui-ai/fix_max_mel_len
fix max generation length for XTTS
2023-11-13 14:32:56 +01:00
Eren Gölge f32a465711
Merge pull request #3207 from coqui-ai/update_xtts_cloning
Update XTTS cloning
2023-11-13 14:32:43 +01:00
Eren G??lge 92fa988aec Fixup 2023-11-13 13:44:06 +01:00
WeberJulian b85536b23f fix max generation length 2023-11-13 13:18:45 +01:00
Eren G??lge b2682d39c5 Make style 2023-11-13 13:01:01 +01:00