Commit Graph

4474 Commits (fix_xtts_v1.1_)

Author SHA1 Message Date
prakharpbuf c1875f68df
typos and minor fixes (#2508)
* Update tacotron1-2.md

* Update README.md

* Update Tutorial_2_train_your_first_TTS_model.ipynb

* Update synthesizer.py

There is no arg called --speaker_name

* Update formatting_your_dataset.md

* Update AnalyzeDataset.ipynb

* Update AnalyzeDataset.ipynb

* Update AnalyzeDataset.ipynb

* Update finetuning.md

* Update train_yourtts.py

* Update train_yourtts.py

* Update train_yourtts.py

* Update finetuning.md
2023-04-26 15:22:57 +02:00
Eren Gölge 2071088bab
Bump up to v0.13.3 2023-04-17 16:13:35 +02:00
Eren Gölge b2bc2ac797
Merge pull request #2532 from coqui-ai/bangla_model
Bangla models
2023-04-17 16:13:00 +02:00
Eren Gölge 1a6a5710fd Make lint 2023-04-17 15:02:56 +02:00
Eren Gölge a44a0e1fd2 Update model urls 2023-04-17 14:53:27 +02:00
Eren Gölge d3e215f8bd Add link 2023-04-17 13:48:55 +02:00
Eren Gölge 2533a18d62 Add BN tests 2023-04-17 13:37:10 +02:00
Eren Gölge 2d49c05259 Remove import 2023-04-17 13:05:29 +02:00
Eren Gölge 5e5768d784 Fix API 2023-04-17 13:05:19 +02:00
Eren Gölge bce819a624 Add docs for adding a new lang frontend 2023-04-17 12:54:35 +02:00
Eren Gölge 6505553da5 Add BN requirements 2023-04-17 12:54:14 +02:00
Eren Gölge cd83991067 Add BN phonemizer 2023-04-17 12:54:00 +02:00
Eren Gölge 36be05290d Add models 2023-04-17 12:52:32 +02:00
Eren Gölge e4c5c27854
Bump up to v0.13.2 2023-04-14 10:23:39 +02:00
Eren Gölge dba5cec497
Merge pull request #2509 from coqui-ai/update_vad
Update VAD
2023-04-13 19:35:17 +02:00
Eren Gölge e07c6f54fd
Merge pull request #2515 from coqui-ai/tts_cmd
🐸Studio models by `tts`
2023-04-13 19:34:28 +02:00
Eren Gölge 5a9bda13f3 Make style 2023-04-13 14:19:06 +02:00
Eren Gölge c9375e4b8b Make style 2023-04-13 14:17:06 +02:00
Eren Gölge 758ef84cc2 Using 🐸Studio models with `tts` command 2023-04-13 14:14:41 +02:00
Eren G??lge 537dc0e933 Update VAD 2023-04-13 00:39:46 +02:00
Eren Gölge e33e7170ed Bump up to v0.13.1 2023-04-12 16:20:53 +02:00
Eren Gölge 8da3342676 Ping API 2023-04-12 16:20:53 +02:00
Eren Gölge 73d963718a
Merge pull request #2495 from coqui-ai/api_voice_conversion
Api voice conversion
2023-04-11 16:40:14 +02:00
Eren Gölge cbb592b295 Fixup 2023-04-10 14:50:11 +02:00
Eren Gölge b8b9f09de5 Fixup 2023-04-10 14:06:31 +02:00
Eren Gölge 76511972e9 Add freevc to the models list 2023-04-10 14:03:08 +02:00
Eren Gölge 209f0a509a Add voice conversion api tes 2023-04-10 13:37:47 +02:00
Eren Gölge a49c1931d9 Fixup 2023-04-10 13:33:42 +02:00
Eren Gölge 5bd1fb6b2c Fix API for voice conversion 2023-04-10 13:32:16 +02:00
Eren Gölge 30109af2a0
Merge pull request #2480 from MattyB95/librosa_v0.10.0
Update Librosa Version To V0.10.0
2023-04-07 12:32:33 +02:00
Matthew Boakes 5bdd6f7c18 Updated Librosa Dependency Specification 2023-04-06 12:36:24 +01:00
Eren Gölge 1233365cf4 Bump up to v0.13.0 2023-04-05 15:09:31 +02:00
Eren Gölge ad8b9bf2be
🐸 Coqui Studio API integration (#2484)
* Warn when lang is not avail

* Make style

* Implement Coqui Studio API

* Test

* Update docs

* Set action

* Make style

* Make lint

* Update README

* Make style

* Fix action

* Run actions
2023-04-05 15:06:50 +02:00
Wesley Pyburn ce79160576
Fix errors in README.md (#2478)
Running the sample code below results in an error `language_id = self.tts_model.language_manager.name_to_id[language_name]`.
The fix is running the code with the correct language strings, the readme has been updated in this PR to work.

I assume this small typo leads to #2456 and #2458
2023-04-05 12:23:07 +02:00
Matthew Boakes 4c829e74a1 Update Librosa Version To V0.10.0 2023-04-05 00:59:20 +01:00
Yingzhi WANG 95fa2c9fd6
fix typo (#2475) 2023-04-03 23:31:09 +02:00
p0p 91cf1b2da9
[minor] batch["speaker_ids"] getting set two times (#2470)
* [minor] batch["speaker_ids"] getting set two times

just to make it consistent with language_ids

* Update vits.py

style.
2023-04-03 11:35:21 +02:00
Rajiv P c2d15cd413
[minor] hifigan_generator.py typo (#2462)
resblock2 description updated.
2023-03-28 12:43:36 +02:00
Eren Gölge d309f50e53
Implement FreeVC (#2451)
* Update .gitignore

* Draft FreeVC implementation

* Tests and relevant updates

* Update API tests

* Add missings

* Update requirements

* :(

* Lazy handle for vc

* Update docs for voice conversion

* Make style
2023-03-25 18:33:23 +01:00
Eren Gölge 090cadf270
Update numba version (#2435) 2023-03-21 11:40:42 +01:00
Khalid Bashir 14c80dd1fd
vits.py training fixed due to return_complex (#2418)
Torch set default value for `return_complex=True` for `torch.stft` method
This turned warning into error:-
```
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1591, in fit
    self._fit()
  File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1544, in _fit
    self.train_epoch()
  File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1309, in train_epoch
    _, _ = self.train_step(batch, batch_num_steps, cur_step, loader_start_time)
  File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1162, in train_step
    outputs, loss_dict_new, step_time = self._optimize(
  File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 1023, in _optimize
    outputs, loss_dict = self._model_train_step(batch, model, criterion, optimizer_idx=optimizer_idx)
  File "/usr/local/lib/python3.10/dist-packages/trainer/trainer.py", line 970, in _model_train_step
    return model.train_step(*input_args)
  File "/workspace/coqui-tts/TTS/tts/models/vits.py", line 1293, in train_step
    mel_slice_hat = wav_to_mel(
  File "/workspace/coqui-tts/TTS/tts/models/vits.py", line 191, in wav_to_mel
    spec = torch.stft(
  File "/usr/local/lib/python3.10/dist-packages/torch/functional.py", line 641, in stft
    return _VF.stft(input, n_fft, hop_length, win_length, window,  # type: ignore[attr-defined]
RuntimeError: stft requires the return_complex parameter be given for real inputs, and will further require that return_complex=True in a future PyTorch release.
```
2023-03-19 00:22:04 +01:00
Eren Gölge 12f3365185 Merge branch 'dev' 2023-03-17 13:31:08 +01:00
Eren Gölge 2db262747e
Bump up to v0.12.0 2023-03-17 13:21:03 +01:00
Roee Shenberg 3c15f0619a
Bug fixes in OverFlow audio generation (#2380) 2023-03-15 12:02:11 +01:00
Eren Gölge b8d9837d27
Merge pull request #2407 from dveni/patch-1
Update vits.py
2023-03-14 10:28:23 +01:00
Daniel Vera Nieto dfb48737fb Style fixed 2023-03-13 16:11:15 +01:00
Eren Gölge 9bb62c570d
Merge pull request #2390 from coqui-ai/dev
v0.12.0
2023-03-13 12:43:38 +01:00
Eren Gölge c10f9a3699
Update docs (#2389)
* Update docs index

* Add MarryTTS docs

* Update docs index

* Add Overflow docs
2023-03-13 12:42:20 +01:00
Eren Gölge 4ca07514d4
Remove doc bot (#2399) 2023-03-13 12:42:01 +01:00
Dani Vera 0d12229b64
Update vits.py
This should fix the issue https://github.com/coqui-ai/TTS/issues/1986 without breaking batch data sampling.
2023-03-10 18:35:16 +01:00