coqui-ai/TTS - TTS - Gitea: ArmstrongLabs

Commit Graph

Author	SHA1	Message	Date
Edresson Casanova	11283fce07	Ensures that only GPT model is in training mode during XTTS GPT training (#3241 ) * Ensures that only GPT model is in training mode during training * Fix parallel wavegan unit test	2023-11-17 15:13:46 +01:00
Eren G??lge	26efdf6ee7	Make k_diffusion optional	2023-11-17 13:42:33 +01:00
Eren G??lge	63d7145647	Update versions	2023-11-17 12:10:46 +01:00
Julian Weber	675f983550	Add sentence splitting (#3227 ) * Add sentence spliting * update requirements * update default args v2 * Add spanish * Fix return gpt_latents * Update requirements * Fix requirements	2023-11-16 11:01:11 +01:00
Matthew Boakes	1b9c400bca	PyTorch 2.1 Updates (Weight Norm and TorchAudio I/O) (#3176 ) * Replaced PyTorch weight_norm With parametrizations.weight_norm * TorchAudio: Migrating The I/O Functions To Use The Dispatcher Mechanism * Corrected Code Style --------- Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-11-09 16:31:03 +01:00
Edresson Casanova	e45227d9ff	XTTS v2.0 (#3137 ) * Implement most similar ref training approach * Use non-enhanced hifigan for test samples * Add Perceiver * Update GPT Trainer for perceiver support * Update XTTS docs * Bug fix masking with XTTS perceiver * Bug fix on gpt forward * Bug Fix on XTTS v2.0 training * Add XTTS v2.0 unit tests * Add XTTS v2.0 inference unit tests * Bug Fix on diffusion inference * Add XTTS v2.0 training recipe * Placeholder model entry * Add cloning params to config * Make prompt embedding configurable * Make cloning configurable * Cheap fix for a cheaper fix * Prevent resampling * Update model entry * Update docs * Update requirements * Code linting * Add xtts v2 to sep tests * Bug fix on XTTS get_gpt_cond_latents * Bug fix on rebase * Make style * Bug fix in Japenese tokenizer * Add num2words to deps * Remove unused kwarg and added num_beams=1 as default --------- Co-authored-by: Eren G??lge <egolge@coqui.ai>	2023-11-06 14:58:18 +01:00
Aarni Koskela	6277f09c5f	requirements.txt: loosen pandas pin (1.4 would need to be compiled from source on macs)	2023-09-26 20:43:59 +03:00
WeberJulian	089ad66df2	Lower the versions constraints	2023-09-25 17:00:41 +02:00
WeberJulian	bbfdfbffdf	Update transformers to latest	2023-09-25 11:46:38 +02:00
WeberJulian	f1c1d14c54	Add back umap	2023-09-25 11:12:01 +02:00
WeberJulian	a2a15392e0	fix package versions	2023-09-25 11:01:36 +02:00
Julian Weber	6916aa37ab	Fix fsspec requirement (#2970 ) * Fix requirment for fsspec * Use the right version this time	2023-09-19 15:54:12 -03:00
Eren G??lge	ee7cee0e35	Fixup	2023-09-13 18:21:44 +02:00
Unik	32b8ebb633	Updated scipy version (#2914 )	2023-09-04 11:39:19 +02:00
Paul O'Leary McCann	c0aabb8596	Make Japanese-specific dependencies optional (#2776 ) * Don't install MeCab by default * Add optional [ja] deps, like [dev] etc * Add JA requirements file * Add JA requirements to requirements_all This should help the tests run.	2023-07-24 11:28:27 +02:00
Eren G??lge	6b9ebf5aab	Merge branch 'p3_11' into dev	2023-06-28 12:13:04 +02:00
Eren Gölge	c844b6570a	Inference API for 🐶Bark (#2685 ) * Add bark requirements * Draft Bark implementation * Download HF models * Update synthesizer * Add bark model * Make style * Update pylintrc * Update model URLs * Update Bark Config * Fix here and ther * Make style * Make lint * Update requirements * Update requirements	2023-06-28 11:55:27 +02:00
Eren G??lge	d659dbe3c6	Remove fairseq	2023-06-26 19:31:56 +02:00
Eren G??lge	a1c431e6a9	Fixups	2023-06-26 12:55:18 +02:00
Eren G??lge	0cce2c0e89	Correct python_version	2023-06-22 14:07:35 +02:00
Eren G??lge	a58fb6c01b	Update requirements	2023-06-22 13:53:19 +02:00
Eren G??lge	9190f1a5f3	Update requirements	2023-06-21 12:22:37 +02:00
Eren G??lge	8597ee13af	Update requirements	2023-06-21 12:21:22 +02:00
Eren G??lge	deebc0cc16	Add bark requirements	2023-05-23 10:12:26 +02:00
manmay nakhashi	a3d5801c44	Tortoise TTS inference (#2547 ) * initial commit * Tortoise inference * revert path change * style fix * remove accidental remove * style fixes * style fixes * removed unwanted assests and deps * remove changes * remove cvvp * style fix black * added tortoise config and updated config and args, refactoring the code * added tortoise to api * Pull mel_norm from url * Use TTS cleaners * Let download model files * add ability to pass tortoise presets through coqui api * fix tests * fix style and tests * fix tts commandline for tortoise * Add config.json to tortoise * Use kwargs * Use regular model api for loading tortoise * Add load from dir to synthesizer * Fix Tortoise floats * Use model_dir when there are multiple urls * Use `synthesize` when exists * lint fixes and resolve preset bug * resolve a download bug and update model link * fix json * do tortoise inference from voice dir * fix * fix test * fix speaker id and remove assests * update inference_tests.yml * replace inference_test.yml * fix extra dir as None * fix tests * remove space * Reformat docstring * Add docs * Update docs * lint fixes --------- Co-authored-by: Eren Gölge <egolge@coqui.ai> Co-authored-by: Eren Gölge <erogol@hotmail.com>	2023-05-16 00:58:21 +02:00
Edresson Casanova	51a3d45025	Add FR and ES gruut languages as requirement to avoid inference issues (#2572 ) * Add all gruut supported languages as requeriment to avoid inference issues * Remove unused gruut languages	2023-05-03 09:49:01 -03:00
Eren Gölge	6505553da5	Add BN requirements	2023-04-17 12:54:14 +02:00
Matthew Boakes	5bdd6f7c18	Updated Librosa Dependency Specification	2023-04-06 12:36:24 +01:00
Matthew Boakes	4c829e74a1	Update Librosa Version To V0.10.0	2023-04-05 00:59:20 +01:00
Eren Gölge	d309f50e53	Implement FreeVC (#2451 ) * Update .gitignore * Draft FreeVC implementation * Tests and relevant updates * Update API tests * Add missings * Update requirements * :( * Lazy handle for vc * Update docs for voice conversion * Make style	2023-03-25 18:33:23 +01:00
Eren Gölge	090cadf270	Update numba version (#2435 )	2023-03-21 11:40:42 +01:00
p0p4k	a365a7e888	numpy version for py310 (#2316 ) * numpy version for py310 requested in #2315 * Update requirements.txt	2023-02-13 10:34:00 +01:00
Martin Weinelt	994be163e1	Use packaging.version for version comparisons (#2310 ) * Use packaging.version for version comparisons The distutils package is deprecated¹ and relies on PEP 386² version comparisons, which have been superseded by PEP 440³ which is implemented through the packaging module. With more recent distutils versions, provided through setuptools vendoring, we are seeing the following exception during version comparisons: > TypeError: '<' not supported between instances of 'str' and 'int' This is fixed by this migration. [1] https://docs.python.org/3/library/distutils.html [2] https://peps.python.org/pep-0386/ [3] https://peps.python.org/pep-0440/ * Improve espeak version detection robustness On many modern systems espeak is just a symlink to espeak-ng. In that case looking for the 3rd word in the version output will break the version comparison, when it finds `text-to-speech:`, instead of a proper version. This will not break during runtime, where espeak-ng would be prioritized, but the phonemizer and tokenizer tests force the backend to `espeak`, which exhibits this breakage. This improves the version detection by simply looking for the version after the "text-to-speech:" token. * Replace distuils.copy_tree with shutil.copytree The distutils module is deprecated and slated for removal in Python 3.12. Its usage should be replaced, in this case by a compatible method from shutil.	2023-01-29 23:47:00 +01:00
Edresson Casanova	49dfaa5234	Update the Trainer requirement version for a compatible one (#2276 )	2023-01-11 01:01:46 +01:00
Eren Gölge	c5412532ac	Remove langs expect en and de (#2135 )	2022-11-09 11:58:34 +01:00
Edresson Casanova	371772c355	Replace pyworld by pyin (#1946 ) * Replace pyworld by pyin * Fix unit tests	2022-09-09 10:43:14 +02:00
harmlessman	5abbe56642	Korean Phonemizer (#1822 ) * Update requirements.txt install jamo for korean * Update formatters.py add KSS formatter KSS is a korean single speech dataset (12hours) * Add files via upload add phonemizer for korean * Add files via upload add korean phonemizer * Update requirements.txt * change code style with `black` and `pylint` * reflecting pylint's Evaluation * reflecting pylint's Evaluation * reflecting pylint's Evaluation-2 * isort * edit about separator write test case and add 'nltk' for requirements.txt * add korean g2p (g2pkk) * isort * TTS/tts/utils/text/phonemizers/ko_kr_phonemizer.py:43:24: W0621: Redefining name 'text' from outer scope (line 58) (redefined-outer-name) TTS/tts/utils/text/korean/korean.py:28:8: R1705: Unnecessary "else" after "return" (no-else-return) * black	2022-09-08 12:06:07 +02:00
p0p4k	d9bad91a66	Update requirements.txt; inflect==5.6 (#1809 ) New inflect version (6.0) depends on pydantic which has some issues irrelevant to 🐸 TTS. #1808 Force inflect==5.6 (pydantic free) install to solve dependency issue.	2022-08-01 11:48:02 +02:00
p0p4k	669966d963	Update requirements.txt (#1791 ) Support for #1775	2022-07-26 13:06:40 +02:00
Noran Raskin	a790df4e94	Training recipes for thorsten dataset (#1020 ) * Fix style * Fix isort * Remove tensorboardX from requirements Co-authored-by: logan hart <72301874+loganhart420@users.noreply.github.com> Co-authored-by: Eren Gölge <egolge@coqui.ai>	2022-05-30 12:07:31 +02:00
Eren Gölge	4857967063	🐍 Python 3.10.x support and drop Python 3.6 support (#1565 ) * Update requirements * Update CI for p3.10 * Update numpy requirement * Drop 🐍p3.6 support Numpy also dropped support for p3.6 * Bind cython v0.29.28 * Bind pyworld to v0.2.10 > 0.2.10 is not p3.10.x compatible * Update Dockerfile	2022-05-12 15:50:25 +02:00
Edresson Casanova	a41e860a66	Update Coqpit requirement (#1539 )	2022-04-26 17:39:36 +02:00
Eren Gölge	164c7dd676	Update requirements coqui_trainer -> trainer (#1478 )	2022-04-08 14:47:09 +02:00
WeberJulian	c66a6241fd	Enforce phonemizer definition for synthesis (#1441 ) * Enforce phonemizer definition for synthesis * Fix train_tts, tokenizer init can now edit config * Add small change to trigger CI pipeline * fix wrong output path for one tts_test * Fix style * Test config overides by args and tokenizer * Fix style	2022-03-25 23:15:33 +01:00
Edresson Casanova	ea53d6feb3	Replace webrtcvad by silero-vad	2022-03-23 14:39:31 -03:00
Eren Gölge	c7f9ec07c8	Hinge Gruut version to 2.2.3 (#1419 )	2022-03-18 16:47:50 +01:00
Eren Gölge	95e551dd0a	Update requirements.txt for coqui-trainer	2022-03-07 14:31:25 +01:00
Eren Gölge	45f1e1f786	Update requirements.txt	2022-03-06 14:24:19 +01:00
Eren Gölge	fc8264d9d2	Update requirements	2022-02-25 11:31:20 +01:00
Eren Gölge	33b98e6cc3	Update requirements.txt	2022-02-25 11:26:59 +01:00

1 2 3

133 Commits (e535cfe07c3498e78e9a5ceba9925d3eae4167ca)