Commit Graph

175 Commits (2f2482f9b4130ef1cb72caaadc9314a523ec2e88)

Author SHA1 Message Date
Thomas Werkmeister 2f2482f9b4 reading all speakers upfront 2019-07-10 18:38:55 +02:00
Thomas Werkmeister d23e29ea1f extracted id to torch code 2019-07-02 14:40:01 +02:00
Thomas Werkmeister ba8cc8054b disabling multispeaker with num_speakers=0 2019-07-01 14:01:34 +02:00
Thomas Werkmeister 04e452d8cb Merge branch 'tacotron-gst' of github.com:mozilla/TTS into multispeaker 2019-07-01 14:00:22 +02:00
Eren Golge 464cc29756 Make optional reampling of the read wav 2019-06-26 14:11:30 +02:00
Thomas Werkmeister 05ff8801d1 config, benchmark notebook, synthesis fixed 2019-06-26 13:31:16 +02:00
Thomas Werkmeister d172a3d3d5 multispeaker 2019-06-26 12:59:14 +02:00
Eren Golge 51f1cd67e3 bug fix 2019-06-15 01:22:27 +02:00
Eren Golge 037ec13453 config update, audio.py update and modularize synthesize.py 2019-06-14 16:18:49 +02:00
Eren Golge e061ed091a modularize synthesis 2019-06-12 12:12:22 +02:00
Eren Golge 0f8936d744 GST inference 2019-06-12 12:12:01 +02:00
Eren Golge 63eea4a364 bug fix 2019-06-06 10:24:34 +02:00
Eren Golge 7410daceb2 Adapt TTS for TacotronGST and some changes for Audio.py , better config.json naming 2019-06-05 18:33:57 +02:00
Eren Golge 4678c66599 forward_attn_mask and config update 2019-06-04 00:39:29 +02:00
Eren Golge f096f1052f config updates, update audio.py, update mailabs preprocessor 2019-06-03 15:34:36 +02:00
Eren Golge 70929387c0 Merge branch 'dev-tacotron2' 2019-05-28 14:59:24 +02:00
Eren Golge 0dbed8fef7 New method to convert Tacotron output to mel psectrograms 2019-05-27 14:41:59 +02:00
Eren Golge ba492f43be Set tacotron model parameters to adap to common_layers.py - Prenet and Attention 2019-05-27 14:40:28 +02:00
Eren Golge d4b900f6c9 use soundfile for faster read 2019-05-23 02:02:22 +02:00
Eren Golge e62659da94 update separate stopnet flow to make it faster. 2019-05-17 16:15:43 +02:00
Eren Golge 832dc3eafa bug fix 2019-05-15 12:37:31 +02:00
Eren Golge bb2b705e01 small bug fixes 2019-05-14 13:53:26 +02:00
Eren Golge 5e679f746d save figures in visualize of set 2019-05-12 17:35:44 +02:00
Eren Golge 6331bccefc make dropout oprional #2 2019-05-12 17:35:31 +02:00
Eren Golge e2439fde9a make location attention optional and keep all attention weights in attention class 2019-04-29 11:37:01 +02:00
Eren Golge 01dbfb3a0f Server update s 2019-04-18 17:35:20 +02:00
Eren Golge 3c2d500f53 Changesat windowing and some comments 2019-04-12 16:13:40 +02:00
Eren Golge 9466505f27 Make eos bos chars optional 2019-04-12 16:12:15 +02:00
Eren Golge e2cf35bb10 Make loss masking optional 2019-04-10 16:41:08 +02:00
Eren Golge 8a47b46195 print warning if a layer in ehckpoint is not defined in model definition 2019-04-08 19:32:07 +02:00
Eren Golge 961af0f5cd setup_model externally based on model selection. Make forward attention and prenet type configurable in config.json 2019-04-05 17:49:18 +02:00
Eren Golge 7baaf140f9 Remove start character for phonme sequenceing 2019-04-04 10:49:09 +02:00
Eren Golge 2e361e2306 strip sting after phonemizer 2019-03-29 17:05:44 +01:00
Eren Golge 103971c893 text processing updates with tests 2019-03-29 17:04:10 +01:00
Eren Golge 6edd8bc6dd add git branch and restore_path to copied config file for each run 2019-03-29 17:01:57 +01:00
Eren Golge 1ed4978e69 text processing update 2019-03-27 14:57:36 +01:00
Eren Golge 76d5e065db phoneme_to_sequence bug fix 2019-03-27 14:57:26 +01:00
Eren Golge fdca8402c7 config updates 2019-03-26 15:46:26 +01:00
Eren Golge d8908692c5 refactor partial reinit script as a function. Allow user to select layers to reinit in finutunning 2019-03-23 17:19:40 +01:00
Eren Golge 06a7aeb26d git commit bug fix for phonimizer 2019-03-23 16:44:38 +01:00
Eren Golge f96945443e add start char but remove end char 2019-03-22 23:48:44 +01:00
Eren Golge d6307fbb7f config update 2019-03-22 19:12:58 +01:00
Eren Golge ff7258062c skip the alst empty char in phonemes to sequence. It breaks the alingment 2019-03-20 12:24:04 +01:00
Eren Golge 5acc9db4ac
Add empty character to phonemes 2019-03-12 10:16:42 +01:00
gnosly 95de2cd559 added missing phonemes, synthesizer.py now setup the correct input layer 2019-03-11 21:56:40 +01:00
Eren Golge b9b79fcf0f inference truncated NEED TO BE TESTED 2019-03-11 17:40:09 +01:00
Eren Golge 5754116c19 bos char addded 2019-03-06 22:06:01 +01:00
Eren Golge a2a22d253f synthesis update compatible with multiplt architecture 2019-03-06 13:11:46 +01:00
Eren Golge 08162157ee generic train.py for multiple architectures set on config.json 2019-03-06 13:11:22 +01:00
Eren Golge 1e8fdec084 Modularize functions in Tacotron 2019-03-05 13:25:50 +01:00