diff --git a/README.md b/README.md
index 7d9884b0..20799943 100644
--- a/README.md
+++ b/README.md
@@ -17,9 +17,11 @@ If you are new, you can also find [here](http://www.erogol.com/text-speech-deep-
 [Details...](https://github.com/mozilla/TTS/wiki/Mean-Opinion-Score-Results)
 
 ## Features
-- High performance Text2Speech models on Torch and Tensorflow 2.0.
-- High performance Speaker Encoder to compute speaker embeddings efficiently.
-- Integration with various Neural Vocoders (PWGAN, MelGAN, WaveRNN)
+- High performance Deep Learning models for Text2Speech related tasks.
+    - Text2Speech models (Tacotron, Tacotron2).
+    - Speaker Encoder to compute speaker embeddings efficiently.
+    - Vocoder models (MelGAN, Multiband-MelGAN, GAN-TTS)
+- Ability to convert Torch models to Tensorflow 2.0 for inference.
 - Released trained models.
 - Efficient training codes for PyTorch. (soon for Tensorflow 2.0)
 - Codes to convert Torch models to Tensorflow 2.0.
diff --git a/vocoder/train.py b/vocoder/train.py
index d401e72e..03e14f4a 100644
--- a/vocoder/train.py
+++ b/vocoder/train.py
@@ -52,7 +52,7 @@ def setup_loader(ap, is_val=False, verbose=False):
     # sampler = DistributedSampler(dataset) if num_gpus > 1 else None
     loader = DataLoader(dataset,
                         batch_size=1 if is_val else c.batch_size,
-                        shuffle=False,
+                        shuffle=True,
                         drop_last=False,
                         sampler=None,
                         num_workers=c.num_val_loader_workers