TTS/notebooks/dataset_analysis
Eren Gölge ef4ea9e527 update imports for `formatters` 2021-06-28 17:03:19 +02:00
..
AnalyzeDataset.ipynb update imports for `formatters` 2021-06-28 17:03:19 +02:00
CheckDatasetSNR.ipynb Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2021-03-18 14:09:47 +01:00
CheckSpectrograms.ipynb update CheckSpec notebook 2021-03-24 12:52:56 +01:00
PhonemeCoverage.ipynb update imports for `formatters` 2021-06-28 17:03:19 +02:00
README.md Mass refactoring 2020-07-17 11:16:05 +02:00
analyze.py reformatting and styling 2021-04-12 11:47:39 +02:00

README.md

Simple Notebook to Analyze a Dataset

By the use of this notebook, you can easily analyze a brand new dataset, find exceptional cases and define your training set.

What we are looking in here is reasonable distribution of instances in terms of sequence-length, audio-length and word-coverage.

This notebook is inspired from https://github.com/MycroftAI/mimic2