sapo/README.md

# Sapo

A bash script that can convert txt to wav using the all powerful https://github.com/coqui-ai/TTS

## TTS

https://github.com/coqui-ai/TTS

### INSTALL TTS

> pip install TTS

### FIX LONG UTTERANCES PROBLEM

https://dirk.net/2021/10/31/tts-fix-max-decoder-steps/

### OTHER DEPENDENCIES

> sed yad sox

  * As a text editor I use xed. If you prefer, however, another text editor by default (gedit, geany, mousepad etc), please substitute __xed__ in _line 82_ of __Sapo.sh__ with the respective command of your preffered editor.


### SED SCRIPT

sapofonetix.sed is a script that substitutes words that get mispelled  with other letter combinations, that have the right pronunciation result, e.g. 
> s/biscuit/biskit/g;s/Biscuit/biskit/g

will substitute the word _biscuit_ (or _Biscuit_ in plural) with the word _biskeet_ (_Biskeet_), that its pronunciation sounds more proper.

The list of words is growing as the script gets used more, ___<u>feel free to chime in!</u>___


### SCREENSHOTS

  * File selection dialog

![0.png](screenshots/0.png)

---

  * The file is delimited to lines with fewer characters each, so there  will be no problem with the text-to-speech conversion due to excessively long lines. However, the user can edit the file further before thw speech conversion.

![1.png](screenshots/1.png)

---

![2.png](screenshots/2.png)

---

  * Progress bar , and rough estimate of time left (probably depends on hardware)

![3.png](screenshots/3.png)

---

  * Process complete, the final wav file is inside the created **Sapo_filename** folder, named **filename.wav**. 

    If the wav files (one for each line of  text file) are too many, the final wav file 
 will not be produced. In this case concatetate the wav files in smaller batches ( every 500 files), and then concatenate _those_ to the final sound file, using the **sox** command, for example:

> cd Sapo_1_1.txt
> 
> sox {000001..000500}.wav ~/Desktop/1f.wav
> 
> sox {000501..001000}.wav ~/Desktop/2f.wav
> 
> sox {001001..001500}.wav ~/Desktop/3f.wav
> 
> cd ~/Desktop
> 
> sox {1..3}f.wav final.wav
> 

![4.png](screenshots/4.png)
Initial commit 2022-03-04 23:44:10 +00:00			`# Sapo`

			`A bash script that can convert txt to wav using the all powerful https://github.com/coqui-ai/TTS`

Update README.md 2022-03-04 23:51:17 +00:00			`## TTS`
Initial commit 2022-03-04 23:44:10 +00:00
Update README.md 2022-03-04 23:51:17 +00:00			`https://github.com/coqui-ai/TTS`
Initial commit 2022-03-04 23:44:10 +00:00
Update README.md 2022-03-04 23:51:17 +00:00			`### INSTALL TTS`
Initial commit 2022-03-04 23:44:10 +00:00
Update README.md 2022-03-04 23:51:17 +00:00			`> pip install TTS`
Initial commit 2022-03-04 23:44:10 +00:00
Update README.md 2022-03-04 23:51:17 +00:00			`### FIX LONG UTTERANCES PROBLEM`
Initial commit 2022-03-04 23:44:10 +00:00
Update README.md 2022-03-04 23:51:17 +00:00			`https://dirk.net/2021/10/31/tts-fix-max-decoder-steps/`
update files 2022-03-06 18:34:52 +00:00
update README.md 2022-03-06 18:36:06 +00:00			`### OTHER DEPENDENCIES`
update files 2022-03-06 18:34:52 +00:00
			`> sed yad sox`
update README.md 2022-03-06 21:50:56 +00:00
update README.md 2022-03-07 00:26:43 +00:00			`* As a text editor I use xed. If you prefer, however, another text editor by default (gedit, geany, mousepad etc), please substitute __xed__ in _line 82_ of __Sapo.sh__ with the respective command of your preffered editor.`


upload README.md 2022-03-06 22:04:27 +00:00			`### SED SCRIPT`

			`sapofonetix.sed is a script that substitutes words that get mispelled with other letter combinations, that have the right pronunciation result, e.g.`
			`> s/biscuit/biskit/g;s/Biscuit/biskit/g`

			`will substitute the word _biscuit_ (or _Biscuit_ in plural) with the word _biskeet_ (_Biskeet_), that its pronunciation sounds more proper.`

			`The list of words is growing as the script gets used more, ___<u>feel free to chime in!</u>___`


update README.md 2022-03-06 21:50:56 +00:00			`### SCREENSHOTS`

upload README.md 2022-03-06 23:00:19 +00:00			`* File selection dialog`

update README.md 2022-03-06 21:50:56 +00:00			`![0.png](screenshots/0.png)`

			`---`

upload README.md 2022-03-06 23:00:19 +00:00			`* The file is delimited to lines with fewer characters each, so there will be no problem with the text-to-speech conversion due to excessively long lines. However, the user can edit the file further before thw speech conversion.`

update README.md 2022-03-06 21:50:56 +00:00			`![1.png](screenshots/1.png)`

			`---`

			`![2.png](screenshots/2.png)`

			`---`

upload README.md 2022-03-06 23:00:19 +00:00			`* Progress bar , and rough estimate of time left (probably depends on hardware)`
update README.md 2022-03-06 21:50:56 +00:00
			`![3.png](screenshots/3.png)`

			`---`

upload README.md 2022-03-06 23:00:19 +00:00			`* Process complete, the final wav file is inside the created Sapo_filename folder, named filename.wav.`

			`If the wav files (one for each line of text file) are too many, the final wav file`
			`will not be produced. In this case concatetate the wav files in smaller batches ( every 500 files), and then concatenate _those_ to the final sound file, using the sox command, for example:`

			`> cd Sapo_1_1.txt`
			`>`
			`> sox {000001..000500}.wav ~/Desktop/1f.wav`
			`>`
			`> sox {000501..001000}.wav ~/Desktop/2f.wav`
			`>`
			`> sox {001001..001500}.wav ~/Desktop/3f.wav`
			`>`
			`> cd ~/Desktop`
			`>`
Update README.md 2022-03-06 23:06:27 +00:00			`> sox {1..3}f.wav final.wav`
upload README.md 2022-03-06 23:00:19 +00:00			`>`

Update README.md 2022-03-06 23:06:27 +00:00			`![4.png](screenshots/4.png)`