Coqui TTS

Text-to-speech extension for Oobabooga's text-generation-webui using Coqui TTS.

Installation

Assuming you already have the WebUI set up:

Install eSpeak-NG and ensure it is in your PATH
Activate the conda environment, either with the cmd_xxx.bat script or by running conda activate textgen
Enter the text-generation-webui/extensions/ directory and clone this repository

cd text-generation-webui/extensions/
git clone https://github.com/Fire-Input/text-generation-webui-coqui-tts coqui_tts

Return to the text-generation-webui root directory and install the requirements

pip install -r extensions/coqui_tts/requirements.txt
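
Before launching the WebUI, it can help to confirm that eSpeak-NG is actually reachable from PATH. The sketch below is a minimal check and is not part of the extension. It assumes the executable is named espeak-ng, and the helper name find_espeak is made up for illustration.

```python
import shutil
from typing import Optional


def find_espeak(executable: str = "espeak-ng") -> Optional[str]:
    """Return the full path of the eSpeak-NG executable, or None if it is not on PATH.

    The default name "espeak-ng" is an assumption; adjust it if your install differs.
    """
    return shutil.which(executable)


if __name__ == "__main__":
    path = find_espeak()
    if path is None:
        print("espeak-ng was not found on PATH; install it or update PATH.")
    else:
        print(f"espeak-ng found at {path}")
```

Run it inside the activated conda environment, since that is the environment the WebUI will use to look up the executable.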

Notes

By default, the coqui_tts extension automatically downloads the pretrained model tts_models/en/vctk/vits. It is less than 200 MB in size and is downloaded to /home/USER/.local/share/tts on Linux and C:\Users\USER\AppData\Local\tts on Windows.
When running oobabooga, the tts package (version TTS==0.17.4) may throw an error about numpy if you are using Python < 3.11. If it does, run pip install numpy==1.24.4 and pip install numba==0.57.1 to install the most compatible versions of numpy and numba, then restart the WebUI. Ignore any error messages about incompatible package versions; the tts package still needs to update its requirements.txt to later versions of numpy and numba.
Custom models are not supported yet.
Every time you generate new audio, Coqui prints a log message to the console. This is normal and unfortunately cannot be disabled.
Audio files are saved to text-generation-webui/extensions/coqui_tts/outputs/
A lot of the code is copied from the ElevenLabs extension.
Some code is also copied from da3dsoul's fork.
Coqui Studio is not supported yet: I do not have a Coqui Studio API key, so I cannot test it.
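
The numpy and numba workaround from the notes above can be checked with a short script. This is a rough sketch, not part of the extension: it uses only the Python standard library, the helper names are hypothetical, and the pinned versions are simply the ones quoted in the notes.

```python
import sys
from importlib import metadata
from typing import Dict, Optional

# Versions quoted in the notes above for Python < 3.11.
PINNED = {"numpy": "1.24.4", "numba": "0.57.1"}


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def mismatched_pins(installed: Dict[str, Optional[str]]) -> Dict[str, str]:
    """Return {package: pinned_version} for every package that differs from its pin."""
    return {
        name: pin
        for name, pin in PINNED.items()
        if installed.get(name) != pin
    }


if __name__ == "__main__":
    if sys.version_info >= (3, 11):
        print("Python >= 3.11: the pins in the notes may not apply.")
    else:
        found = {name: installed_version(name) for name in PINNED}
        for name, pin in mismatched_pins(found).items():
            print(f"{name}: installed {found[name]}, try: pip install {name}=={pin}")
```

If the script prints nothing on Python < 3.11, both packages already match the pinned versions and a remaining numpy error likely has another cause.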

Testing Environment

Windows 11
Conda Installation with WSL2
WSL2 Ubuntu 22.04
Python 3.9.16
numpy==1.21.6
Conda 23.3.1
CUDA 11.7
WebUI commit: 68dcbc7ebda3f0d9700dde43d0d29324f5c244b1
eSpeak-NG 1.50
