[Feature request] Specify speaker_wav for voice cloning in Coqui server #254

eginhard · 2025-01-10T18:53:18Z

🚀 Feature Description

With #252, all Coqui TTS models can be used in the tts-server for speech synthesis. However, some models like XTTS also support voice cloning with a speaker_wav reference file.

Solution

Add a file selector to the tts-server to specify the speaker_wav file(s), similar to the existing support for style_wav for certain models.

The text was updated successfully, but these errors were encountered:

eginhard added enhancement New feature or request good first issue Good for newcomers labels Jan 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature request] Specify speaker_wav for voice cloning in Coqui server #254

[Feature request] Specify speaker_wav for voice cloning in Coqui server #254

eginhard commented Jan 10, 2025

[Feature request] Specify speaker_wav for voice cloning in Coqui server #254

[Feature request] Specify speaker_wav for voice cloning in Coqui server #254

Comments

eginhard commented Jan 10, 2025