How to use Multi-Singer for English SVS

Hello there, @Rongjiehuang  @a-ggghost @SunMail-hub 

I'm trying to use Multi-Singer for SVS for English singers and I'm new to speech-related tasks. Thus I have a few questions about how to adapt this to English and about a few steps that are not clear to me.

## 1 - Encoder

From my understanding, the encoder that's being used is the same in [here](https://github.com/dipjyoti92/speaker_embeddings_GE2E) and I wonder if re-training would be necessary since in the Multi-Singer paper it was mentioned that the encoder was trained in several datasets with languages that include English (supposing that the [provided checkpoint](https://github.com/Rongjiehuang/Multi-Singer/blob/main/pretrained1.pt) comes from that training)?

## 2 - Multi-Singer

Should I re-train Multi-Singer?

## 3 - Modified Fastspeech 2 + Multi-Singer for SVS

Is the modified version that was used in the paper in the linked repo of [Fastspeech 2](https://github.com/ming024/FastSpeech2)? Because the architecture in the repo is different from the one in the Multi-Singer paper in the Appendix.

Also, to generate the acoustic features with Fastspeech for an unseen singer how would the singer embedding be added without pre-training Fastspeech with the new singer? 

I know that's a lot to ask, but could you give an example of how to use Fastspeech 2 + Multi-Singer to accomplish SVS for an unseen singer?

Thanks a lot in advance and sorry for the number of questions :P


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to use Multi-Singer for English SVS #12

1 - Encoder

2 - Multi-Singer

3 - Modified Fastspeech 2 + Multi-Singer for SVS

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

How to use Multi-Singer for English SVS #12

Description

1 - Encoder

2 - Multi-Singer

3 - Modified Fastspeech 2 + Multi-Singer for SVS

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions