You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm having problem training at a hight sampling rate, I'm not very good at code but I thought that it is just changing some parameter and ask chatgpt, but I'm faced problems like n_mel is wrong there is a mismatch, tensor a must be the same as tensor b. I couldn't find any help or the way I'm trying is wrong.
And also I have some questions to the VITS training, first thing is that I don't know how to continue a training from a checkpoint, I let chatgpt if the code has some hints but I didn't managed to continue a training, but actually it isn't very important as I feel my computer pretty consistant but sometimes windows shuts down to update...
The output model is a single .pth, I've used VITS models and they were G_xxxxx.pth and a D_xxxxx.pth and both weren't this big. I trained a 22050hz model to try, and I failed to load that model into my program, and maybe it's the config.json's fault as it is very different to others vits models I've tried...
Also I left default the phoneme language: phoneme_language="en-us", as I'm training for a person that speaks en-us I didn't change.
The code i used to train 22050hz is the default and I hope someone teach me how to train at 48khz or 44.1khz :)
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I'm having problem training at a hight sampling rate, I'm not very good at code but I thought that it is just changing some parameter and ask chatgpt, but I'm faced problems like n_mel is wrong there is a mismatch, tensor a must be the same as tensor b. I couldn't find any help or the way I'm trying is wrong.
And also I have some questions to the VITS training, first thing is that I don't know how to continue a training from a checkpoint, I let chatgpt if the code has some hints but I didn't managed to continue a training, but actually it isn't very important as I feel my computer pretty consistant but sometimes windows shuts down to update...
The output model is a single .pth, I've used VITS models and they were G_xxxxx.pth and a D_xxxxx.pth and both weren't this big. I trained a 22050hz model to try, and I failed to load that model into my program, and maybe it's the config.json's fault as it is very different to others vits models I've tried...
Also I left default the phoneme language: phoneme_language="en-us", as I'm training for a person that speaks en-us I didn't change.
The code i used to train 22050hz is the default and I hope someone teach me how to train at 48khz or 44.1khz :)
Beta Was this translation helpful? Give feedback.
All reactions