xinkez
@jmvalin I find that the VAD used in the Opus code is great. I want to change the frame duration from 20 ms to 16 ms, but I don't know the principle...
Thank you, it works now. Two more questions: as far as I know, there are 12 speakers in the VCC 2018 training dataset, so why do you set the parameter to 9...
Thank you. I ran your updated code. Training now works, but at the dev stage it outputs the errors below. Should the number...
@xiaochunxin As training proceeds, "loss", "metrics_d" and "metrics_g" become NaN, and these values never return to a normal state.
Yes, I'm using the VCC2018 corpus. First I used all 12 speakers to train the model; the loss is normal, but the wavs generated after training are not...
@baipeng1 I also ran into the "can not find the model named pytorch_model.bin" problem. Have you solved it?
> accelerate version >= 0.25 change the .bin to .safetensors. > > just replace "pytorch_model.bin" with "model.safetensors" I replaced "pytorch_model.bin" with "model.safetensors", but it then outputs the errors below. Do this...
> My AR model still can't accurately predict speech duration after 20 epochs/ 800K steps of training. Did you get reasonable results with your AR model? I noticed that the...
> 2.3.0 Hi, did you solve the problem? Thanks in advance.
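Hello, I saw that you updated the code. If I want to train the model myself, which scripts are for data preparation and which are for training? Or could you update the README with training instructions? Thanks.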