vits
Can you provide typical loss figures? KL loss diverging
Hello,
I am training from scratch on custom data. The HiFi-GAN part converged relatively quickly, and the generated samples in the evaluation TensorBoard sound really good.
However, the inference samples generated from phonemes don't seem to improve. Moreover, the KL loss, which, as I understand it, should be the next loss to converge, is diverging instead.
Here are my generator losses:

The jump is because the training was interrupted and the step number is wrong when loading from a checkpoint.
So, is this normal? Do you have typical loss figures you could share for comparison? Thanks.
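
Regarding the step-number jump mentioned above: one way to keep the curves continuous across restarts might be to store the exact step counter inside the checkpoint and read it back on resume. The following is only a minimal sketch under that assumption. The helper names `save_checkpoint` and `load_checkpoint`, the JSON file layout, and the `lr` value are hypothetical and are not taken from the repository's code.

```python
import json
import os
import tempfile


def save_checkpoint(path, state, global_step):
    """Write the training state together with the exact step counter."""
    with open(path, "w") as f:
        json.dump({"state": state, "global_step": global_step}, f)


def load_checkpoint(path):
    """Return the saved state and the step at which training stopped."""
    with open(path) as f:
        ckpt = json.load(f)
    return ckpt["state"], ckpt["global_step"]


# Hypothetical usage: resume logging from the stored step instead of
# recomputing it from the epoch number.
path = os.path.join(tempfile.mkdtemp(), "ckpt.json")
save_checkpoint(path, {"lr": 2e-4}, global_step=12345)
state, resumed_step = load_checkpoint(path)
print(resumed_step)
```

In an actual PyTorch run the same idea would presumably apply to the dictionary passed to `torch.save`, with the restored value used as the step when logging scalars.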