seekerzz
Thanks for your reply! Have you tried the multi-speaker setting? I used the code to train on LibriTTS. However, the performance is bad and the KL is high (on the order of 10^3)....
By the way, this is my training curve. I did not train the length predictor (I just used the ground-truth length).
Thanks for the quick reply!😁 I add the speaker embedding to the text embedding (as I think Z can be viewed as a style mapping from text X to mel Y,...
Yes! I am going to try their conditioning method. If it succeeds, I will share the results.😊
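For readers following along, here is a minimal sketch of what "adding the speaker embedding to the text embedding" could look like. It assumes a lookup table of per-speaker vectors with the same width as the text embedding, broadcast over the time axis. The function name, the shapes, and the use of NumPy are illustrative assumptions, not the repository's actual code.

```python
import numpy as np

def add_speaker_embedding(text_emb, speaker_table, speaker_ids):
    """Hypothetical helper (not from the repository).

    text_emb:      [batch, time, dim] text encoder output
    speaker_table: [n_speakers, dim] one embedding vector per speaker
    speaker_ids:   [batch] integer speaker index for each utterance
    """
    spk = speaker_table[speaker_ids]      # [batch, dim]
    # Broadcast the speaker vector over the time axis and add it.
    return text_emb + spk[:, None, :]     # [batch, time, dim]

rng = np.random.default_rng(0)
text_emb = rng.normal(size=(2, 5, 8))     # 2 utterances, 5 tokens, 8 dims
speaker_table = rng.normal(size=(4, 8))   # 4 speakers
speaker_ids = np.array([1, 3])
conditioned = add_speaker_embedding(text_emb, speaker_table, speaker_ids)
```

Every time step of an utterance receives the same speaker offset, so the speaker identity reaches whatever consumes the text embedding without changing its shape.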
Hello! I just found what might be a mistake in the code! In `VAENAR.py`  But in `posterior.py`  I'm trying to train the multi-speaker version again to see...
Hello, I mean that the positions of Mu and Logvar are swapped.
Thanks for your reply! I understand now that the two variable names are simply interchanged! My main problem with multi-speaker training is that the prior does not converge. The posterior...
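As a hedged illustration of when such a swap would matter: if the function that produces the statistics and the code that unpacks them agree on the order, the names alone change nothing. A real bug appears only when the unpacking order differs. The functions below are hypothetical and are not taken from `VAENAR.py` or `posterior.py`.

```python
import numpy as np

def posterior_stats(h):
    """Hypothetical posterior head: returns (mu, logvar), in that order."""
    mu, logvar = np.split(h, 2, axis=-1)
    return mu, logvar

def sample(mu, logvar, eps):
    # Reparameterization: z = mu + sigma * eps, with sigma = exp(0.5 * logvar).
    return mu + np.exp(0.5 * logvar) * eps

h = np.array([[0.0, 1.0, -2.0, -4.0]])   # mu = [0, 1], logvar = [-2, -4]
eps = np.ones((1, 2))

# Caller unpacks in the same order the function returns: consistent.
mu, logvar = posterior_stats(h)
z_ok = sample(mu, logvar, eps)

# Caller unpacks in the opposite order: the mean is treated as a log-variance.
logvar_wrong, mu_wrong = posterior_stats(h)
z_swapped = sample(mu_wrong, logvar_wrong, eps)
```

Comparing `z_ok` and `z_swapped` shows that only the order mismatch changes the sample, which is a quick way to check whether a suspected swap is cosmetic or not.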