seekerzz

Results 7 comments of seekerzz

Thanks for your reply! Have you tried the multi-speaker situation? I used the code for LibriTTS training. However, the performance is bad and KL is high (at the 10^3 level)....

By the way, this is my training curve ![image](https://user-images.githubusercontent.com/30581485/134841751-47387804-08a6-4527-86a6-5a0e86231203.png) I did not train the length predictor (just using the ground truth length).

Thanks for the quick reply!😁 I add the speaker embedding into the text embedding (as I think Z can be viewed a style mapping from text X to mel Y,...

Yes! I am going to try their conditioning method. If it succeed I will share the result.😊

Hello! I find there might be a mistake in the code (just now)! In `VAENAR.py` ![image](https://user-images.githubusercontent.com/30581485/137303848-9638391b-2440-4a0d-a3d5-92c5d9b1f0b2.png) But in `posterior.py` ![image](https://user-images.githubusercontent.com/30581485/137303869-0f965160-4df0-430c-a7a5-340d73f8a492.png) I'm trying to train the multi-speaker version again to see...

Hello, I mean the position of Mu and Logvar are misplaced.

Thanks for your reply! I have understood that they can replace each other's variable name! My main problem for the multi-speaker training is that the prior cannot converge. The posterior...