Can someone explain why the posterior `z` is restricted to a diagonal Gaussian?

Open seekerzz opened this issue 4 years ago • 1 comments

Maybe I do not understand this paper thoroughly, but can someone explain this? The posterior z is modelled as a diagonal Gaussian, and the zero initialization part ensures that the posterior distribution is a simple normal distribution. If it is a simple distribution, why is a complex prior flow needed to learn its distribution?

Sep 18 '21 02:09 seekerzz

The zero initialization only ensures that the posterior distribution is a normal distribution at the start of training. During training, the posterior distribution becomes more and more complex as we update the parameters.
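To make the point about initialization concrete, here is a minimal sketch in plain Python. It is not the repository's code: the linear "posterior head", the function name `posterior_params`, the dimensions, and the hand-picked "trained" weights are all assumptions made for illustration. It only shows that zero-initialized weights give a standard normal posterior for every input, and that any non-zero weights make the posterior parameters depend on the input.

```python
import math


def posterior_params(h, W_mu, b_mu, W_logvar, b_logvar):
    """Hypothetical diagonal Gaussian head: map features h to per-dimension (mu, sigma)."""
    mu = [sum(w * x for w, x in zip(row, h)) + b for row, b in zip(W_mu, b_mu)]
    logvar = [sum(w * x for w, x in zip(row, h)) + b for row, b in zip(W_logvar, b_logvar)]
    sigma = [math.exp(0.5 * lv) for lv in logvar]
    return mu, sigma


def zeros(rows, cols):
    return [[0.0] * cols for _ in range(rows)]


latent_dim, hidden_dim = 2, 3

# Two different (made-up) encoder feature vectors.
h_a = [0.5, -1.0, 2.0]
h_b = [-3.0, 0.2, 1.0]

# Zero initialization: all weights and biases of the head are zero.
W0, b0 = zeros(latent_dim, hidden_dim), [0.0] * latent_dim
mu_a, sigma_a = posterior_params(h_a, W0, b0, W0, b0)
mu_b, sigma_b = posterior_params(h_b, W0, b0, W0, b0)
# Both inputs get mu = 0 and sigma = 1, i.e. a standard normal posterior.

# Stand-in for "after some parameter updates": arbitrary non-zero weights (assumed values).
W_mu_t = [[0.3, -0.2, 0.1], [0.0, 0.5, -0.4]]
W_lv_t = [[0.1, 0.0, -0.2], [0.2, 0.1, 0.0]]
mu_a_t, sigma_a_t = posterior_params(h_a, W_mu_t, b0, W_lv_t, b0)
mu_b_t, sigma_b_t = posterior_params(h_b, W_mu_t, b0, W_lv_t, b0)
# Now the mean and scale differ from input to input.

print("init:   ", mu_a, sigma_a, "|", mu_b, sigma_b)
print("updated:", mu_a_t, sigma_a_t, "|", mu_b_t, sigma_b_t)
```

In this toy setup the standard normal posterior is only the starting point; once the weights move away from zero, each input gets its own mean and variance.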

Oct 01 '21 21:10 XuezheMax