VisionU

Results 2 issues of VisionU

In first stage, In oder to train model easy, we remove posenet, use VAE encoder instead(cat with noise like in-painting SD). Now it train 13000 step, texture is much different...

Very good job! I run you code in Colab,use anyone-video-2 kpts in your lib, just choose my reference img, but the results seem to no good, can you check it?...