Litong.G comments

Results: 5 comments of Litong.G

Questions about the CausalVideoVAE

Did CausalVideoVAE use 17 frames for training, or was it trained with a variable number of frames? I'm asking because I noticed the same model can perform inference with any...

The training loss does not converge

Are there any examples of generated videos? I didn't see any cases of video training results in the project README.

The density_for_timestep_sampling and loss_weighting for SD3 Training!!!

> currently we're using sigmoid sampling for timesteps which seems fine but no one has really ablated whether it leaves fine details out

Actually, sigmoid and lognorm are mathematically equivalent....

Why not use higher compression ratio in VAE?

Well, any idea to alleviate the ghost artifacts?

> Temporal compression by 8x can result in significant ghosting artifacts, which cannot be reflected in the evaluation metrics.

CogVideo can be trained with images; how do I train it with images?

> The tutorial isn't finished yet; we expect to finish this part this week.

Has there been any progress?