Classifier Free Guidance for StableVideoDiffusion

Open tykim0507 opened this issue 1 year ago • 0 comments

Hello, I have some questions about the implementation of classifier free guidance for the current I2V Stable Video Diffusion model.

During training, what is the probability of dropping the first-frame latent that is concatenated to the input, and of dropping the image latent that goes into the cross-attention? I read your paper, but I couldn't find the dropout probability for the image condition.
Is linear classifier-free guidance taken into account in the training implementation?
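
To make the two questions concrete, here is a minimal sketch of what I mean. It is not the actual Stable Video Diffusion code. The function names, the choice to zero out a dropped condition, dropping the two conditions independently, and the `p_drop` parameter are all assumptions for illustration; the real value of `p_drop` is exactly what I am asking about. By "linear" guidance I am assuming a guidance scale that increases linearly across frames at sampling time.

```python
import random


def drop_conditions(frame_latent, image_latent, p_drop, rng):
    """Zero out each condition independently with probability p_drop.

    Assumption: p_drop is a hypothetical placeholder, and zeroing is
    only one possible way to represent the "unconditional" input.
    """
    if rng.random() < p_drop:
        frame_latent = [0.0] * len(frame_latent)
    if rng.random() < p_drop:
        image_latent = [0.0] * len(image_latent)
    return frame_latent, image_latent


def linear_guidance_scales(min_scale, max_scale, num_frames):
    """Guidance scale per frame, rising linearly from min_scale to max_scale."""
    if num_frames == 1:
        return [min_scale]
    step = (max_scale - min_scale) / (num_frames - 1)
    return [min_scale + i * step for i in range(num_frames)]


def apply_cfg(uncond, cond, scales):
    """Per-frame classifier-free guidance: uncond + w * (cond - uncond)."""
    return [u + w * (c - u) for u, c, w in zip(uncond, cond, scales)]


if __name__ == "__main__":
    rng = random.Random(0)
    # p_drop=0.1 is an arbitrary example value, not taken from the paper.
    print(drop_conditions([1.0, 2.0], [3.0], p_drop=0.1, rng=rng))
    scales = linear_guidance_scales(1.0, 3.0, num_frames=5)
    # One scalar prediction per frame, just to show the per-frame weighting.
    print(apply_cfg([0.0] * 5, [1.0] * 5, scales))
```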

Thank you

May 14 '24 08:05 tykim0507