ShenXiaolei
ShenXiaolei
Is it divided into 256 buckets based on the amplitude of the optical flow (linear or log)?
Thanks for your great work. Have you ever encounter the phenomenon of overfit?
With unconditional generation during training, should reference embedding concat to the normal_hidden_states?
Is the implementation of variable aspect ratios, resolutions, durations different from that of NaViT? Are there any plans to implement the NaViT?
Tnanks for the great work. Could you please provide the checkpoints of the MobileNet V3 model? Thanks!
at dev brach, init pipeline as following code, but the output image is covered with a red layer `# brushnet-based version unet = UNet2DConditionModel.from_pretrained( "stable-diffusion-v1-5/stable-diffusion-v1-5", subfolder="unet", revision=None, torch_dtype=weight_dtype, ) text_encoder...
detach
why use detach() for promptB?Is it the solution for "Parameter at index 1 with name xxx has been marked as ready twice"?