冯祥卫
It depends on how much GPU memory you have.
Have you solved it? I'm running into the same problem.
Hi, it's related to the size of the input image.
> Try making the width and height multiples of 64.
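For anyone who wants to try the multiple-of-64 suggestion, here is a minimal sketch that snaps a target resolution to the nearest multiple of 64 before resizing the input. The helper names `round_to_multiple` and `snap_resolution` are made up for illustration and are not part of this repo or of diffusers.

```python
def round_to_multiple(value: int, multiple: int = 64) -> int:
    """Round a positive size to the nearest multiple (ties round up)."""
    if value <= 0 or multiple <= 0:
        raise ValueError("value and multiple must be positive")
    # Integer arithmetic avoids Python's banker's rounding in round().
    snapped = (value + multiple // 2) // multiple * multiple
    # Never return 0 for very small inputs.
    return max(multiple, snapped)


def snap_resolution(width: int, height: int, multiple: int = 64) -> tuple[int, int]:
    """Hypothetical helper: snap both dimensions to a multiple of `multiple`."""
    return round_to_multiple(width, multiple), round_to_multiple(height, multiple)


if __name__ == "__main__":
    print(snap_resolution(1000, 600))
```

Resize the image to the snapped size before passing it to the pipeline. If memory is the concern, rounding down instead of to the nearest multiple is the safer choice.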
> When I fine-tune the U-Net with Accelerator's DeepSpeed integration, I still run out of GPU memory even with batch_size=1.

How much GPU memory do you have?
Looking at https://github.com/huggingface/diffusers/blob/main/src/diffusers/pipelines/stable_video_diffusion/pipeline_stable_video_diffusion.py, it seems the SVD inference code only applies CFG to the image prompt.
See Section 3.2.3 of "Hierarchical Masked 3D Diffusion Model for Video Outpainting". I think this training code uses CFG on two conditions, so the corresponding changes should be made at the inference stage.
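To make the difference concrete, here is a rough sketch of single-condition CFG next to one common way of combining two conditions (the nested form used for InstructPix2Pix-style guidance). This is an assumption for illustration only: it is not necessarily the formula from Section 3.2.3 of the paper, nor what this repo's training code expects. The function and argument names are hypothetical, and the inputs can be floats, NumPy arrays, or torch tensors.

```python
def cfg_single(eps_uncond, eps_cond, scale):
    """Standard classifier-free guidance with one condition."""
    return eps_uncond + scale * (eps_cond - eps_uncond)


def cfg_two_conditions(eps_none, eps_first, eps_both, scale_first, scale_second):
    """Nested two-condition guidance (an assumed form, see the note above).

    eps_none:  prediction with both conditions dropped
    eps_first: prediction with only the first condition kept
    eps_both:  prediction with both conditions kept
    """
    return (
        eps_none
        + scale_first * (eps_first - eps_none)
        + scale_second * (eps_both - eps_first)
    )


if __name__ == "__main__":
    # With both scales at 1.0 the result reduces to the fully conditioned prediction.
    print(cfg_two_conditions(0.0, 1.0, 2.0, 1.0, 1.0))
```

Note that the two-condition version needs three forward passes per step instead of two, so inference would get slower and use more memory.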
You could try training the UNet.
I ran into the same problem: "watermark" and "bilibili" text showed up in perfectly clean videos. It's probably a training data issue.