VBench icon indicating copy to clipboard operation
VBench copied to clipboard

generating videos from images

Open cyy-1234 opened this issue 1 year ago • 9 comments

Hello, author. After reading your paper, I have some doubts about certain details. May I directly input a video clip (without any prompts) for evaluation? Is your method applicable to generating videos from images? Looking forward to your response.

cyy-1234 avatar Feb 18 '24 09:02 cyy-1234

Thank you for your interest in our work. VBench provides video quality dimensions that are relevant to the input text. You can only test relevant dimensions. [   "subject consistency",   "background consistency",   "temporal flickering",   "motion smoothness",   "aesthetic quality",   "imaging quality",   "dynamic degree",]

yinanhe avatar Feb 18 '24 09:02 yinanhe

hello,Are there any requirements for the naming of the generated video, such as '{prompt}-{index}.mp4'? Is this format necessary?

cyy-1234 avatar Feb 18 '24 09:02 cyy-1234

image hi,Why do we need to subtract one?

cyy-1234 avatar Feb 19 '24 05:02 cyy-1234

hello,Are there any requirements for the naming of the generated video, such as '{prompt}-{index}.mp4'? Is this format necessary?

We have not updated the code to read frames directly for the time being, so such naming is necessary, or you can modify the code to read in frames as needed

yinanhe avatar Feb 19 '24 05:02 yinanhe

image hi,Why do we need to subtract one?

When calculating the subject consistency dimension, the consistency between the current frame and the previous frame and the consistency between the current frame and the first frame are calculated, the total number of computations is num_frames -1

yinanhe avatar Feb 19 '24 05:02 yinanhe

Hi,author, image I have successfully executed the evaluation with only video, and the results for 'background_consistency,' 'temporal_flickering,' 'motion_smoothness,' 'aesthetic_quality,' 'imaging_quality,' and 'dynamic_degree' seem to be fine. image The only area of confusion lies in the 'subject_consistency' aspect,Two numerical values are inconsistent. image

cyy-1234 avatar Feb 19 '24 06:02 cyy-1234

image image image

Hi, when I run multiple videos at once to calculate aesthetic_quality, I always get the error 'CUDA out of memory.' What could be the reason? According to my understanding, the model should only load one video at a time, and the size of the input videos is small.

cyy-1234 avatar Feb 19 '24 08:02 cyy-1234

This does not seem normal, you need to check your local environment

yinanhe avatar Feb 27 '24 08:02 yinanhe

Hello, author. After reading your paper, I have some doubts about certain details. May I directly input a video clip (without any prompts) for evaluation? Is your method applicable to generating videos from images? Looking forward to your response.

@cyy-1234 Hi, please check out VBench-I2V for evaluating image-to-video models: https://github.com/Vchitect/VBench/tree/master/vbench2_beta_i2v

ziqihuangg avatar Apr 19 '24 07:04 ziqihuangg

Hi, we're closing this issue as it appears your questions have been addressed. However, feel free to open a new issue or reopen this one, if you have further questions or if anything else comes up related to this issue.

ziqihuangg avatar Jun 04 '24 02:06 ziqihuangg

image image image

Hi, when I run multiple videos at once to calculate aesthetic_quality, I always get the error 'CUDA out of memory.' What could be the reason? According to my understanding, the model should only load one video at a time, and the size of the input videos is small.

Hi, have you solved this question? I have the same problem.

Crazy-wu-20 avatar Jun 14 '24 13:06 Crazy-wu-20

Hi, may I ask what's your video length (in seconds)? It's possible that the frame count caused OOM

ziqihuangg avatar Aug 01 '24 07:08 ziqihuangg