OpenDiT icon indicating copy to clipboard operation
OpenDiT copied to clipboard

Does FastSeq support video generation models such as Latte?

Open yhy-2000 opened this issue 1 year ago • 3 comments

Hi, thank you for making this open source.

I've noticed that parameters such as 'sequence_parallel_size' and 'sequence_parallel_group' only appear in 'DiT' modules (such as 'DistAttn') but not in 'Latte' modules. Does this mean that FastSeq supports only image generation but not video generation? If so, could you explain why?

Thanks!!

yhy-2000 avatar Mar 05 '24 13:03 yhy-2000

Another question is, why flashattn and layernorm_kernel is forbidden during sampling? (https://github.com/NUS-HPC-AI-Lab/OpenDiT/blob/c15d82b738d0efb7f8f9e79c2f5277cbb417c8e2/sample.py#L70) Looking forward to you reply. Thanks in advance.

yhy-2000 avatar Mar 05 '24 17:03 yhy-2000

Hi! We are still working on adapting Fastseq to the Latte model and will release it in the future. You can manually set the enable_flashattn to True when sampling. It is just default to False. We will polish it.

KKZ20 avatar Mar 06 '24 07:03 KKZ20

Hi! We are still working on adapting Fastseq to the Latte model and will release it in the future. You can manually set the enable_flashattn to True when sampling. It is just default to False. We will polish it.

Thanks for your fast reply. I have another question: what is the difference between sequence_parallel_type 'longseq' and 'ulysses'?

yhy-2000 avatar Mar 07 '24 09:03 yhy-2000

dsp support now

oahzxl avatar Mar 21 '24 05:03 oahzxl