stable-diffusion.cpp icon indicating copy to clipboard operation
stable-diffusion.cpp copied to clipboard

[Feature] HunyuanVideo-1.5

Open Green-Sky opened this issue 2 months ago • 3 comments

Feature Summary

Support for tencent's new 8.3B video generation model

Detailed Description

In their own words:

HunyuanVideo-1.5 is a video generation model that delivers top-tier quality with only 8.3B parameters, significantly lowering the barrier to usage. It runs smoothly on consumer-grade GPUs, making it accessible for every developer and creator. This repository provides the implementation and tools needed to generate creative videos.

https://huggingface.co/tencent/HunyuanVideo-1.5

Additional context

This achievement is built upon several key components, including meticulous data curation, an advanced DiT architecture with selective and sliding tile attention(SSTA), enhanced bilingual understanding through glyph-aware text encoding , progressive pre-training and post-training, and an efficient video super-resolution network.

Image Image

Also "Flex-Block-Attention": https://github.com/Tencent-Hunyuan/flex-block-attn

Green-Sky avatar Nov 22 '25 12:11 Green-Sky

wip

leejet avatar Nov 22 '25 12:11 leejet

"Requirements: Hopper (SM90) GPUs, or other architectures with SM90 PTX ISA support"

So it's not feasible for most people.

GreenShadows avatar Nov 23 '25 10:11 GreenShadows

"Requirements: Hopper (SM90) GPUs, or other architectures with SM90 PTX ISA support"

So it's not feasible for most people.

That is only really referencing their specific implementation and has not much to do with us.

Green-Sky avatar Nov 23 '25 12:11 Green-Sky