Yuqian Hong
Yuqian Hong
### Describe the bug I have tried to train a controlnet with lora, but after I merge the controlnet model and lora, the weight of controlnet seems not changed. (maybe...
I am interested in lightweighting ControlNet, and I wonder how to do that. Is there any train script?
Hello, I am currently exploring the impact of separating the image encoder in our model architecture on both the training process and the quality of generated images. Specifically, I am...
sFID is a metric for evaluating the quality of generated images, and its calculation method is similar to FID (Fréchet Inception Distance). However, it utilizes features from other layers of...
### Model/Pipeline/Scheduler description MAGI-1 is a video generation model that has achieved stunning visual effects. ### Open source status - [x] The model implementation is available. - [x] The model...
TODO for this PR: - Add support for multiple models, including InternVL2.5, Idefics2, GLM-4V, etc.; - ~~Define a consistent model naming convention and update the corresponding `DataCollator` and `ModelLoader` accordingly;~~...
This PR is a continuation of the following discussion https://github.com/huggingface/diffusers/issues/4679 https://github.com/huggingface/diffusers/pull/4899, and it addresses the following issues: 1. Loading SAI's control-lora files and enabling controlled image generation. 2. Building a...