diffusers icon indicating copy to clipboard operation
diffusers copied to clipboard

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Results 1293 diffusers issues
Sort by recently updated
recently updated
newest added

### Describe the bug When I use with torch.autocast("cuda"): images = pipe(...) the output is noise ![validation_0_fe0c3de4f9a024ad739f](https://github.com/huggingface/diffusers/assets/61224076/0529f04e-7e04-499a-b6ef-26e32b4bf033) ### Reproduction import torch from diffusers import PixArtAlphaPipeline pipe = PixArtAlphaPipeline.from_pretrained("PixArt-alpha/PixArt-XL-2-1024-MS", torch_dtype=torch.float16) pipe.enable_model_cpu_offload()...

bug

### Describe the bug I've tried to combine InstructPix2Pix model and IP-Adapter (`pipe.load_ip_adapter("h94/IP-Adapter", subfolder="models", weight_name="ip-adapter_sd15.bin"`), but I always get `RuntimeError: mat1 and mat2 shapes cannot be multiplied (771x1280 and 1024x3072)`....

bug

### Describe the bug mask ![download-30](https://github.com/huggingface/diffusers/assets/1147704/96ef8de3-88f8-47a7-8c48-a83c567d09fc) ![download-29](https://github.com/huggingface/diffusers/assets/1147704/0286b4b3-0fe6-439c-b4b7-95dec25ee077) i got mask with this ``` def make_inpaint_condition (image, image_mask): image = np.array(image.convert("RGB")).astype(np.float32) / 255.0 image_mask = np.array(image_mask.convert("L")).astype(np.float32) / 255.0 assert image.shape[0:1] ==...

bug

# What does this PR do? Fixes #7011 ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if...

# What does this PR do? There are a couple of differences between transformers workflows and ours. e.g. We don't have a clone our repo in our docker image. Update...

# What does this PR do? Fix an inconsistency between `from_pretrained` and `save_pretrained` within the Multicontrolnet class. The `from_pretrained` function returns a directory structure like this: controlnet, controlnet_1, controlnet_2, whereas...

# What does this PR do? I found that for some model weights, the key values in the dictionary are not applied in order, and this can cause a potential...

Thanks for your great job!!! Now, I know how to infer GLIGEN with diffusers[https://github.com/gligen/diffusers/tree/gligen/examples/gligen](url). But how can I train GLIGEN with diffusers like ControlNet[https://github.com/huggingface/diffusers/blob/main/examples/controlnet/train_controlnet.py](url)? Thanks again.

I believe the current lack of easy access to VAE training is stopping diffusion models from disrupting even more industries. I'm talking about consistent details on things that are less...

community-examples
Good second issue

# What does this PR do? Added support for SDXL finetune on AscendNPU and fixed the bug causing the hang out when saving models using the deepspeed distributed framework. DeepSpeed...