diffusers
diffusers copied to clipboard
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
# What does this PR do? This PR adds the ability to provide a list of enabled embeddings for prompt conversion with the `TextualInversionLoaderMixin`. Right now, in order to disable...
I read from the hugging face docs that the parameter "num_class_images" refers to the "Minimal" class images for prior preservation loss. If with_prior_preservation = True and there are not enough...
# What does this PR do? Fixes #6158. Attempt at integrating https://github.com/guoyww/AnimateDiff/tree/sdxl. Relevant discussion: https://github.com/huggingface/diffusers/pull/5928#issuecomment-1851737456 Continuation of #6195. Unfortunately, the code base from current main and the one in 6195...
**What API design would you like to have changed or added to the library? Why?** Since the vq diffusion pipeline is deprecated in #6169 (sadly no docs can be found...
# What does this PR do? This PR adds support for the Stable Video Diffusion version of MotionCtrl as a community pipeline. This is the continuation of #6844 to keep...
update
# What does this PR do? - load_dataset data_dir Supports custom local datasets. Just like https://github.com/huggingface/diffusers/pull/5747#event-10922633415 - Update fingerprint hash parameters. Avoid irrelevant parameters causing repeated calculations in the map...
**Is your feature request related to a problem? Please describe.** As an Apple MPS user, it always feels somewhat like we're second-class citizens with respect to the latest and greatest...
### Model/Pipeline/Scheduler description [This paper](https://github.com/CompVis/attribute-control) shows you can control attributes of subjects in a text-to-image model. E.g., for the prompt "A cat and dog", you can change the attributes of...
### Model/Pipeline/Scheduler description Achieving faithful image-to-noise inversion with Denoising Diffusion models remains a challenge, particularly for more recent models trained to generate images with a small number of denoising steps....
### Describe the bug I trained a textual inversion model on SDXL pretrained model ( RealVisXL3.0 ), and then when I want to inference, I use this textual inversion model...