diffusers issues

Latents / seeds are a mess. Make it easier to replicate a generated image using a seed.

10

**Problem:** We often generate images with a batch_size >1. However, images in the batch (after the first image) by default have an **seed that is unknown to the user**, so...

exo-pla-net

Simplify CrossAttention to run on Apple Neural Engine

3

**Is your feature request related to a problem? Please describe.** I'm trying to convert portions of unet into CoreML. However, CrossAttention fails to compile to the Apple Neural Engine. **Describe...

MatthewWaller

Running revision="fp16", torch_dtype=torch.float16 on mps M1

### Describe the bug I'm using the following code: ``` !pip install diffusers !pip install transformers scipy ftfy pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4", revision="fp16", torch_dtype=torch.float16, use_auth_token=True) prompt = "Pineapple on a white...

MatthewWaller

bug

(Textual Inversion) Initialise Vector For New Token From Multiple Existing Tokens

4

I'd like to propose an idea analagous to https://github.com/huggingface/diffusers/issues/369. The current fine tuning script for textual inversion initialises the new `placeholder_token`'s embedding with an existing `initializer_token` (and enforces that the...

rsomani95

Push to Hub Design

5

It would be nice to discuss a bit the push to hub design of the library. IMO we have two different use cases for `push_to_hub`. 1. The complement of `from_pretrained(...)`....

patrickvonplaten

Represent learnt concept in textual inversion with more than one token

10

### Describe the bug As we discuss in #266 > The original textual inversion support [using more than one vector](https://github.com/rinongal/textual_inversion/blob/main/ldm/modules/embedding_manager.py#L39) to represent the learnt concept. For the current implementation, if...

Luvata

enhancement

Parameter Initialization doesn't match with Latent Diffusion Model

3

### Describe the bug Thanks for releasing this great work, it really makes using diffusion a easy thing! But later when I tried to train a `UNet2DConditionModel` from scratch, I...

Karbo123

bug

Added multitoken training for textual inversion. Issue 369

6

I used multiple tokens to represent a concept by adding num_vec_per_token number of tokens to the tokenizer which can be initialized with the initial_token. The tokens would be labeled as...

isamu-isozaki

Textual Inversion training notebook only takes in remote images

7

The [Textual Inversion training notebook](https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/sd_textual_inversion_training.ipynb) in Google Colab only takes in remote images, instead of also being able to take in local images either uploaded from the desktop or linked...

minimaxir

stale

Standardize on using `image` argument in all pipelines

2

This standardizes the use of the argument `image` in all pipelines instead of a mix of `init_image` and `image`. Resolves #1257

fboulnois

diffusers
diffusers copied to clipboard

Metadata

Latents / seeds are a mess. Make it easier to replicate a generated image using a seed.

Simplify CrossAttention to run on Apple Neural Engine

Running revision="fp16", torch_dtype=torch.float16 on mps M1

(Textual Inversion) Initialise Vector For New Token From Multiple Existing Tokens

Push to Hub Design

Represent learnt concept in textual inversion with more than one token

Parameter Initialization doesn't match with Latent Diffusion Model

Added multitoken training for textual inversion. Issue 369

Textual Inversion training notebook only takes in remote images

Standardize on using `image` argument in all pipelines

← Metadata

Owner

Metadata

diffusers diffusers copied to clipboard

Metadata

← Metadata

Owner

Metadata

diffusers
diffusers copied to clipboard