diffusers [Design Discussion] allowing `from_pretrained()` to also load single file checkpoints

Since we were considering adding an option like single_file_format to save_pretrained() of DiffusionPipeline, it makes sense to have something similar in from_pretrained() to have better feature parity.

We currently support loading single file checkpoints in DiffusionPipeline via from_single_file(). Some examples below:

import torch
from diffusers import StableDiffusionPipeline

# Download pipeline from huggingface.co and cache.
pipeline = StableDiffusionPipeline.from_single_file(
    "https://huggingface.co/WarriorMama777/OrangeMixs/blob/main/Models/AbyssOrangeMix/AbyssOrangeMix.safetensors"
)

# Load pipeline from a local file
# (assumes the checkpoint was downloaded to ./v1-5-pruned-emaonly.ckpt)
pipeline = StableDiffusionPipeline.from_single_file("./v1-5-pruned-emaonly.ckpt")

# Enable float16 and move to GPU
pipeline = StableDiffusionPipeline.from_single_file(
    "https://huggingface.co/runwayml/stable-diffusion-v1-5/blob/main/v1-5-pruned-emaonly.ckpt",
    torch_dtype=torch.float16,
)
pipeline.to("cuda")

(Taken from the docs here)

Proposed API design

Calling from_pretrained() on a DiffusionPipeline requires users to pass pretrained_model_name_or_path, which can be a repo id on the Hub or a local directory containing checkpoints in the diffusers format.

(Docs)

Now, if we want to add support for loading a compatible single file checkpoint in from_pretrained(), we could have an API like so:

from diffusers import DiffusionPipeline

repo_id = "WarriorMama777/OrangeMixs"
pipe = DiffusionPipeline.from_pretrained(repo_id, weight_name="Models/AbyssOrangeMix/AbyssOrangeMix.safetensors")

Like before, repo_id could either be an actual repo id on the Hub or a local directory.
weight_name can either be just the filename of the single file checkpoint to be loaded or the relative path to the checkpoint (w.r.t the underlying repo / directory).
When weight_name is provided in from_pretrained():
- We immediately check whether the file exists in the repository or the directory and raise an error if it's not found.
- Once it's checked, we hit the codepath that we're hitting currently when using from_single_file(). Logic to do that should be completely separated as a utility and should not come into from_pretrained(). We can just call the utility from from_pretrained().
- How can we detect errors here as early as possible? What if the checkpoint is not compatible or doesn't have all the components we need (what if the vae or any other component is missing)? Is there any robust way?
Once this support is foolproof, we can start deprecating the use of from_single_file().
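
A minimal sketch of the two early checks described above, assuming a local directory and a Stable Diffusion v1-style checkpoint layout. The helper names `resolve_weight_file` and `find_missing_components` are hypothetical and not part of diffusers; the key prefixes are the ones used by original SD v1 checkpoints, and other model families would likely need a different map.

```python
from pathlib import Path

# Key prefixes found in original Stable Diffusion v1-style checkpoints.
# Assumption: other model families use different layouts.
SD_V1_COMPONENT_PREFIXES = {
    "unet": "model.diffusion_model.",
    "vae": "first_stage_model.",
    "text_encoder": "cond_stage_model.",
}


def resolve_weight_file(model_dir, weight_name):
    """Return the checkpoint path inside a local directory, failing early if absent.

    Hypothetical helper: `weight_name` may be a bare filename or a path
    relative to `model_dir`.
    """
    candidate = Path(model_dir) / weight_name
    if not candidate.is_file():
        raise FileNotFoundError(f"{weight_name!r} not found under {str(model_dir)!r}")
    return candidate


def find_missing_components(state_dict_keys, prefixes=SD_V1_COMPONENT_PREFIXES):
    """List components for which no checkpoint key carries the expected prefix."""
    keys = list(state_dict_keys)
    return [
        name
        for name, prefix in prefixes.items()
        if not any(key.startswith(prefix) for key in keys)
    ]
```

Checking key prefixes only needs the checkpoint's key names, not the tensors, so it could run before any conversion work starts. It would not catch shape mismatches or partially saved components.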

Some thoughts

I don't think this is a very new design. Users are already familiar with weight_name and how it's to be used through load_lora_weights() (which is quite popular at this point IMO).
I think we must force users to pass weight_name. Too much intelligent guessing here would complicate the code, and I am not sure it's worth the effort.

Jan 05 '24 02:01 sayakpaul

Cc: @patrickvonplaten @DN6

Jan 05 '24 02:01 sayakpaul

weight_name or file_name makes sense to me! Let's maybe make sure we have the same loading logic here as in load_lora

Jan 05 '24 12:01 patrickvonplaten

> Let's maybe make sure we have the same loading logic here as in load_lora

Elaborate a bit?

Jan 05 '24 12:01 sayakpaul

Hello, a dev from SDNext here,

FromSingleFileMixin refers to download_from_original_stable_diffusion_ckpt, but the former does not accept a state_dict and the latter does.

While you are redesigning this workflow, would there be a clean way to enable a state_dict to be passed to FromSingleFileMixin to bypass loading from disk/hub?

Use case: Multiple state_dict objects stored in RAM for quick swap to the active pipeline.

Jan 08 '24 16:01 AI-Casanova

Thanks for letting us know about it. What do you mean by the former?

Jan 08 '24 16:01 sayakpaul

@sayakpaul FromSingleFileMixin breaks the ability to pass a state_dict here https://github.com/huggingface/diffusers/blob/e0f349c2b07975810b7c4faeeafe2f4124f3cfc9/src/diffusers/loaders/single_file.py#L234

Whereas the downstream function https://github.com/huggingface/diffusers/blob/e0f349c2b07975810b7c4faeeafe2f4124f3cfc9/src/diffusers/pipelines/stable_diffusion/convert_from_ckpt.py#L1134 can receive a state_dict

Jan 08 '24 16:01 AI-Casanova

I haven't fully worked through the implications, but it appears that the pathing and downloading could be bypassed with a check like `if isinstance(pretrained_model_link_or_path, dict)`, and that keyword overrides for vae etc. should still function.
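
A rough sketch of that bypass, assuming the loader receives either a path/URL string or an in-memory state dict. `resolve_checkpoint` and the `load_from_disk` callable are hypothetical stand-ins for illustration, not diffusers functions.

```python
def resolve_checkpoint(pretrained_model_link_or_path, load_from_disk):
    """Return a state dict, skipping path handling when one is passed directly.

    `load_from_disk` is a hypothetical callable that takes a path or URL and
    returns a state dict (for example, a wrapper around a safetensors reader).
    """
    if isinstance(pretrained_model_link_or_path, dict):
        # Already in memory: no path resolution, download, or file read needed.
        return pretrained_model_link_or_path
    return load_from_disk(pretrained_model_link_or_path)
```

With this shape, a caller holding several state dicts in RAM could pass the one it wants directly, and the disk loader would only be invoked for string inputs.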

Jan 08 '24 16:01 AI-Casanova

Yes, that is exactly what we will do. Similar to what we do in load_lora_weights().

Keep sharing your inputs with us, it's very helpful!

Jan 09 '24 11:01 sayakpaul

> Let's maybe make sure we have the same loading logic here as in load_lora
>
> Elaborate a bit?

Same function signature names, same loading functions (also cc @DN6 here), as we discussed. This PR is very much related btw: https://github.com/huggingface/diffusers/pull/6428

Jan 09 '24 14:01 patrickvonplaten

Yes, definitely. Makes sense to work on this issue after #6428 is merged?

Jan 09 '24 14:01 sayakpaul

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

Feb 04 '24 15:02 github-actions[bot]

Since #6428 is merged, perhaps it's a good time to follow up here?

Feb 04 '24 15:02 vladmandic

Yes, this is on my mind. Will start working on this soon!

Feb 04 '24 17:02 sayakpaul

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

Mar 27 '24 15:03 github-actions[bot]

@yiyixuxu @sayakpaul not stale?

Mar 27 '24 15:03 vladmandic

No need to consider this, as @DN6 has worked a great deal on making from_single_file() more and more robust. Closing this.

Nov 21 '24 15:11 sayakpaul