It seems that at least one of the inputs must have `requires_grad=True` for [torch.utils.checkpoint](https://pytorch.org/docs/stable/checkpoint.html) to work. A simple workaround for UNet-only training is to set `unet.conv_in.requires_grad_(True)`.
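For illustration, a minimal sketch of this workaround in a diffusers-style setup with a frozen base UNet (the checkpoint id and the frozen-base assumption are mine, not from the thread):

```python
from diffusers import UNet2DConditionModel

# Example checkpoint id; any diffusers UNet behaves the same way.
unet = UNet2DConditionModel.from_pretrained(
    "runwayml/stable-diffusion-v1-5", subfolder="unet"
)

# With the base weights frozen (as in LoRA training), no tensor entering the
# checkpointed blocks requires grad, so torch.utils.checkpoint builds no graph
# and the trainable parameters never receive gradients.
unet.requires_grad_(False)
unet.enable_gradient_checkpointing()

# Workaround: make the very first conv trainable so every downstream
# checkpointed block sees an input with requires_grad=True.
unet.conv_in.requires_grad_(True)
```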
> I face the same problem when enabling gradient checkpointing. Is there a way to solve this under joint text-encoder and UNet training? Kohya's repo seems to have solved...
> Interesting, but what if you didn't want to train the embeddings? In the case of LoRA, the embedding parameters are not passed to the optimizer. Therefore, it is not...
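A sketch of that trick (my wording, assuming a transformers `CLIPTextModel` as the text encoder): the embeddings get `requires_grad=True` purely so checkpointing works, but they are left out of the optimizer's parameter list and are therefore never updated.

```python
from transformers import CLIPTextModel

# Example checkpoint id for illustration.
text_encoder = CLIPTextModel.from_pretrained("openai/clip-vit-large-patch14")

text_encoder.requires_grad_(False)            # frozen base, LoRA-style
text_encoder.gradient_checkpointing_enable()

# Give gradient checkpointing a grad-requiring input at the very start of
# the network. The embedding weights are NOT handed to the optimizer, so
# they accumulate (unused) grads but are never updated.
text_encoder.text_model.embeddings.requires_grad_(True)

# Only the LoRA parameters (created elsewhere; hypothetical here) would be
# optimized, e.g.:
# optimizer = torch.optim.AdamW(lora_parameters, lr=1e-4)
```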
Maybe p. 14 of https://arxiv.org/abs/2202.00512 ("Progressive Distillation for Fast Sampling of Diffusion Models").
> Let's fix the tests :)

@sayakpaul I removed the meaningless spaces for the test. Are there any other changes required?
@sayakpaul Is this error relevant to this PR?

```
=========================== short test summary info ============================
FAILED tests/pipelines/unidiffuser/test_unidiffuser.py::UniDiffuserPipelineFastTests::test_attention_slicing_forward_pass - requests.exceptions.ReadTimeout: (ReadTimeoutError("HTTPSConnectionPool(host='huggingface.co', port=443): Read timed out. (read timeout=10)"), '(Request ID: 869e35f5-5627-4acf-9853-7187ce7d0656)')
====...
```
DeepSpeed does not seem to support 8bitAdam: https://github.com/huggingface/diffusers/pull/735. I got train_db.py working with DeepSpeed by removing the 8bitAdam option. DeepSpeed offloads optimizer states to the CPU, so VRAM usage is reduced...
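A sketch of that change (the flag names and the `build_optimizer` helper are placeholders for whatever the training script actually uses):

```python
import torch


def build_optimizer(params_to_optimize, lr, use_8bit_adam, deepspeed_enabled):
    """DeepSpeed (which offloads optimizer states to CPU) does not seem to
    support bitsandbytes' 8-bit Adam, so fall back to plain AdamW when
    DeepSpeed is enabled."""
    if use_8bit_adam and not deepspeed_enabled:
        import bitsandbytes as bnb  # only needed on the 8-bit path
        return bnb.optim.AdamW8bit(params_to_optimize, lr=lr)
    return torch.optim.AdamW(params_to_optimize, lr=lr)
```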
> On Windows? If so, how'd you install DeepSpeed?

Linux.
Same issue as #38.
I implemented it like this: https://github.com/laksjdjf/dezero-diffusion/blob/a223c7e2bb06e149ff0a8b0714fcc88fb38b08b7/modules/unet.py#L10-L40