Apply LoRA by model patching
Rewrite LoRA application to use model patching, which gives us two benefits (see the sketch after this list):
- During model execution, the forward pass runs only on the patched model weights, while with hooks we have to compute the output of the model and of each LoRA separately.
- Since the LoRA weights are merged into the model weights, there is no need to keep the LoRAs in VRAM during inference.
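To illustrate the idea, here is a minimal sketch of weight patching in PyTorch. The `{layer_name: (up, down)}` LoRA layout and the helper names are assumptions for illustration, not InvokeAI's actual implementation:

```python
import torch

def patch_lora_weights(model: torch.nn.Module, loras: list[dict], scales: list[float]) -> dict:
    """Merge LoRA deltas directly into the model weights.

    Returns the original weights so they can be restored after generation.
    Assumed LoRA layout: {layer_name: (up, down)} with up (out, rank), down (rank, in).
    """
    originals: dict[str, torch.Tensor] = {}
    for lora, scale in zip(loras, scales):
        for key, (up, down) in lora.items():
            module = model.get_submodule(key)
            if key not in originals:
                originals[key] = module.weight.detach().clone()
            # W' = W + scale * (up @ down): after this, the forward pass costs the
            # same as the plain model, and the LoRA tensors can be freed from VRAM.
            delta = (up @ down).to(device=module.weight.device, dtype=module.weight.dtype)
            module.weight.data += scale * delta
    return originals

def unpatch_lora_weights(model: torch.nn.Module, originals: dict) -> None:
    """Restore the pre-patch weights once generation finishes."""
    for key, weight in originals.items():
        model.get_submodule(key).weight.data.copy_(weight)
```

With hooks, each patched layer would instead run its own `up @ down` projection on every forward call, which explains both the slowdown and the extra VRAM in the measurements below.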
Results:

Speed:
| LoRA count | hooks (it/s) | patching (it/s) |
|---|---|---|
| 0 | ~4.92 | ~4.92 |
| 1 | ~3.51 | ~4.89 |
| 2 | ~2.76 | ~4.92 |
VRAM:
| LoRA count | hooks (GB) | patching (GB) |
|---|---|---|
| 0 | ~3.6 | ~3.6 |
| 1 | ~4.0 | ~3.6 |
| 2 | ~4.4 | ~3.7 |
This PR is based on #3547, so it should wait for that one to merge first.