stable-diffusion.cpp

CUDA build file size

Open DarthAffe opened this issue 1 year ago • 3 comments

Something happened to the size of the CUDA build: it absolutely exploded. Is it expected or intended to be this big in the latest release?

DarthAffe avatar Jul 28 '24 14:07 DarthAffe

That is probably due to optimizations/specializations done upstream in llama.cpp, most of which should be unused here. It's also an N×M problem: the build emits code for N quantization specializations for each of M CUDA architectures.

Green-Sky avatar Jul 28 '24 14:07 Green-Sky
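To make the N×M point concrete, here is a minimal sketch that simply counts the (quantization, architecture) pairs a multi-architecture binary would carry. The lists below are illustrative placeholders, not the project's actual build configuration, and the count ignores everything else that affects binary size.

```python
from itertools import product

# Illustrative values only (assumed for this sketch, not read from the build).
QUANT_TYPES = ["q4_0", "q4_1", "q5_0", "q5_1", "q8_0"]  # N specializations
CUDA_ARCHS = ["52", "61", "70", "75", "80"]              # M target architectures


def emitted_variants(quant_types, archs):
    """Return every (quantization, architecture) pair that would be compiled."""
    return list(product(quant_types, archs))


# Building for all architectures multiplies the kernel count by M.
all_variants = emitted_variants(QUANT_TYPES, CUDA_ARCHS)

# Building for a single architecture keeps only N variants.
single_arch = emitted_variants(QUANT_TYPES, CUDA_ARCHS[-1:])

print(len(all_variants), len(single_arch))
```

Under these assumptions, trimming either list shrinks the product. CMake's standard `CMAKE_CUDA_ARCHITECTURES` variable is the usual knob for M; whether this project's build honors it unchanged is not confirmed in this thread.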

Hmm, OK, but it would be really great if this could be addressed at least partially, since a 7.5× increase in size seems a bit unreasonable. (I mostly care because of a practical problem: it is now too big to distribute in the C# ecosystem :p)

DarthAffe avatar Jul 28 '24 15:07 DarthAffe

#395 may help here.

ring-c avatar Sep 18 '24 09:09 ring-c