Lazy loading

Open Ar57m opened this issue 2 years ago • 0 comments

can you guys implement it on the app mlcchat as llama.cpp? cause in low ram devices it crashes instantly when trying to generate text

Sep 08 '23 00:09 Ar57m