pavel schudel

Results 3 comments of pavel schudel

I'm having the same error with `llmware/bling-sheared-llama-1.3b-0.1` on a 24 GB RTX 3090. When I load the model, it takes almost all the memory, even though it should take far less than 24 GB....
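As a rough sanity check on that expectation, here is a minimal sketch of the weight-memory arithmetic. It assumes about 1.3 billion parameters (read off the "1.3b" in the model name, not verified) and counts weights only, ignoring activations, KV cache, and any memory the serving framework may reserve up front. The helper `weight_memory_gib` is hypothetical, written just for this illustration.

```python
# Back-of-the-envelope estimate of GPU memory needed for model weights alone.
# Does NOT include activations, KV cache, CUDA context, or framework preallocation.

def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Return the size of the weights in GiB for a given parameter count and dtype width."""
    return num_params * bytes_per_param / 1024**3

# Assumption: parameter count inferred from "1.3b" in the model name.
NUM_PARAMS = 1.3e9

for dtype, nbytes in {"float32": 4, "float16": 2}.items():
    print(f"{dtype}: ~{weight_memory_gib(NUM_PARAMS, nbytes):.1f} GiB")
```

Under these assumptions the weights alone come to only a few GiB in either precision, so usage near the full 24 GB would have to come from something other than the weights themselves.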

Do you have any estimate of when it will be?