Patrick Devine

426 comments by Patrick Devine

> @pdevine I tried Gemma with 0.1.33-rc5 version. It works now but is slow. I see in the server logs that not all the layers are sent to the GPU....

@necro304 Does this happen with all models, or only llama2? What are the specs for your machine, what version of ollama are you running (use `ollama --version`), and what version...

I believe this ended up being fixed a while ago. The most recent version of ollama is 0.1.28. The llama2 model that you have should still be the latest. I'm...

Hey @knoopx, you can actually do this by calling `curl http://localhost:11434/api/generate -d '{"model": "llama2", "keep_alive": 0}'` (not with `-1`, which will always leave the model loaded). That will immediately unload...
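The curl call above can be sketched programmatically as well. A minimal example of building that unload request body, assuming the default local server address from the comment; the payload is constructed here rather than actually sent:

```python
import json

# Request body for POST /api/generate that unloads the model immediately:
# keep_alive of 0 evicts "llama2" from memory as soon as the call returns,
# whereas -1 would keep it loaded indefinitely.
unload_request = {"model": "llama2", "keep_alive": 0}

# With a running server this would be sent as:
#   curl http://localhost:11434/api/generate -d '{"model": "llama2", "keep_alive": 0}'
body = json.dumps(unload_request)
print(body)
```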

We started with Linux/macOS, and I had shoved a colon into the name, not realizing that NTFS didn't support colons in file names. I also didn't anticipate so many people...

Sorry for the slow response, guys. There's actually an [FAQ](https://github.com/ollama/ollama/blob/main/docs/faq.md#how-do-i-set-them-to-a-different-location) which explains how to do this. *The short answer* is to use the `OLLAMA_MODELS` environment variable if you want to put...
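A minimal sketch of relocating the model store via `OLLAMA_MODELS`, the variable the FAQ describes; the directory path here is purely illustrative, and the server launch is shown only as a comment:

```python
import os

# Point ollama at a custom models directory before the server starts.
# The path is an example; any writable directory works.
os.environ["OLLAMA_MODELS"] = "/data/ollama/models"

# A server process launched from this environment would read and write
# models under the new location, e.g.:
#   subprocess.Popen(["ollama", "serve"], env=os.environ)
print(os.environ["OLLAMA_MODELS"])
```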

You shouldn't need to delete any of the files manually. If you stop the ollama service and restart it, it should clean up any dangling files. You can also change...

#2146 adds this, which will be available in `0.1.23`. Going to go ahead and close this. You can set `keep_alive` to `-1` when calling the chat API and it will...
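The chat-API usage described above can be sketched as follows; the request body is only constructed here, not sent, and the example prompt is made up for illustration:

```python
import json

# Chat request that pins the model in memory: keep_alive of -1 disables
# the idle-unload timeout entirely.
chat_request = {
    "model": "llama2",
    "messages": [{"role": "user", "content": "why is the sky blue?"}],
    "keep_alive": -1,
}

# With a running server this would be sent to the chat endpoint:
#   curl http://localhost:11434/api/chat -d '<this JSON>'
print(json.dumps(chat_request))
```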

@nathanleclaire I've been thinking about adding an `OLLAMA_KEEP_ALIVE` env variable to be able to change the default timeout. I don't want to go too extreme here, though, because ideally there...