Patrick Devine
Hey @jackielii , sorry for the slow response. There actually is a _raw mode_ for `/api/generate` if you check out the [API documentation](https://github.com/ollama/ollama/blob/main/docs/api.md#generate-a-completion). We've been trying to rethink the way...
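For reference, raw mode is just a flag on the request body: with `"raw": true`, Ollama skips its prompt templating and passes the prompt to the model verbatim, so the caller is responsible for any instruction formatting. A minimal sketch of such a request body (the model name and prompt template here are hypothetical examples, not from the thread):

```python
import json

# Sketch of a raw-mode request body for POST /api/generate.
# "raw": True tells Ollama to skip its prompt template and send the
# prompt verbatim; "stream": False asks for a single JSON response.
payload = {
    "model": "llama3.2",  # hypothetical model name
    "prompt": "[INST] Why is the sky blue? [/INST]",  # caller supplies the full template
    "raw": True,
    "stream": False,
}

body = json.dumps(payload)
print(body)
```

This body can then be POSTed to `http://localhost:11434/api/generate` with any HTTP client.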
Hey @FlorinAndrei sorry this is confusing. `0.5.7` is the current release. `0.5.8` and `0.5.9` are in "pre-release" (you should be able to see this on the releases page); `0.5.8` _would_...
cc @jmorganca
I'm going to go ahead and close this as stale. There have been a lot of improvements w/ embedding models on the ollama engine (i.e. not the legacy llama.cpp engine)...
Hey @bulrush15 , what's wrong with `/bye` or Ctrl-D?
Going to go ahead and close this as stale.
For people who want to try this:

* Download the [pre-release of Ollama 0.4.0](https://github.com/ollama/ollama/releases/tag/v0.4.0-rc4)
* Pull the vision model with `ollama pull x/llama3.2-vision:11b` (you can find the other quantizations at...
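Once the model is pulled, it can also be called over the API: per the API docs, `/api/generate` takes multimodal input through an `images` field containing base64-encoded image data. A minimal sketch of building such a request body (the placeholder bytes stand in for a real image file):

```python
import base64
import json

def vision_payload(model: str, prompt: str, image_bytes: bytes) -> str:
    """Build a /api/generate request body with one base64-encoded image."""
    return json.dumps({
        "model": model,
        "prompt": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    })

# Placeholder bytes for illustration; in practice read a real image file.
body = vision_payload(
    "x/llama3.2-vision:11b",
    "What is in this picture?",
    b"\x89PNG...",
)
```

POSTing `body` to `http://localhost:11434/api/generate` should then return the model's description of the image.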
Hey @573932914 , the part that you cut/pasted is just the same as on your screenshot. Can you paste in what `e.response.text` and `e.response.status_code` are set to?
Closing since this is supported now. You can find the tags here: https://ollama.com/library/llama3.1/tags
@emsi and @pacozaa I believe the problem is the uppercase characters in the model name. If you can just change the names to lowercase it _should_ work. We actually have...