Patrick Devine
@Picaso2 other than the multimodal models, we don't _yet_ support loading multiple models into memory simultaneously. What use case are you trying to address?
Sorry for the slow response. This did get fixed a while back but the issue never got updated. Here's an example:

```
% ./ollama run llava:13b "Describe this image: /Users/pdevine/Pictures/steve.png"
...
```
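The same thing works over the REST API by passing the image as base64 in the `images` field. A minimal sketch (the image path is a placeholder):

```
# minimal sketch: describe an image with a multimodal model via the API
# (image path is a placeholder; tr -d '\n' keeps the base64 on one line)
curl http://localhost:11434/api/generate -d "{
  \"model\": \"llava:13b\",
  \"prompt\": \"Describe this image:\",
  \"images\": [\"$(base64 < image.png | tr -d '\n')\"]
}"
```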
Hi @byteconcepts, sorry for the slow response. I just pulled and was successful:

```
% ./ollama pull dolphin-mixtral:8x7b-v2.5-q3_K_L
pulling manifest
pulling a69e225da78e... 100% ▕█████████████████████████████████████████████████████████████████▏ 20 GB
pulling 43070e2d4e53... 100%
...
```
I think you were just running out of memory when you were trying to run the model. We've made several changes to how we handle memory since you filed this,...
We actually changed the docs on this a while back to not use the docker image for quantizing. You can see it [here](https://github.com/ollama/ollama/blob/main/docs/import.md#quantize-the-model). I have been working on a new...
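For anyone landing here later, the non-Docker flow in those docs amounts to building llama.cpp's `quantize` tool and pointing it at an f16 GGUF. A rough sketch, with placeholder paths and quantization type; the import.md link above has the authoritative steps:

```
# rough sketch of quantizing without the docker image, using llama.cpp
# (paths and the q4_0 type are placeholders; see import.md for details)
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
make quantize
./quantize /path/to/model-f16.gguf /path/to/model-q4_0.gguf q4_0
```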
Hey @adriens, is the animation rendering incorrectly? I'm just wondering what the use case is.
@adriens Have you tried out the new [python bindings](https://github.com/ollama/ollama-python)?
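For anyone else finding this, a minimal sketch of trying the bindings out (the model name is a placeholder, and this assumes you've already pulled it):

```
# minimal sketch: install the python bindings and run a one-shot chat
pip install ollama
python - <<'EOF'
import ollama

# one-shot chat against a locally pulled model (placeholder name)
resp = ollama.chat(model='llama2', messages=[
    {'role': 'user', 'content': 'Why is the sky blue?'},
])
print(resp['message']['content'])
EOF
```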
@adriens are you OK with closing the issue? I'm not sure we need this, but we can always reopen it in the future.
I've submitted #2179
This should be working better now: Ollama will offload a portion of the model to the GPU and run the rest on the CPU. Can you test again with Ollama version 0.1.28? There...
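If you want to control the split yourself, the `num_gpu` option sets how many layers get offloaded; a minimal sketch against the REST API (the model name and layer count are placeholders):

```
# minimal sketch: pin the number of layers offloaded to the GPU
# via the num_gpu option (model name and count are placeholders)
curl http://localhost:11434/api/generate -d '{
  "model": "llama2",
  "prompt": "Why is the sky blue?",
  "options": { "num_gpu": 20 }
}'
```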