Patrick Devine
This change converts Mixtral 8x7b directly into an Ollama model. The 8x22b model will be added in a separate PR as it has a different structure for the way the experts...
This change exposes the GGML KVs and tensor data to make it easier to introspect a model.
This change allows you to convert directly from a llama3 derived safetensors model into Ollama. It is currently *missing*:
* pytorch *almost* works however the embeddings layer size is off...
This change adds a rudimentary `ps` command which makes use of the new scheduler changes in the server. The UX for this depends on whether you're using...
Fixes #4297
This moves the parser back to `parser/` and also adds support for decoding utf16le and utf16be files. Fixes #4503
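As a rough illustration of what decoding utf16le and utf16be input involves, here is a minimal sketch that uses only the Go standard library. `decodeText` is a hypothetical helper, not the function from this PR, and it assumes the encoding is signalled by a leading byte-order mark; input without a UTF-16 BOM is treated as UTF-8.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
	"unicode/utf16"
)

// decodeText is a hypothetical helper (not the PR's actual code). It checks
// for a UTF-16 byte-order mark and returns the content as a UTF-8 string.
func decodeText(data []byte) string {
	var order binary.ByteOrder
	switch {
	case bytes.HasPrefix(data, []byte{0xFF, 0xFE}):
		order = binary.LittleEndian // utf16le
	case bytes.HasPrefix(data, []byte{0xFE, 0xFF}):
		order = binary.BigEndian // utf16be
	default:
		// Assumption: anything without a UTF-16 BOM is already UTF-8.
		return string(data)
	}

	data = data[2:] // strip the BOM

	// Read the payload as 16-bit code units; a trailing odd byte is dropped.
	units := make([]uint16, 0, len(data)/2)
	for i := 0; i+1 < len(data); i += 2 {
		units = append(units, order.Uint16(data[i:i+2]))
	}

	// utf16.Decode handles surrogate pairs and returns runes.
	return string(utf16.Decode(units))
}

func main() {
	le := []byte{0xFF, 0xFE, 'F', 0, 'R', 0, 'O', 0, 'M', 0}
	be := []byte{0xFE, 0xFF, 0, 'F', 0, 'R', 0, 'O', 0, 'M'}
	fmt.Println(decodeText(le), decodeText(be))
}
```

Both byte slices in `main` encode the same word, so the two calls return the same string regardless of byte order.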
This change converts a Safetensors based LoRA into GGUF and ties it w/ a base model. Only llama2/llama3/mistral will work initially. You can create the Modelfile to look like: ```...
If a Modelfile has an error in it, it's often difficult to debug where the error is located in the Modelfile itself. This change adds the line which the error...
This PR changes the way the POST `/api/create` endpoint works by changing the way the various options/parameters get serialized and passed to the server. Currently the create endpoint requires a...