llama.cpp icon indicating copy to clipboard operation
llama.cpp copied to clipboard

Feature Request: Support GLM-4.1V-9B-Thinking

Open Willian7004 opened this issue 9 months ago • 12 comments

Prerequisites

  • [x] I am running the latest code. Mention the version if possible as well.
  • [x] I carefully followed the README.md.
  • [x] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • [x] I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

Support GLM-4.1V-9B-Thinking

Motivation

It's a SOTA open sourse thinking VLM.

Possible Implementation

No response

Willian7004 avatar Jul 02 '25 05:07 Willian7004

Any updates?

wogam avatar Jul 17 '25 07:07 wogam

Yeah any updates?

IDKKKKK avatar Jul 20 '25 21:07 IDKKKKK

You can already test my solution, no need to recompile anything (it works like the old GLM-4). You just need to use the converter to create a GGUF file for GLM-4.1V-9B-Thinking.

jacekpoplawski avatar Jul 23 '25 21:07 jacekpoplawski

It looks like https://github.com/ggml-org/llama.cpp/pull/14823 only support text, so we should leave this issue open until vision is supported?

rujialiu avatar Jul 24 '25 02:07 rujialiu

I can see text works okay but still waiting for vision support. Please could this be implemented, this model is really good for vision.

wogam avatar Aug 02 '25 09:08 wogam

same, it would be nice to have it, i tried it on a demo and it can scan a whole page and give you an over all really good result. support for it would be really nice.

IDKKKKK avatar Aug 02 '25 09:08 IDKKKKK

yes its very good model for work , i try it on demo site very accurate response for vision task , please if you can support it as vision model on llama-server .

i try many steps but fail , its need --mmproj projector file to run on llama-server and provided gguf suport text only no vision no audio

https://www.modelscope.cn/models/unsloth/GLM-4.1V-9B-Thinking-GGUF/summary

Alias4D avatar Aug 04 '25 16:08 Alias4D

The vision is what makes this model interesting, is there some unknown problem converting it?

clort81 avatar Aug 10 '25 01:08 clort81

Now that GLM-4.5V is relased. It'll be great to have both supported.

rujialiu avatar Aug 12 '25 10:08 rujialiu

Yeah it would make sense

IDKKKKK avatar Aug 14 '25 12:08 IDKKKKK

Have there been any recent advancements?

wogam avatar Aug 28 '25 16:08 wogam

is there a mmproj for glm-4.1v yet?

BryantVanOrden avatar Oct 10 '25 18:10 BryantVanOrden

This issue was closed because it has been inactive for 14 days since being marked as stale.

github-actions[bot] avatar Nov 24 '25 01:11 github-actions[bot]