Fizz~

Results 5 issues of Fizz~

Hi! Are there currently any plans to support the [9b version of RecurrentGemma](https://huggingface.co/google/recurrentgemma-9b) that recently released?

Feature
stat:awaiting response

### The model to consider. https://huggingface.co/THUDM/GLM-4-32B-0414, etc ### The closest model Aphrodite already supports. https://github.com/aphrodite-engine/aphrodite-engine/blob/main/aphrodite/modeling/models/chatglm.py Also already in vLLM https://github.com/yangw-dev/vllm/blob/main/vllm/model_executor/models/glm4.py ### What's your difficulty of supporting the model you want?...

Ported from vLLM

Previously, cooldown steps were _defined_ in the config, but never used anywhere -- this PR fixes that, and brings it up to par with the warmup hyperparams. Also, inspired by...

### Prerequisites - [x] I am running the latest code. Mention the version if possible as well. - [x] I carefully followed the [README.md](https://github.com/ggml-org/llama.cpp/blob/master/README.md). - [x] I searched using keywords...

enhancement