llama.cpp

WizardLM model does not let user type after first prompt

Open Asory2010 opened this issue 2 years ago • 1 comments

For example, I tell it:

User: hi
AI: hello

Then the cmd window goes blank (as if it were pressing Enter to start a new line of text).

Asory2010 Apr 27 '23 21:04

The following is the script I use to run WizardLM models. They work fine for me.

# maximum compatibility
./main -t 12 -m models/wizardLM-7B-HF/wizardLM-7B.ggml.q4_0.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 --interactive-first -r "Human:"
# best compromise between resource usage, speed, and quality
#./main -t 12 -m models/wizardLM-7B-HF/wizardLM-7B.ggml.q4_2.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 --interactive-first -r "Human:"
# maximum 4-bit quality, high RAM requirements and slower inference
#./main -t 12 -m models/wizardLM-7B-HF/wizardLM-7B.ggml.q4_3.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 --interactive-first -r "Human:"
# brand-new 5-bit method, potentially higher quality than 4-bit at the cost of slightly higher resource usage
#./main -t 12 -m models/wizardLM-7B-HF/wizardLM-7B.ggml.q5_0.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 --interactive-first -r "Human:"
# brand-new 5-bit method, slightly higher resource usage than q5_0
#./main -t 12 -m models/wizardLM-7B-HF/wizardLM-7B.ggml.q5_1.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 --interactive-first -r "Human:"
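
Since the variants in the script above differ only in the quantization suffix of the model file, the commenting and uncommenting could be replaced by a small wrapper. This is only a sketch: `wizard_cmd`, `MODEL_DIR`, and `QUANT` are hypothetical names made up for illustration, not part of llama.cpp, and the flags are copied unchanged from the script above. It prints the command rather than running it, so it can be checked first.

```shell
#!/bin/sh
# Hypothetical helper (not part of llama.cpp): builds the ./main command line
# from the script above for a given quantization suffix.
# Assumption: model files follow the wizardLM-7B.ggml.<quant>.bin naming
# pattern used in the script above.
MODEL_DIR="models/wizardLM-7B-HF"

wizard_cmd() {
    # Default to q4_0, the "maximum compatibility" variant.
    quant="${1:-q4_0}"
    printf '%s\n' "./main -t 12 -m $MODEL_DIR/wizardLM-7B.ggml.$quant.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 --interactive-first -r \"Human:\""
}

# Print the command for the variant named in QUANT (q4_0 if unset).
wizard_cmd "${QUANT:-q4_0}"
```

To run a variant instead of printing it, something like `eval "$(wizard_cmd q5_0)"` should work, assuming the matching model file exists in `MODEL_DIR`.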

iplayfast Apr 29 '23 06:04

This issue was closed because it has been inactive for 14 days since being marked as stale.

github-actions[bot] Apr 09 '24 01:04