qwen.cpp
qwen.cpp copied to clipboard
Can you add an additional function to let convert.py support Qwen/Qwen-7B-Chat-Int4?
It seems conversion on Qwen-7B-Chat needs more than 32GB memory to run. It probably can be solved by conversion for Int4