Hao Li
Hao Li
mlc-ai-nightly-cu122 0.15.dev404 mlc-llm-nightly-cu122 0.1.dev1355 transformers 4.41.2 git clone https://huggingface.co/THUDM/glm-4-9b-chat mlc_llm convert_weight ./dist/models/glm-4-9b-chat/ --quantization q4f16_1 -o dist/glm-4-9b-chat-MLC mlc_llm gen_config ./dist/models/glm-4-9b-chat/ --quantization q4f16_1 --conv-template glm -o dist/glm-4-9b-chat-MLC/ It shows The repository for...
NotImplementedError: <class 'transformers_modules.modeling_chatglm.ChatGLMForConditionalGeneration'>
When running python -m awq.entry --model_path ./chaglm3-6b --w_bit 4 --q_group_size 128 --run_awq --dump_awq ./awq_cache/chatglm3-6b-w4-g128.pt It shows error like NotImplementedError: any plan to support chatglm3-6b? Thanks!
### Name and Version ggml_cuda_init: found 1 CUDA devices: Device 0: NVIDIA GeForce RTX 4060 Laptop GPU, compute capability 8.9, VMM: yes version: 6907 (bea04522f) built with MSVC 19.44.35219.0 for...
add sample for offloading Resnet to AMD Ryzen AI NPU