gq2010 comments


Results: 3 comments by gq2010

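I am fine-tuning the ChatGLM3-6B model with LLaMA-Factory and keep running into out-of-memory errors. With 16 GB of GPU memory, the error says it is a few dozen MB short; after switching to 24 GB of GPU memory, it still says it is a few dozen MB short. Fine-tuning with the method provided officially by ChatGLM fits comfortably in 24 GB. The fine-tuning data file is only a little over 500 KB in total: [law0.json](https://github.com/hiyouga/LLaMA-Factory/files/14622707/law0.json)

Fine-tuning with LLaMA-Factory: ![3cf239ba6b64a838218d9627f66b6b9](https://github.com/hiyouga/LLaMA-Factory/assets/5334458/910dfb24-753a-41e2-b37a-d0e0841d5244)

The fine-tuning method officially recommended by ChatGLM (LoRA): ![image](https://github.com/hiyouga/LLaMA-Factory/assets/5334458/50ae7eea-3e26-46dd-9f1f-363d6ee830f4)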

xinference, glm4: tool calling returns error 400

I deployed the glm4 model locally, but the code provided on the official site does not include the /v1/embeddings API. How can this be solved?

gq2010

Module parse failed

4-bit LongLoRA fine-tuning runs out of GPU memory

xinference, glm4: tool calling returns error 400
