LanShanPi

Results 8 comments of LanShanPi

@janglichao 我也遇到了这个问题,请问你解决了吗?怎么解决的。

I execute the follow command to configuration anaconda environment: pip install deepspeed>=0.9.0 git clone https://github.com/microsoft/DeepSpeedExamples.git cd DeepSpeedExamples/applications/DeepSpeed-Chat/ pip install -r requirements.txt

在fastllm/CMakeLIsts.txt文件中将set(CMAKE_CUDA_ARCHITECTURES "native") 中的native改成显卡对应的算力,11.7应该对应80.

@fushengwuyu 这样在用llm.from_hf()加速的时候不会重复加载模型到gpu吗