> When deploying with vLLM, how can I load the model across multiple GPUs? With `CUDA_VISIBLE_DEVICES=0,1` only one GPU actually loads the model, which is strange. Thanks.

Try adding `--tp=2` to the launch arguments.
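Roughly the same thing through the Python API, as a minimal sketch (the model name is just a placeholder): `tensor_parallel_size=2` is the long form of `--tp=2` and tells vLLM to shard the weights across both visible GPUs.

```python
# Sketch: shard a model across 2 GPUs with vLLM tensor parallelism.
import os

os.environ["CUDA_VISIBLE_DEVICES"] = "0,1"  # expose both GPUs before vLLM initializes CUDA

from vllm import LLM, SamplingParams

llm = LLM(
    model="facebook/opt-6.7b",   # placeholder; use your own checkpoint
    tensor_parallel_size=2,      # same effect as --tp=2 on the CLI
)
print(llm.generate(["Hello"], SamplingParams(max_tokens=16)))
```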
Hi @hjc3613, you can offload to NVMe instead of CPU memory; please check out [nvme offload](https://www.deepspeed.ai/tutorials/zero/#offloading-to-cpu-and-nvme-with-zero-infinity).
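The relevant part of the ZeRO-3 config looks roughly like the sketch below (the NVMe path is a placeholder; see the linked tutorial for the full set of knobs and tuning advice):

```python
# Sketch of the ZeRO-3 section of a DeepSpeed config for NVMe offload.
ds_config = {
    "zero_optimization": {
        "stage": 3,
        "offload_optimizer": {
            "device": "nvme",
            "nvme_path": "/local_nvme",  # placeholder mount point of the NVMe drive
            "pin_memory": True,
        },
        "offload_param": {
            "device": "nvme",
            "nvme_path": "/local_nvme",
        },
    },
}
```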
You can achieve that by setting `fp32_optimizer_states=False` when initializing `DeepSpeedCPUAdam`; this parameter was added to DeepSpeed in version 0.14.3. Note: if you are using the transformers trainer, it will...
Make sure you are using `DeepSpeedCPUAdam`; you can find its signature here: [DeepSpeedCPUAdam](https://github.com/microsoft/DeepSpeed/blob/ffe0af23575c4f03a07408eacfc50b1a58781429/deepspeed/ops/adam/cpu_adam.py#L25)
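If you are constructing the optimizer yourself, it would look roughly like this sketch (import path taken from the file linked above; the model and hyperparameters are placeholders):

```python
# Sketch: keep the CPU optimizer states in the params' own dtype instead of fp32
# by passing fp32_optimizer_states=False (requires DeepSpeed >= 0.14.3).
import torch
from deepspeed.ops.adam import DeepSpeedCPUAdam

model = torch.nn.Linear(1024, 1024).bfloat16()  # placeholder model
optimizer = DeepSpeedCPUAdam(
    model.parameters(),
    lr=1e-4,                      # placeholder hyperparameters
    weight_decay=0.01,
    fp32_optimizer_states=False,  # states stay in the params' dtype, cutting CPU memory
)
```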
Maybe you can refer to [partition_parameters.py](https://github.com/microsoft/DeepSpeed/blob/master/deepspeed/runtime/zero/partition_parameters.py#L2044).