dakai
> > This approach should result in a more scalable (maybe also cleaner) architecture:
> >
> > Run a vLLM API server for each GPU, serving on different ports. Then...
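A minimal sketch of that idea, assuming four GPUs, the vLLM OpenAI-compatible entrypoint, and placeholder values for the model name and base port (all of these are assumptions, not from the thread). It builds one launch command per GPU and round-robins requests across the resulting ports:

```python
# Hypothetical sketch: one vLLM API server per GPU, each pinned via
# CUDA_VISIBLE_DEVICES and serving on its own port. NUM_GPUS, BASE_PORT,
# and MODEL are placeholder assumptions.

NUM_GPUS = 4
BASE_PORT = 8000
MODEL = "meta-llama/Llama-2-7b-hf"  # placeholder model name


def server_command(gpu: int) -> str:
    """Build the shell command for the server bound to one GPU."""
    return (
        f"CUDA_VISIBLE_DEVICES={gpu} "
        f"python -m vllm.entrypoints.openai.api_server "
        f"--model {MODEL} --port {BASE_PORT + gpu}"
    )


def pick_port(request_id: int) -> int:
    """Round-robin incoming requests across the per-GPU servers."""
    return BASE_PORT + (request_id % NUM_GPUS)


if __name__ == "__main__":
    # Print the launch command for each GPU-bound server.
    for gpu in range(NUM_GPUS):
        print(server_command(gpu))
```

A simple load balancer (nginx, or the `pick_port` logic above in a client) can then spread traffic over ports 8000–8003.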
I ran into the same problem. Is there any solution to it? @busishengui @Hukongtao
> @liushz Thank you for your response; I appreciate your clarification. However, the parameter in your reply pertains to setting tensor parallelism in vLLM. My intention is to load the...
Thanks, but I have already tried this. I started from the section "[Enabling GPU Support in Kubernetes](https://github.com/NVIDIA/k8s-device-plugin#enabling-gpu-support-in-kubernetes)". I think this image has already done the work before that section; I am not...
I am trying to follow these steps, but `systemctl` is not supported in this image, so I am unsure how to run `systemctl restart docker`. I tried some ways to install...
Still does not work.

1. I restarted a container: `docker run --gpus 1 -it --privileged --name ElasticDL -d elasticdl:v1`. The image `elasticdl:v1` only adds `minikube`.
2. Ran `docker exec -it ElasticDL...`
I also tried this document, which is similar to Nvidia's: https://github.com/intelligent-machine-learning/dlrover/blob/master/docs/tutorial/gpu_user_guide.md. Still the same result.

```
root@c0ac3df639d6:/usr/src# kubectl describe pod nvidia-device-plugin-daemonset-r9spv -n kube-system
Name:         nvidia-device-plugin-daemonset-r9spv
Namespace:    kube-system
Priority:     2000001000...
```