runpod-python icon indicating copy to clipboard operation
runpod-python copied to clipboard

VLLM Serverless Endpoint Deployment using Python

Open arianyambao opened this issue 1 year ago • 2 comments

Hello How do we deploy a VLLM Serverless endpoint using Python?

Do we explicitly have to create templates first? Not straight to the point of using the vLLM ready template?

arianyambao avatar Jan 03 '25 14:01 arianyambao

I don't get what you mean, maybe you mean wanna add something to the vllm for your own use? maybe you can try making a dockerfile but with the base image of VLLM-worker, and modify the entrypoint or CMD

nerdylive123 avatar Feb 08 '25 02:02 nerdylive123

@arianyambao Have you looked at this article? How to run vLLM with RunPod Serverless

deanq avatar Feb 09 '25 03:02 deanq