Chunyan Li comments

Repositories
Issues
Comments

Results 1 comments of


                                            Chunyan Li

[Feature] Expert Parallelism Load Balancer (EPLB)

@abmfy Hello, I'm encountering the following error when using multi-GPU parallel processing Here's my startup command: python -m vllm.entrypoints.openai.api_server --model="/public/models/hf_models/DeepSeek-V2-Lite-Chat-FP8-A16" --trust-remote-code -tp 2 -dp 2 --port 8200 --enforce-eager --enable-eplb --eplb-log-balancedness...