aabbccddwasd

Results 3 issues of aabbccddwasd

FP16 version is impossible to run on local GPU, I hope there will be GPTQ and AWQ versions, please!!!

### Your current environment ```text The output of `python collect_env.py` ``` ### How would you like to use vllm (EngineCore_0 pid=20659) WARNING 08-11 05:36:17 [fp8_utils.py:593] Using default W8A8 Block FP8...

usage
stale

假设我使用4张rtx pro 6000加双路epyc 9005推理满血的deepseek 685B,性能上会领先于单路吗?