copasseron
Results
2
issues of
copasseron
Triton's vLLM backend is based on vLLM 0.4.2 that propose more argument to the one in the documentation of the tutorial.
### Your current environment The output of `python collect_env.py` ```text PyTorch version: 2.4.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A OS:...
bug