copasseron

Results 2 issues of copasseron

Triton's vLLM backend is based on vLLM 0.4.2, which supports more arguments than the ones listed in the tutorial documentation.

### Your current environment

The output of `python collect_env.py`

```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS:...
```

bug