Xingbin Liu
Xingbin Liu
> > pytorch 1.6. cuda10.1 > > when i run "./make.sh". I come cross this problem > > """ > > File "/home/lxb/anaconda3/envs/torch/lib/python3.8/site-packages/setuptools/command/build_ext.py", line 208, in build_extension > > _build_ext.build_extension(self,...
> > > > pytorch 1.6. cuda10.1 > > > > when i run "./make.sh". I come cross this problem > > > > """ > > > > File...
any progress for this problem?
Thanks for your reply. I am looking forward to the training code. Best, 在 2024-03-18 14:34:45,"Junfeng Wu" ***@***.***> 写道: Hi~ Thanks for your attention! The training code and models will...
> Hi~ Thanks for your attention! The training code and models will be released in this week! Hi, thanks for the update. I am wondering if there are any tips...
any update?
I change the code in `vllm/model_executor/model_loader/openvino.py` to 'GPU'  It turns out that `[rank0]: RuntimeError: Exception from src/inference/src/cpp/core.cpp:104: [rank0]: Exception from src/inference/src/dev/plugin.cpp:53: [rank0]: Exception from src/plugins/intel_gpu/src/plugin/program_builder.cpp:246: [rank0]: Operation: PagedAttentionExtension_39914 of...
Hi I tried the PR, but new error occurred. I use openvino-gpu to run qwen2-0.5b. It turns out: ``` Traceback (most recent call last): File "/workspace/vllm/vllm/worker/openvino_worker.py", line 302, in determine_num_available_blocks...
I change the GPU available memory [here](https://github.com/sshlyapn/vllm_gpu/blob/ov_gpu_support/vllm/envs.py#L297), which solves my problem.