Li Gang
GenAIExamples/ChatQnA/docker/Dockerfile: the git clone https://github.com/opea-project/GenAIComps.git step will cause a version control issue.
## Description Enable ChatQnA with vLLM inference on Intel Arc GPU ## Issues n/a ## Type of change List the type of change like below. Please delete options that are...
### Self Checks - [x] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general). - [x] I have searched for existing...
**Describe the bug** I need to get the average latency for each rerank request, but ovms_request_time_us_sum is currently always 0. I want to clarify which metric I can use, or how to calculate....