Mehdi Bahri

Results 8 issues of Mehdi Bahri

Hi First of all, thank you for sharing this package! I'm installing spherecluster with pip and had to manually edit spherical_kmeans.py to fix the import of _k_means (I changed it...

Hi, First of all, thank you for providing this plugin! It really fills a gap. There is an issue with C++ code. Consider the below minimal working example: ``` //...

bug

Hi Thanks a lot for providing this backend. I have tried to use it and I have had some trouble getting Triton to load and run my OpenVINO models. I...

Hello The latest Triton image from NGC (24.04) includes the DALI backend, but the conda-packed environment that used to be shipped with the backend is no longer there. This breaks...

Hi Thank you for the great work you're doing on TensorRT-LLM and the Triton backend. I have some questions on matching versions between the tensorrt-llm python package, the backend, and...

help wanted

### Describe the issue I'm trying to optimize the vision encoder of a CLIP model exported from HuggingFace Transformers, but the attention subgraphs don't get fused. I tried with a...

model:transformer

**Description** In ONNXRuntime, the OpenVINO EP accepts configuration options to set the number of threads and number of streams documented [here](https://onnxruntime.ai/docs/execution-providers/OpenVINO-ExecutionProvider.html#cc-api-20), but these are ignored when passed to the EP...

**Description** I am encountering two possibly related issues with the Python backend and shared memory: 1. During operation, the shared memory usage keeps growing, leading to errors. It looks like...

question