geraldstanje1

Results 26 comments of geraldstanje1

hi, can sentence-transformers e.g. https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2 already used with tensorRT-LLM? my goal is to compile a sentence-transformers/all-MiniLM-L6-v2 model without quantization using tensorRT-LLM and serve with triton... are there any docs how...

@sbueringer where to call log.SetLogger(...) - in controller-runtime?

@sbueringer do you mean there is an issue in the controller-runtime code in sigs.k8s.io/controller-runtime/pkg?

but the error comes from the library - do you see?

@lix19937 can you also run tensorrt engine with golang?

hi any updated on gemma3 support?

hi @karljang and @juney-nvidia any update?

@karljang so for that case we need to use triton with vllm backend?