geraldstanje1
geraldstanje1
hi, can sentence-transformers e.g. https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2 already used with tensorRT-LLM? my goal is to compile a sentence-transformers/all-MiniLM-L6-v2 model without quantization using tensorRT-LLM and serve with triton... are there any docs how...
@sbueringer where to call log.SetLogger(...) - in controller-runtime?
@sbueringer do you mean there is an issue in the controller-runtime code in sigs.k8s.io/controller-runtime/pkg?
but the error comes from the library - do you see?
@lix19937 can you also run tensorrt engine with golang?
hi any updated on gemma3 support?
hi @karljang and @juney-nvidia any update?
@karljang so for that case we need to use triton with vllm backend?