jackylu0124
jackylu0124
I have a small question, could anyone please explain to me what's the purpose of wrapping the prediction inference call in a setTimeout() and also passing in exactly 10 milliseconds...
## 🐞Describe the bug I tried converting the `ControlFlowNet` example model from the documentation site ([https://coremltools.readme.io/docs/model-scripting](https://coremltools.readme.io/docs/model-scripting)) using `torch.jit.script()` and `coremltools.converters.convert()`, but I saw the `assert str(node.output().type()) == "Tensor"` assertion error...
### Describe the bug I tried to initialize an inference session with a Uint8Array representation of a very simple .ort model (see screenshot of the .ort file viewed in Netron...
First of all, thank you for this incredible project! I would like to hear about some of your insights on the trade-off between model performance and model quality, especially with...
### Describe the bug I tried to load a very simple .ort model (attached and also in the repo linked below) into my React Native app after converting it from...
Hey all, I have a quick question, is onnxruntime-genai ([https://onnxruntime.ai/docs/genai/api/python.html](https://onnxruntime.ai/docs/genai/api/python.html)) supported in Triton Inference Server's ONNX runtime backend? I couldn't find relevant sources in the documentation. Thanks!
### What were you trying to accomplish? I am trying to create a EKS cluster with the `eksctl create cluster -f cluster.yaml` command (see my `cluster.yaml` file content in the...
Hey all, I have a quick question, is onnxruntime-genai ([https://onnxruntime.ai/docs/genai/api/python.html](https://onnxruntime.ai/docs/genai/api/python.html)) supported in Triton Inference Server's ONNX runtime backend? I couldn't find relevant sources in the documentation. Thanks in advance!
Hey all, I have a question regarding the input and output shape configuration in the model configuration file. Basically I have a model that takes in images in the NCHW...
**Description** I am trying to build my own Triton client container image based off the `nvcr.io/nvidia/tritonserver:24.04-py3-sdk` container image, in which I install additional packages with `pip`. The issue arises when...