onnxruntime
onnxruntime copied to clipboard
ONNX Runtime doesn't support the graph optimization of vision-encoder-decoder yet
Is there another way to quantized TrOCR model to speed the inference, i alreayd have in ONNX form but the inference time is to high