RTranslator icon indicating copy to clipboard operation
RTranslator copied to clipboard

[Question] how to export huggingface whisper models to onnx ?

Open eix128 opened this issue 1 year ago • 10 comments

On huggingface , we need to use some safetensors file here : https://huggingface.co/openai/whisper-large-v3-turbo

how can we convert these models to onnx that is compatible with this library ?

Also for NLLB onnx

We need to fine tune and change onnx files. But your onnx files different from whisper turbo v3 models

eix128 avatar Dec 16 '24 07:12 eix128

Hi, in the next week I will probably write an article on how I converted whisper in onnx and optimized it. In the meantime you can check here for some details.

niedev avatar Dec 20 '24 18:12 niedev

@niedev Hello, do you have any plans to release the Whisper model converted to ONNX? I’m very eager to know. Thank you.

laixiaofeng0 avatar Feb 13 '25 10:02 laixiaofeng0

@laixiaofeng0 all the models used by RTranslator are here in the assets.

niedev avatar Feb 15 '25 17:02 niedev

Hi, in the next week I will probably write an article on how I converted whisper in onnx and optimized it. In the meantime you can check here for some details.

Hi, author. First of all, thank you for this wonderful project.

Could I ask when you will release the "article" about converting whisper in onnx (including kv-cache and quantization)? since I want to load my fine-tuned whisper model into R-translator.

Eric-Edf avatar Mar 10 '25 07:03 Eric-Edf

@Eric-Edf Thank you! I haven't had much time lately so I haven't even managed to start it, sorry, but as soon as I can do it I'll link it here

niedev avatar Mar 10 '25 20:03 niedev

@Eric-Edf Thank you! I haven't had much time lately so I haven't even managed to start it, sorry, but as soon as I can do it I'll link it here

Thank you for your reply. Looking forward to it!

Eric-Edf avatar Mar 13 '25 01:03 Eric-Edf

Could you please write it quickly? I need him very much. Thank you.@niedev

tianxing1234-coder avatar May 29 '25 08:05 tianxing1234-coder

@tianxing1234-coder the quick version is here

niedev avatar May 31 '25 11:05 niedev

@tianxing1234-coder the quick version is here

Looking forward !!!!!!!!!!!!!!!! NLLB and whisper !!!!! Convert to onnx model and quantize!!!!

Geministudents avatar Jul 22 '25 07:07 Geministudents

@Geministudents I was about to direct you here, let me know how it goes 🚀

niedev avatar Jul 22 '25 08:07 niedev