Fine-tuned Model Versions

Open ddlinh opened this issue 7 months ago • 0 comments

Dear Khai,

I have accessed the ASR models published at https://huggingface.co/leduckhai/MultiMed-ST, including 'whisper-small-vietnamese' and 'whisper-small-multilingual', to generate transcripts for the audio files. However, due to the complex mix of voices in the recordings, these versions haven't produced satisfactory transcripts.

I noticed in your paper that you mentioned other fine-tuned versions based on whisper-medium and whisper-large. I would be very interested in trying out these models for my use case.

Would it be possible for me to access them? Thank you very much.

Jun 29 '25 13:06 ddlinh