FunASR The English model is not working properly

🐛 Bug

Hi I am trying the english ASR model paraformer-en , but the output is not in English at all.

To Reproduce

Load the paraformer-en and transcribe an English audio file.

from funasr import AutoModel
# paraformer-zh is a multi-functional asr model
# use vad, punc, spk or not as you need
model = AutoModel(model="paraformer-en",  
                  vad_model="fsmn-vad", 
                  punc_model="ct-punc", 
# spk_model="cam++"
                  )
res = model.generate(input=f"https://isv-data.oss-cn-hangzhou.aliyuncs.com/ics/MaaS/ASR/test_audio/asr_example_en.wav", 
            batch_size_s=300)
print(res)

Output: (The correct transcription should be "He tried to think how it could be." )

[{'key': 'asr_example_en', 'text': ' Procedure telegram, impatience phuket, sleeping danielot pro.'}]

Code sample

Expected behavior

I hope the English model could transcribe properly.

Environment

OS (e.g., Linux):
FunASR Version (e.g., 1.0.0): 1.1.6
ModelScope Version (e.g., 1.11.0): 1.14.0
PyTorch Version (e.g., 2.0.0): 2.1.0+cu118
How you installed funasr (pip, source): pip
Python version: python 3.10
GPU (e.g., V100M32): 3090 Ti
CUDA/cuDNN version (e.g., cuda11.7): 11.8
Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1): Not using docker
Any other relevant information:

Additional context

Sep 18 '24 03:09 treya-lin

Hi,

I have exactly the same problem when input a speech: "How are you doing today" and I got the result:

"sleeping war suspense mm sentosa"

Could you please explain what caused this bug?

Jun 04 '25 03:06 pvtoan

@pvtoan Hi, I have the same problem, have you solved it? thanks

Sep 12 '25 07:09 liguiyuan