SenseVoice icon indicating copy to clipboard operation
SenseVoice copied to clipboard

单声道和多声道音频识别结果差异比较

Open qiutzh opened this issue 9 months ago • 0 comments

❓ Questions and Help

What is your question?

您好,请问老师有对比asr模型对单声道音频和多声道音频的识别结果差异吗? 我用客服对话数据做了一些测试,发现有些情况(比如双声道一些音频)识别结果还行,有的情况(比如一些单声道音频)识别结果会比较差。 请问老师,公开small版本asr模型,哪些场景下的asr识别结果效果比较不错呢?edu行业客服对话数据是否适合。 代码中,是否支持设置要识别的声道呢?(如识别所有声道,或者识别第1个声道)。感谢!

Code

no code

What have you tried?

What's your environment?

  • OS (e.g., Linux): ubuntu20.04
  • FunASR Version (e.g., 1.0.0): 1.2.6
  • ModelScope Version (e.g., 1.11.0): 1.24.1
  • PyTorch Version (e.g., 2.0.0): 2.5.1
  • How you installed funasr (pip, source): pip
  • Python version: 3.12
  • GPU (e.g., V100M32): rtx4090d
  • CUDA/cuDNN version (e.g., cuda11.7): cuda11.8
  • Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1): no
  • Any other relevant information:

qiutzh avatar Apr 14 '25 09:04 qiutzh