Chtholly.Ruq.Seniorious comments

Results 5 comments of Chtholly.Ruq.Seniorious

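The feature area does not show the plugins or components I have already added

+1. With v2.1.7-preview + Violentmonkey/Tampermonkey + "Allow access to file URLs" enabled, the problem is still not resolved; the feature area shows 【空空如也】 ("Nothing here").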

network_architecture not available

@LouisDo2108 I had the same issue and was able to resolve it by running 'conda install python-graphviz'.

When I run code inside http://localhost:2358/dummy-client.html it gives me the following error

> > I am facing the same issue as well. Will changing the value to true help?
> >
> > { "stdout": null, "time": null, "memory": null, "stderr": null, "token":...

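funasr + Whisper speech recognition (multilingual, large-v3) + fsmn-vad + ct-punc-c + cam++ fails with: raise NotImplementedError("batch decoding is not implemented") NotImplementedError: batch decoding is not implemented

I used Cursor to produce a modified version that runs. Below is Cursor's summary.

### 1. Why this problem occurs

The root cause is a limitation in the Whisper model implementation in the FunASR library:

1. **Batch processing is not implemented**: In FunASR's WhisperWarp class, the `inference` method explicitly checks the `batch_size` parameter and raises the error "batch decoding is not implemented" when it is greater than 1, because the Whisper model implementation does not support batching.
2. **VAD segmentation**: For longer audio, FunASR first uses a VAD (voice activity detection) model to split the audio into multiple segments, then sends those segments to the ASR model in batches. This is efficient for other models (such as SenseVoice and Paraformer), but the Whisper model does not support batching.
3. **Unsuitable parameter configuration**: The default `batch_size_s` is 60 seconds, which produces a very large batch size (about 60000 after the internal conversion to milliseconds), whereas the Whisper model is better suited to shorter audio segments.

### 2. Changes made

To solve this problem, I implemented the following optimizations:

1. **Created a Whisper patch**:
   - Modified the `inference` method of the WhisperWarp class via monkey patching
   - When `batch_size > 1`, the method no longer raises an error; it processes each sample one by one and then merges the results
2. **Limited the batch size**:...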
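The monkey-patch idea described above can be sketched roughly as follows. This is only an illustration of the technique, not the actual patch from the comment: `StandInWhisper`, its `inference` signature, and its return format are hypothetical stand-ins, and FunASR's real WhisperWarp class may take different arguments and return different structures.

```python
import functools


class StandInWhisper:
    """Hypothetical stand-in for a model whose inference rejects batches.

    This is NOT FunASR's WhisperWarp; the signature and return value
    are assumptions made purely for illustration.
    """

    def inference(self, data_in, batch_size=1, **kwargs):
        if batch_size > 1:
            raise NotImplementedError("batch decoding is not implemented")
        # Pretend to transcribe the single sample in data_in.
        return [{"text": f"transcript of {data_in[0]}"}]


def patch_batch_inference(cls):
    """Monkey-patch cls.inference so batches are decoded one sample at a time."""
    original = cls.inference

    @functools.wraps(original)
    def inference(self, data_in, batch_size=1, **kwargs):
        if batch_size <= 1:
            # Single-sample calls go straight to the original method.
            return original(self, data_in, batch_size=batch_size, **kwargs)
        results = []
        for sample in data_in:
            # Call the original method per sample, then merge the results.
            results.extend(original(self, [sample], batch_size=1, **kwargs))
        return results

    cls.inference = inference
    return cls


patch_batch_inference(StandInWhisper)
model = StandInWhisper()
outputs = model.inference(["a.wav", "b.wav"], batch_size=2)
```

Wrapping the original method instead of rewriting it keeps the single-sample path unchanged, so only the previously failing `batch_size > 1` case behaves differently. The trade-off is that samples are decoded sequentially, so nothing is gained from batching.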

Chtholly.Ruq.Seniorious

The feature area does not show the plugins or components I have already added

network_architecture not available

Improvements to automatic team recognition and the combat feature

When I run code inside http://localhost:2358/dummy-client.html it gives me the following error

funasr + Whisper speech recognition (multilingual, large-v3) + fsmn-vad + ct-punc-c + cam++ fails with: raise NotImplementedError("batch decoding is not implemented") NotImplementedError: batch decoding is not implemented