FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Results 555 FunASR issues
Sort by recently updated

Experts, my model is deployed on a GPU, and I have five audio clips to transcribe. The first two appear to use GPU inference, so why do the later clips fall back to CPU inference? My code:
```
import os
os.environ['CUDA_VISIBLE_DEVICES'] = '4'
from funasr import AutoModel


class ASRRecognize(object):
    def __init__(self):
        self.model = AutoModel(
            model="/data/fffan/0_experiment/15_ASR/1_code/0_models/0_funasr_models/speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch",
            model_revision="v2.0.4",
            vad_model="/data/fffan/0_experiment/15_ASR/1_code/0_models/0_funasr_models/speech_fsmn_vad_zh-cn-16k-common-pytorch",
            vad_model_revision="v2.0.4",
            punc_model="/data/fffan/0_experiment/15_ASR/1_code/0_models/0_funasr_models/punc_ct-transformer_zh-cn-common-vocab272727-pytorch",
            punc_model_revision="v2.0.4",
            spk_model="/data/fffan/0_experiment/15_ASR/1_code/0_models/0_funasr_models/speech_campplus_sv_zh-cn_16k-common",
            spk_model_revision="v2.0.2",
            device="cuda:0",
        )

    def _recognize(self, voice_path, hotword):
        try:...
```
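The code in this issue sets `CUDA_VISIBLE_DEVICES` to `'4'` and then requests `device="cuda:0"`. Those two settings are consistent with each other: once the variable is set, logical device indices are counted within the visible list, so `cuda:0` refers to the first listed entry. The sketch below only illustrates that index mapping for a comma-separated list of integer IDs. `resolve_visible_gpu` is a hypothetical helper, it does not query the driver, and it does not explain why later clips would run on the CPU.

```python
def resolve_visible_gpu(visible_devices: str, device: str) -> str:
    """Map a logical device string such as "cuda:0" to its CUDA_VISIBLE_DEVICES entry.

    Hypothetical helper for illustration only; assumes a comma-separated
    list of integer device IDs and a non-negative logical index.
    """
    entries = [e.strip() for e in visible_devices.split(",") if e.strip()]
    # "cuda" without an explicit index means logical device 0.
    _, _, index_text = device.partition(":")
    index = int(index_text) if index_text else 0
    if index >= len(entries):
        raise IndexError(
            f"{device} is not visible with CUDA_VISIBLE_DEVICES={visible_devices!r}"
        )
    return entries[index]


# The configuration from the issue: logical device 0 is the entry "4".
print(resolve_visible_gpu("4", "cuda:0"))
```

With this configuration, asking for `cuda:1` would fail in the sketch, because only one device is visible to the process.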

#### What is your question?
Why does the first run use the GPU, with the last progress bar being especially slow, and then, once it finishes, the next run uses the CPU? The full code is below.
#### Code
```
import os
import sys
import torch

sys.path.append(os.path.abspath(os.path.join(os.path.dirname(__file__), '..', '..')))

from app.config import CONFIG
from app.config.logger import get_logger
from app.config.json_queue import...
```

question

We use the command below to run the FunASR server. When we use the test demo to test online mode, hotwords are not recognized, so the question is...

question

Notice: In order to resolve issues more efficiently, please raise issues following the template and fill in the details.
## ❓ Questions and Help
### Before asking:
1. search the issues.
2. search the...

question

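When using the speech model speech_paraformer-large-contextual_asr_nat-zh-cn-16k-common-vocab8404, the load_pretrained_model() method defines two internal variables that are deep copies and take up to 3 GB of memory. Could these two variables be deleted once the method has finished with them, so the memory is released earlier: `del ori_state` `del src_state`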
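The request can be illustrated with a minimal sketch, assuming CPython's reference counting. It is not FunASR's `load_pretrained_model()`: `FakeTensor` and `load_pretrained_sketch` are hypothetical stand-ins, and only the variable name `src_state` is taken from the issue. The sketch shows that `del` releases a temporary deep copy as soon as its last reference is dropped, which matters mainly when more memory-heavy work follows inside the same function, since locals are released on return anyway.

```python
import copy
import gc
import weakref


class FakeTensor:
    """Stand-in for a weight tensor (hypothetical; avoids a torch dependency)."""

    def __init__(self, n_bytes):
        self.data = bytearray(n_bytes)


def load_pretrained_sketch(checkpoint):
    """Hypothetical loader: copy the checkpoint, use it, then free the copy early."""
    src_state = copy.deepcopy(checkpoint)      # temporary deep copy
    probe = weakref.ref(src_state["weight"])   # observes when the copy is freed
    n_params = len(src_state)                  # pretend to consume the copy

    del src_state                              # drop the only strong reference
    gc.collect()                               # also sweep any reference cycles

    freed_early = probe() is None              # True once the copy is gone
    # Further memory-heavy work could run here with the copy already released.
    return n_params, freed_early


checkpoint = {"weight": FakeTensor(1024), "bias": FakeTensor(16)}
n_params, freed_early = load_pretrained_sketch(checkpoint)
print(n_params, freed_early)
```

The original `checkpoint` is untouched by the `del`, because only the deep copy's name is removed. If another object still held a reference to the copied values, they would stay alive regardless of the `del`.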

bug

Image name: Ubuntu22.04-Docker26
Image type: Docker base image
Operating system: Ubuntu Server 22.04 LTS 64bit
I20250822 17:16:08.092835 113 paraformer.cpp:85] Successfully load model from /workspace/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-online-onnx/decoder_quant.onnx
I20250822 17:16:11.162286 113 paraformer.cpp:142] Successfully load model from /workspace/models/damo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-onnx/model_quant.onnx
I20250822 17:16:19.573807...

question

## Long-Audio Slowdown in FunASR GPU Inferencing (Root cause: kwargs state leaks)
### What I Observed
- First pass on a 30 min+ recording finishes quickly, but running the same...
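The issue title attributes the slowdown to kwargs state leaking between runs, but the preview is cut off before any details. As a generic illustration of that class of bug, and not FunASR's actual code, the sketch below uses two hypothetical classes: `LeakyModel` mutates a kwargs dict stored on the instance, so data from one call carries into the next, while `IsolatedModel` works on a per-call copy.

```python
import copy


class LeakyModel:
    """Hypothetical model wrapper that keeps per-call data in shared kwargs."""

    def __init__(self, **kwargs):
        self.kwargs = kwargs

    def generate(self, audio, **cfg):
        self.kwargs.update(cfg)                         # per-call options leak into shared state
        segments = self.kwargs.setdefault("segments", [])
        segments.extend(audio)                          # grows on every call
        return len(segments)


class IsolatedModel(LeakyModel):
    """Same interface, but each call works on its own copy of the kwargs."""

    def generate(self, audio, **cfg):
        call_kwargs = copy.deepcopy(self.kwargs)        # shared state stays untouched
        call_kwargs.update(cfg)
        segments = call_kwargs.setdefault("segments", [])
        segments.extend(audio)
        return len(segments)


leaky = LeakyModel(batch_size=1)
leaky_counts = [leaky.generate([0.1, 0.2, 0.3]), leaky.generate([0.1, 0.2, 0.3])]

isolated = IsolatedModel(batch_size=1)
isolated_counts = [isolated.generate([0.1, 0.2, 0.3]), isolated.generate([0.1, 0.2, 0.3])]

print(leaky_counts, isolated_counts)
```

In the leaky variant the second call processes six items instead of three, which is the kind of growth that would make a repeated run on the same recording slower than the first.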

Environment: Ubuntu Server 22.04, Python 3.11, CUDA 11.8. Running training reports the following error:
```shell
[2025-05-01 18:32:21,169][root][INFO] - Validate epoch: 1, rank: 0
[2025-05-01 18:32:21,172][root][INFO] - rank: 0, dataloader start from step: 0, batch_num: 1, after:...
```

Notice: In order to resolve issues more efficiently, please raise issues following the template and fill in the details.
Accessing it through the HTML page gives the following:
log.txt in the Docker container reports the following errors:
I20250813 09:34:13.069487 181 websocket-server.cpp:29] on_tls_init called with hdl: 0x7f2664033410
I20250813 09:34:13.069558 181 websocket-server.cpp:30]...

question