Joseph513shen

Results 5 issues of Joseph513shen

careray@careray:~/voice_interaction_system/whisper-edge-main$ bash whisper-edge/build.sh Sending build context to Docker daemon 3.802MB Step 1/25 : FROM dustynv/jetson-inference:r32.7.1 r32.7.1: Pulling from dustynv/jetson-inference f46992f278c2: Pull complete d0ec296fcb76: Pull complete 9e18ddc8ca7a: Pull complete 457ba495c8e5: Pull...

1.我将麦克风的流式音频转换成chunk输入模型,但是识别的效果比wav的本地音频要差很多,同时有很多误识别,请问这个是什么原因,加上一些vad会好吗? 2.还有,这个paraformer-zh是运行在gpu上面的吗?我看显存确实有相应增加?但是read me中似乎写着gpu未实现? sd.default.device = 27 # ID为27号设备 1 model = AutoModel(model="paraformer-zh-streaming") chunk_size = [0, 20, 5] encoder_chunk_look_back = 4 # number of chunks to lookback for encoder self-attention decoder_chunk_look_back...

question

careray@careray:~/voice_interaction_system/jetson-voice-master$ PYTHONPATH=$(pwd) python3.8 examples/asr.py --wav data/audio/dusty.wav Namespace(debug=False, default_backend='tensorrt', global_config=None, list_devices=False, list_models=False, log_level='info', mic=None, model='quartznet', model_dir='data/networks', model_manifest='data/networks/manifest.json', profile=False, verbose=False, wav='data/audio/dusty.wav') Traceback (most recent call last): File "examples/asr.py", line 25, in asr...