h770347
**wget https://phon.ioc.ee/~tanela/tedlium_nnet_ms_sp_online.tgz** converted 'https://phon.ioc.ee/~tanela/tedlium_nnet_ms_sp_online.tgz' (ANSI_X3.4-1968) -> 'https://phon.ioc.ee/~tanela/tedlium_nnet_ms_sp_online.tgz' (UTF-8) --2022-01-05 19:37:12-- https://phon.ioc.ee/~tanela/tedlium_nnet_ms_sp_online.tgz Resolving phon.ioc.ee (phon.ioc.ee)... 193.40.251.126 Connecting to phon.ioc.ee (phon.ioc.ee)|193.40.251.126|:443... failed: Connection refused.
C:\Users\SA>docker run --memory="8g" -it -p 8080:80 -v /media/kaldi_models:/opt/models ca9ada4fab7f /bin/bash root@b4eabf42047c:/opt# /opt/start.sh -y /opt/models/nnet2.yaml python main.py -u ws://localhost:8080/client/ws/speech -r 8192 .\test_data_bill_gates-TED.mp3 Received error from server (status 9) Error message: No...
sudo python3.9 data_preparation.py [sudo] password for osboxes: Sorry, try again. [sudo] password for osboxes: Copying Train dataset: 0it [00:00, ?it/s] Copying Train dataset: 0it [00:00, ?it/s] Traceback (most recent call...
Following the example at https://github.com/KYJun/kaldi_tutorial, I have finished training a new model. How can I use it for speech recognition? I have test.wav; how do I use my...
Following the example at **https://github.com/golbin/kaldi_tutorial**, I have finished training a new model. How can I use it for speech recognition? I have test.wav; what is the code...
Can I use it to train a new language, such as Arabic?
I have voice recordings of the names of the cities in my country. I want to build a dataset and turn it into a speech recognition model for my country's cities. I...
~/apiai/asr-server$ make make -C src make[1]: Entering directory '/home/osboxes/apiai/asr-server/src' ar -cr libstidecoder.a Timing.o Response.o RequestRawReader.o ResponseJsonWriter.o ResponseMultipartJsonWriter.o OnlineDecoder.o Nnet3LatgenFasterDecoder.o QueryStringParser.o FcgiDecodingApp.o ranlib libstidecoder.a g++ -shared -o liblibstidecoder.so -Wl,--no-undefined -Wl,--as-needed -Wl,-soname=liblibstidecoder.so,--whole-archive...
import fairseq
import torch
import torchaudio
import matplotlib.pyplot as plt
from scipy.spatial.distance import cdist
wav2vec2_checkpoint_path = "xlsr_53_56k.pt"
checkpoint = torch.load(wav2vec2_checkpoint_path)
wav2vec2_encoder = fairseq.models.wav2vec.Wav2Vec2Model.build_model(checkpoint['cfg']['model'])
wav2vec2_encoder.load_state_dict(checkpoint['model'])
q_dat, q_sr = torchaudio.load("hello.wav")
r_dat, r_sr...
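The snippet above is cut off, so what it does with the query (`q_dat`) and reference (`r_dat`) recordings is not shown. The `cdist` import hints at a frame-by-frame distance comparison, but that is an assumption. As a rough sketch of that kind of comparison, assuming each recording has already been turned into a list of non-zero feature vectors, the following computes a pairwise cosine-distance matrix in plain Python. The function name `pairwise_cosine_distance` and the toy vectors are hypothetical stand-ins for real encoder outputs. On arrays, `scipy.spatial.distance.cdist(q, r, metric="cosine")` computes the same quantity.

```python
import math


def pairwise_cosine_distance(query, reference):
    """Return a len(query) x len(reference) matrix of cosine distances.

    Hypothetical helper for illustration; assumes every vector is non-zero.
    """
    def norm(v):
        return math.sqrt(sum(x * x for x in v))

    matrix = []
    for q in query:
        row = []
        for r in reference:
            dot = sum(a * b for a, b in zip(q, r))
            # Cosine distance = 1 - cosine similarity.
            row.append(1.0 - dot / (norm(q) * norm(r)))
        matrix.append(row)
    return matrix


# Toy 2-dimensional "features" (hypothetical; real encoder outputs are far larger).
query_feats = [[1.0, 0.0], [0.0, 1.0]]
reference_feats = [[1.0, 0.0], [1.0, 1.0], [0.0, 2.0]]

dist = pairwise_cosine_distance(query_feats, reference_feats)
for row in dist:
    print([round(d, 3) for d in row])
```

Each row corresponds to one query frame and each column to one reference frame. Small values mark frames that point in nearly the same direction in feature space.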