Desh Raj
I haven't tried it on the Task 8 data, since I was only focusing on the biomedical domain at the time. However, I'm pretty sure it wouldn't be as low as...
One thing I noticed in your code is that you are using randomly initialized word embeddings. The number of parameters in the CRNN is already somewhat larger than in the CNN, and...
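For what it's worth, here's a minimal PyTorch sketch of the swap I mean (the `pretrained_vectors` matrix and the sizes are placeholders, not taken from your code):

```python
import torch
import torch.nn as nn

# Placeholder for real pretrained vectors (e.g. GloVe/word2vec),
# shape (vocab_size, emb_dim); here just random numbers for illustration.
pretrained_vectors = torch.randn(10000, 300)

# What the code currently does: random initialization, learned from scratch.
emb_random = nn.Embedding(num_embeddings=10000, embedding_dim=300)

# Initializing from pretrained vectors instead. freeze=False fine-tunes them
# during training; freeze=True keeps them fixed.
emb_pretrained = nn.Embedding.from_pretrained(pretrained_vectors, freeze=False)
```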
@GatsbyUSTC I agree with you. If we are using pretrained embeddings and not tuning them during training, attention at the input layer would be meaningless. However, if embeddings are...
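To make that concrete, here's a toy sketch of what I mean by attention at the input layer (the module name and shapes are my own, for illustration only):

```python
import torch
import torch.nn as nn

class InputAttention(nn.Module):
    """Toy additive attention over input word embeddings: each token gets a
    scalar score, softmax-normalized over the sequence, and its embedding is
    scaled by that weight before being fed to the CNN/RNN."""
    def __init__(self, emb_dim):
        super().__init__()
        self.scorer = nn.Linear(emb_dim, 1, bias=False)

    def forward(self, emb):                               # (batch, seq, emb_dim)
        weights = torch.softmax(self.scorer(emb), dim=1)  # (batch, seq, 1)
        return emb * weights
```

If the embeddings fed into this are frozen pretrained vectors, the scorer can only reweight fixed general-purpose representations; if they are trainable, the embeddings can co-adapt with the attention.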
Related issue: https://github.com/kaldi-asr/kaldi/issues/4468
It's hard to say what may be going wrong just based on WER. Did you change any hyperparameters? Did you look at the intermediate results (e.g. from GMM decoding) to...
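As a quick check, something like this (a rough sketch; the `exp/*/decode*` layout and `wer_*` file names follow the usual Kaldi egs conventions, so adjust to your setup) prints the best WER from each decode directory, which helps localize a regression to a particular stage:

```python
import glob
import re

# Scan each decode directory for scoring outputs (the wer_* files) and
# report the best %WER found in each, so a regression can be traced to
# a specific stage (mono/tri/chain).
for decode_dir in sorted(glob.glob("exp/*/decode*")):
    wers = []
    for wer_file in glob.glob(f"{decode_dir}/wer_*"):
        with open(wer_file) as f:
            for line in f:
                m = re.search(r"%WER\s+(\d+(?:\.\d+)?)", line)
                if m:
                    wers.append(float(m.group(1)))
    if wers:
        print(f"{decode_dir}: best %WER = {min(wers):.2f}")
```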
I think so, yeah. The WER without RNNLM rescoring should be closer to 46%. See this line at the top of run_cnn_tdnn_1b.sh: %WER 46.07 [ 27124 / 58881, 2905 ins,...
You can try tuning some of the hyperparameters (especially the learning rate), since you changed the number of training jobs (GPUs). But I think at this point you're close enough that...
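For reference, if I remember the nnet3 convention correctly, the learning rate actually applied is the "effective" learning rate scaled by the current number of jobs:

$$\text{lr}_{\text{actual}} = \text{num\_jobs} \times \text{lr}_{\text{effective}}$$

so after changing the number of GPUs, you'd adjust `--trainer.optimization.initial-effective-lrate` and `--trainer.optimization.final-effective-lrate` in the opposite direction (e.g., halving the jobs suggests roughly doubling the effective rates to keep a comparable step size).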
Sorry, I don't think I have the eval numbers for that exact recipe on hand. We tried several systems during the challenge (see Table 7 in https://arxiv.org/pdf/2006.07898.pdf) and it seems...
The unperturbed cleaned data has about 300k utterances (900k for the speed-perturbed version). So 60k is one-fifth of that, hence about 200 hours. But yeah, the comment should make that clearer.
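Spelling out the arithmetic (assuming the full cleaned set is on the order of 1000 hours, which the quoted figures imply):

$$\frac{60\text{k}}{300\text{k}} = \frac{1}{5}, \qquad \frac{1000\ \text{h}}{5} = 200\ \text{h}$$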
Ok, I'll check.