xaiguy
xaiguy
Thanks for the quick reply! Sure, the link to Tuda-de is the following: ltdata1.informatik.uni-hamburg.de/kaldi_tuda_de/german-speechdata-package-v2.tar.gz I also thought it might be too small. I then combined it with the M-AILABS corpus...
@borisgin Yes, I have. From what I've read and heard the last weeks, it seems that other people are facing similar problems with End-to-end ASR for German. It may indeed...
Short update: I trained Jasper for a little more than 200 epochs with ~600 hrs of German audio data. At around 960k steps I reached a 38% WER without and...
@csuestc It does for me after removing nltk==3.5, downloading the rich library from the above link and removing the version from the huggingface-hub requirement.
@gianfrancodemarco On custom data? As far as I'm aware, the inference scripts only support inference on ground truth data (=evaluation). For "real" inference, T5.generate() is needed which currently only supports...
@gianfrancodemarco Thanks, that sounds a lot simpler than what I was trying to do! Were you able to confirm that it's working as intended? For example by comparing results with...