1215thebqtic
1215thebqtic
> you just need a bigger dataset... to expand word units.txt, you can try freeze all weight, but update the weight of your new units only. Hi, I use 200h...
> What's the distribution of your utterance length? mean 2.2 std 1.5 min 0.1 25% 1.2 50% 1.8 75% 2.7 99% 7.6 99.5% 8.8 99.9% 14.6 during training, utterances 20s...
> Since you have 80k hours you could probably discard everything >10s as it's less than 0.5% of your data. You might also tune the dynamic bucketing sampler settings: setting...
> https://github.com/HillZhang1999/SynGEC 在这里~ 谢谢!!我去研究一下
我也遇到了这个问题,请问是如何解决的呢?谢谢
> Yes, please refer to this tutorial to understand how to set up multiple datasets (possibly corresponding to multiple tasks) in a single training: https://colab.research.google.com/github/lhotse-speech/lhotse/blob/master/examples/03-combining-datasets.ipynb > > I specifically recommend...