psuu0001
psuu0001
Hi, your work is excellent. Do you have any plans to release the code?
我用自己的声音测的speech_model251_e_0_step_625000.model模型的效果不是很好,但是作者提供的在线demo的效果实测的不错。请问是有在基于speech_model251_e_0_step_625000.model继续训练吗?
> error: > batch_size, l, d = x.size() > ValueError: too many values to unpack (expected 3) > > x is the input image. > > when I tried to...
error: batch_size, l, d = x.size() ValueError: too many values to unpack (expected 3) x is the input image. when I tried to train the model, the error came up....
我想运行beamdecode.py,但是这需要安装ctcdecode。很多次安装失败后,我发现把pytorch降级成1.0的版本后,ctcdecode 0.4就安装成功了,但是再运行beamdecode.py的时候,又报错说pytorch的版本不对应。当我把pytorch升级到3.6以上时,ctcdecode又开始报错了。请问有人成功运行过ctcdecode这个包吗?
The whole dataset processed is so huge. Could you please offer a demo or test data image dir?
Could you please share other links to download the pre-trained model? cause the dropbbox seems to be unconnected.
Hi, bro, very appreciate for your work. The method to load the model can directly get the word. But I want the phoneme. Do you know how to get the...
