HeyangXue1997
HeyangXue1997
> @WelkinYang Hey! This is strange, we haven't experienced this on standard LJSpeech and Libri-TTS datasets. Probably, your datasets are a bit noisy. However, I can suggest you to try...
> I observed that in model.py "gin_channels" is provided in DiffusionGenerator. > > I would like to know if Grad-TTS supports multispeaker TTS training ? > > Can you also...
> Thank you for your greate work and share. > Could you please release some wave sampels? or Could you simplly evaluate the quality of the synthsized wavs you got,...
> @WelkinYang Have you figured it out, why samples are average before ? We found that it is the vocoder that causes poor quality of the generated audio when using...
Thank you, i have added the MIT license.
Hey, i was confused by the calculation for a long time as well. And i searched for a lot of related information. Now, I finally understand.   The first...
perhaps due to too much oov words you can check the oov_list
Now I found the reason for my alignment process with this error. It is because there is a clitic marker that is mistakenly included in the dictionary.txt. So the L.fst...
still no...
> 推理阶段的hubert+vq不是用text+参考音频的离散ssl自回归预测出来的吗?推理阶段生成出来的hubert+vq不应该就想包含参考音色吗,为什么会少音色泄露? 包含就对啊 包含的是错误的音色那叫泄露 包含的是正确的那不就是减少泄露= =