FastSpeech2 icon indicating copy to clipboard operation
FastSpeech2 copied to clipboard

Why all the sounds I generate are noise?

Open Moonmore opened this issue 4 years ago • 6 comments

I followed all the steps in the readme to complete the data preprocessing and model training. Why is the final generated voice empty?

Moonmore avatar Jul 25 '21 05:07 Moonmore

Hi @Moonmore Please check the training voice, it should be 22050Hz, 1 channel (mono) and 16 bit-depth.

leminhnguyen avatar Jul 25 '21 10:07 leminhnguyen

Hi @Moonmore Please check the training voice, it should be 22050Hz, 1 channel (mono) and 16 bit-depth.

I refer to the aishell3 dataset used by the author. Then I again determined the composition of the training data.it is 22050khz,mono and 16 bit. I’m sorry I have a problem uploading pictures here.

Moonmore avatar Jul 26 '21 01:07 Moonmore

how do you deal this problem

lee9871 avatar Aug 06 '21 01:08 lee9871

i have same problem

lee9871 avatar Aug 06 '21 01:08 lee9871

i have same problem

Refactoring the code.

Moonmore avatar Aug 09 '21 02:08 Moonmore

can you please elaborate the solution @Moonmore?

azman-i avatar Oct 17 '21 04:10 azman-i