hdmjdp comments

Results: 29 comments of hdmjdp

The model cannot converge

> What is "warm up iters"? I cannot find it in the config. By the way, I have the same problem: the loss does not go down, as shown below:...

New TTS Model request

Can you share your code? > I suggest 1 @rishikksh20

New TTS Model request

> detached hidden state @rishikksh20 Does this refer to the text encoder output?

New TTS Model request

@rishikksh20 After 100k, are the params of the prosody extractor still updated, or are they frozen?

about final loss?

> @MorganCZY Train the model on a multi-speaker corpus and it will generalize automatically. You can just listen to the audio samples released by the authors; the results in the...

about final loss?

> > > @MorganCZY Train the model on a multi-speaker corpus and it will generalize automatically. You can just listen to the audio samples released by the authors; the results...

Results get worse when I use ground-truth duration.

> Dear author, thank you for your contribution to TTS; this is a big step for E2E TTS. But when I use ground-truth durations, aiming to train faster and...

Will a PyTorch version be released?

@zepingyu0512 Looking for a PyTorch version too!

How is the quality of this net?

> Yes, better than HiFi-GAN with less training. But in my experiments, the stftnet has more machine noise in the audio. Did you notice it too? Also, in training, there...

How many iters did you train before you could generate a normal voice?

I think this framework is similar to Tacotron 1, because you use a Griffin-Lim decoder rather than a WaveNet decoder. So is the TTS audio from your implementation better than that of Tacotron 1?