FastSpeech2 icon indicating copy to clipboard operation
FastSpeech2 copied to clipboard

Wavelet Transform and Inverse for F0

Open rafaelvalle opened this issue 4 years ago • 2 comments

Thank you for making this repo.

I've attached a jupyter notebook with an implementation of the wavelet transform and inverse for F0 based on the implementation used in the FastSpeech2 paper.

It would be great if you could add to your repo the capability of training FastSpeech2 with the Wavelet Transformed F0 like they describe in the paper.

pitch_cwt.zip

rafaelvalle avatar Jan 05 '22 10:01 rafaelvalle

Hi,

The reference website in the comment of your notebook is no longer available (should be https://www.isca-speech.org/archive_v0/ssw8/papers/ssw8_285.pdf), do you know where I can find it?

PussyCat0700 avatar Dec 19 '23 04:12 PussyCat0700

Hi,

The reference website in the comment of your notebook is no longer available (should be https://www.isca-speech.org/archive_v0/ssw8/papers/ssw8_285.pdf), do you know where I can find it?

I later found that this paper should be Wavelets for intonation modeling in HMM speech synthesis by Suni et al. cited in appendix C.1 of FastSpeech 2 paper. Sorry for disturbing!

PussyCat0700 avatar Dec 19 '23 11:12 PussyCat0700