ParallelWaveGAN

ParallelWaveGAN copied to clipboard

Reame
Issues

discrete token for audio resynthesis

Open South-Twilight opened this issue 2 years ago • 0 comments

Here is the PR for audio resynthesis in discrete token:

We extend hubert_voc1 to token_voc1 and it can handle more models token;
We add f0 for training and inference when finding poor prounciation in singing;
We add multi-stream method including residual cluster and weight sum;
Using embedding feature of models is also allowed.

The following models have been validated in opencpop recipe: HuBERT, XLS-R, WavLM, MERT, Encodec.

Feb 02 '24 13:02 South-Twilight