Singing synthesis with LPCNet

Open yingfenging opened this issue 5 years ago • 2 comments

Hello, I have been looking into this vocoder recently. I trained LPCNet on a normal speech dataset and synthesized samples, and the output sounded normal. When synthesizing singing voice, however, I ran into a problem. (The following leaves the upstream acoustic model out of the picture; the vocoder is trained on real features.)

1. I trained LPCNet on female singing data and synthesized samples, but the high-frequency part of the samples is not recovered. What could be the problem? The first picture is the spectrum of the original singing; the second picture is the spectrum of the synthesized singing.

[image: original singing spectrum]
[image: synthesized singing spectrum]

Looking forward to your reply. Thank you very much.

yingfenging, Apr 20 '20 03:04

This could be related to the particular selection of Bark bands, and to the pitch quantization neglecting harmonics well above the expected range of speech. Do you see a similar lack of recovery with speech samples that contain discernible high-frequency content? It may not be limited to singing; it may be a more general bandwidth issue with how LPCNet is configured by default.

Sinnerboy89, May 29 '20 10:05
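
The pitch-range point in the comment above can be checked with a small sketch. It is not taken from the LPCNet code. The sample rate (16 kHz) and the pitch period limits (32 to 256 samples) are assumptions for illustration and should be verified against the LPCNet version in use. The helper names are hypothetical.

```python
# Rough check of which fundamental frequencies fit an integer pitch-period range.
# All constants are ASSUMPTIONS for illustration; verify them against your LPCNet checkout.
SAMPLE_RATE_HZ = 16000   # assumed sampling rate
PITCH_MIN_PERIOD = 32    # assumed shortest pitch period, in samples
PITCH_MAX_PERIOD = 256   # assumed longest pitch period, in samples


def f0_range_hz(sample_rate_hz, min_period, max_period):
    """Return (lowest, highest) F0 that a pitch period in [min_period, max_period] can express."""
    return sample_rate_hz / max_period, sample_rate_hz / min_period


def f0_is_representable(f0_hz, sample_rate_hz=SAMPLE_RATE_HZ,
                        min_period=PITCH_MIN_PERIOD, max_period=PITCH_MAX_PERIOD):
    """True if f0_hz lies inside the range covered by the pitch period limits."""
    low, high = f0_range_hz(sample_rate_hz, min_period, max_period)
    return low <= f0_hz <= high


def harmonics_up_to_nyquist(f0_hz, sample_rate_hz=SAMPLE_RATE_HZ):
    """Count the harmonics of f0_hz that lie at or below the Nyquist frequency."""
    return int((sample_rate_hz / 2) // f0_hz)


if __name__ == "__main__":
    low, high = f0_range_hz(SAMPLE_RATE_HZ, PITCH_MIN_PERIOD, PITCH_MAX_PERIOD)
    print(f"representable F0 range: {low} Hz to {high} Hz")
    for f0 in (220.0, 440.0, 880.0):  # example notes, not measured data
        print(f0, f0_is_representable(f0), harmonics_up_to_nyquist(f0))
```

Under these assumed limits, a sung note whose fundamental exceeds the upper bound cannot be described by the pitch feature at all, which is one way singing could behave differently from speech. Comparing the F0 track of the training data against this range may help narrow down the cause.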

Have you solved this problem?

zpcoftts, Jul 22 '21 07:07