LPCNet icon indicating copy to clipboard operation
LPCNet copied to clipboard

post-processing tech on the generated waves

Open candlewill opened this issue 6 years ago • 12 comments

LPCNet performs very well on the features from ground-truth waves. However, when the features are predicted using an end2end model or SPSS model, there are some audible artifacts in the generated waves.

Anyone knows If there are some post-processing tech on the synthesized waves to reduce the artifacts.

image

Some synthesized samples: e2e_lpcnet_samples_share.zip

candlewill avatar Feb 26 '19 09:02 candlewill

For example, from Baidu TTS online demo, it can be found from the spectrogram the audio is enhanced apparently by a post-processing tech to reduce the noise.

image

Baidu-TTS-sample.zip

candlewill avatar Feb 26 '19 09:02 candlewill

@candlewill What about dataset size of "e2e_lpcnet_samples_share.zip". It sounds well. My e2e_demo have some noise, and loss about 3.35 for default parameters.

hyzhan avatar Feb 27 '19 02:02 hyzhan

@candlewill Do you use tts with LPCNet ?

OswaldoBornemann avatar Mar 02 '19 07:03 OswaldoBornemann

yes

candlewill avatar Mar 03 '19 08:03 candlewill

@candlewill Does LPCNet perform better than Griffin Lim on audio sound quality and audio generation speed ?

OswaldoBornemann avatar Mar 03 '19 10:03 OswaldoBornemann

@tsungruihon I think the LPCNet (train on your own dataset) is better than Griffin Lim both on quality and speed, especially when some post-processing algorithm could be applied.

candlewill avatar Mar 03 '19 15:03 candlewill

@candlewill I think the synthesized sample you shared is very good actually. Probably just a little bit over fitting for LPCNet. By the way, I have the problem of training a stable LPCNet for TTS, I found LPCNet hard to converge, any suggestions ? Thanks

bearlu007 avatar Mar 07 '19 19:03 bearlu007

@candlewill have you ever tried LPCNet with 22050kHz ? thanks

OswaldoBornemann avatar Mar 25 '19 09:03 OswaldoBornemann

@candlewill have you tried any post processing techs to import sound quality?

superhg2012 avatar Jun 06 '19 08:06 superhg2012

有没有什么好的后处理算法技术推荐的呢 @candlewill ,我用LPCnet的训练的时候总是有一些背景音。

WhiteFu avatar Jul 11 '19 06:07 WhiteFu

有没有什么好的后处理算法技术推荐的呢 @candlewill ,我用LPCnet的训练的时候总是有一些背景音。

any progress?

superhg2012 avatar Jul 20 '19 06:07 superhg2012

@superhg2012 您好,麻烦问一下,合成时会有一些噪音,您有解决办法吗?

chengshaodi avatar Sep 16 '21 02:09 chengshaodi