FastSpeech2 icon indicating copy to clipboard operation
FastSpeech2 copied to clipboard

how to align Mandarin dataset by mfa?

Open yileld opened this issue 4 years ago • 5 comments

the Mandarin lexicon.txt in your project seems different from mfa pretrained model and it will failed to align,saying "There were phones in the dictionary that do not have acoustic models" image

yileld avatar May 10 '21 06:05 yileld

@yileld If you want to use your other lexicon, just put it in lexicon/, and change the symbols in text/pinyin.py.

ming024 avatar May 11 '21 14:05 ming024

@yileld If you want to use your other lexicon, just put it in lexicon/, and change the symbols in text/pinyin.py.

nonono I didnt mean to change lexicon,I just want to know how you get Mandarin textgrid by your lexicon because it seems mismatch mfa pretrained model,do you use "mfa train xxx" command?

yileld avatar May 11 '21 15:05 yileld

@yileld yeah I trained the MFA model from scratch since I wish to handle the er-hua (ㄦ話) in the Chinese language carefully, while this feature is not provided in the pretrained MFA model.

ming024 avatar May 26 '21 07:05 ming024

@yileld yeah I trained the MFA model from scratch since I wish to handle the er-hua (ㄦ話) in the Chinese language carefully, while this feature is not provided in the pretrained MFA model.

Do you have any plan to share MFA model with you trained to handle the er-hua~

hertz-pj avatar Jul 29 '21 06:07 hertz-pj

@yileld yeah I trained the MFA model from scratch since I wish to handle the er-hua (ㄦ話) in the Chinese language carefully, while this feature is not provided in the pretrained MFA model.

Do you have any plan to share MFA model with you trained to handle the er-hua~

@ming024 Could you share the mfa acoustic model of AISHELL3,thank you very much!

yuxiazff avatar Jun 02 '23 09:06 yuxiazff