Retrieval-based-Voice-Conversion-WebUI icon indicating copy to clipboard operation
Retrieval-based-Voice-Conversion-WebUI copied to clipboard

African Language Phonemes

Open paulerbear opened this issue 1 year ago • 3 comments

Hi!

I'm trying to train RVC models for a range of dialects in my country. I live in South Africa.

I'm struggling with click-consonant phonemes.

These African language sounds are not understood as phonemic by the model, so transforming voice-to-voice using recordings of African language speakers results in dropped phonemes or incorrect phonemes.

Click consonants are not reliably preserved in the output and are even occasionally replaced by other consonants ("c", "ck", "k").

Is there anyway for me to overcome this roadblock? Do I need to include more of the click-consonants in my training data?

Click Consonant Transformation.zip

paulerbear avatar May 14 '24 10:05 paulerbear

@RVC-Boss

fumiama avatar May 26 '24 06:05 fumiama

try training the model without pretraineds and adding more data

blaisewf avatar May 26 '24 20:05 blaisewf

Add more examples of click consonants in your dataset and modify the tokenizer to treat click consonants as distinct units.

arbazw avatar Jun 21 '24 19:06 arbazw