FreeVC icon indicating copy to clipboard operation
FreeVC copied to clipboard

poor performance on seen-to-unseen task while finetuning on Hindi language

Open rgenai opened this issue 2 years ago • 2 comments

Hello! I'm delighted to come across this remarkable project, and thanks for sharing it as an open-source project. Currently, my focus lies on fine-tuning the freevc-s model using pretrained checkpoints as the foundation, specifically on a Hindi dataset. While I've achieved impressive results in seen-to-seen and unseen-to-seen tasks, with a remarkable 95% match, I'm eager to enhance the performance in the seen-to-unseen task. Presently, I'm encountering a moderate 60% match when working with the reference speaker for unseen-to-unseen and seen-to-unseen tasks. I would greatly appreciate any insights or suggestions you have to improve these results further.

rgenai avatar May 16 '23 17:05 rgenai

Hi @MuruganR96 , how did you train with another language ? Did you train wavlm ?

EmreOzkose avatar Aug 09 '23 10:08 EmreOzkose

Hello! I'm delighted to come across this remarkable project, and thanks for sharing it as an open-source project. Currently, my focus lies on fine-tuning the freevc-s model using pretrained checkpoints as the foundation, specifically on a Hindi dataset. While I've achieved impressive results in seen-to-seen and unseen-to-seen tasks, with a remarkable 95% match, I'm eager to enhance the performance in the seen-to-unseen task. Presently, I'm encountering a moderate 60% match when working with the reference speaker for unseen-to-unseen and seen-to-unseen tasks. I would greatly appreciate any insights or suggestions you have to improve these results further.

Hi @MuruganR96 , I want to do what you did and fine-tune FreeVC on a non-English dataset. Your results of 95% match on seen-to-seen would be perfect for my use case. Can you please provide guidance or share your code?

mm3509 avatar Dec 03 '23 18:12 mm3509