contentvec icon indicating copy to clipboard operation
contentvec copied to clipboard

How to get a new spk2info.dict?

Open gu76h opened this issue 2 years ago • 4 comments

I want to train a new model with other dataset,but I don't find the way to get a new spk2info.dict.

gu76h avatar Aug 04 '23 03:08 gu76h

It's very simple. The lo and hi are fixed values for male and female voices respectively. The speaker embeddings can be extracted using the well-known Resemblyzer.

auspicious3000 avatar Aug 04 '23 18:08 auspicious3000

So,we just need the dict to use speaker embeddings ,lo and hi? The last value after these is useless?

gu76h avatar Aug 08 '23 12:08 gu76h

I developed this script that uses parselmouth or pyreaper. It needs some adjustments, because I'm creating the embeddings for all the files, and in the original I believe that an average per speaker is created.

create_contentvec_dict.zip

freds0 avatar Aug 11 '23 14:08 freds0

@auspicious3000 Thank you for sharing the tool Resemblyzer. Here is another question...if you don't mind: is there any convenient method to get frame-aligned pseudo label files on a new dataset ? I know that montreal forced aligner can do this, but the format of its output is quite different from the .km file...

SandroChen avatar Aug 14 '23 07:08 SandroChen