contentvec How to get a new spk2info.dict?

I want to train a new model with other dataset,but I don't find the way to get a new spk2info.dict.

Aug 04 '23 03:08 gu76h

It's very simple. The lo and hi are fixed values for male and female voices respectively. The speaker embeddings can be extracted using the well-known Resemblyzer.

Aug 04 '23 18:08 auspicious3000

So,we just need the dict to use speaker embeddings ,lo and hi? The last value after these is useless?

Aug 08 '23 12:08 gu76h

I developed this script that uses parselmouth or pyreaper. It needs some adjustments, because I'm creating the embeddings for all the files, and in the original I believe that an average per speaker is created.

create_contentvec_dict.zip

Aug 11 '23 14:08 freds0

@auspicious3000 Thank you for sharing the tool Resemblyzer. Here is another question...if you don't mind: is there any convenient method to get frame-aligned pseudo label files on a new dataset ? I know that montreal forced aligner can do this, but the format of its output is quite different from the .km file...

Aug 14 '23 07:08 SandroChen