YangGaoBin
Oh, I see it: `x = (x[:, 0] + x[:, 1]) / 2`. Sorry to bother you. Thank you for your good work; I am new to my...
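For anyone else reading, here is a minimal sketch of what that line appears to do, assuming `x` is a NumPy-style array whose second axis holds two entries (for example, two audio channels). The function name and the sample data are hypothetical and not taken from the repo.

```python
import numpy as np

def average_first_two(x):
    """Average entries 0 and 1 along axis 1, which removes that axis."""
    x = np.asarray(x, dtype=np.float64)
    return (x[:, 0] + x[:, 1]) / 2

# Hypothetical input: 3 frames with 2 channels each.
stereo = np.array([[0.0, 1.0], [2.0, 4.0], [-1.0, 1.0]])
mono = average_first_two(stereo)
print(mono.shape)
```

When the second axis has exactly two entries, this gives the same values as `stereo.mean(axis=1)`.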
Thank you for your reply. I should learn from you, do good work, and share it with others (oh, I want to graduate O(∩_∩))!
Is there a way to use TFRecord files for this experiment? The official website says "Frame-level features are stored as tensorflow.SequenceExample protocol buffers." Can this avoid downloading the original...
> > I also encountered a similar problem. Maybe you have installed fairseq; you can use "conda list" to check. If you have installed fairseq, you can uninstall...
> > > > I also encountered a similar problem. Maybe you have installed fairseq; you can use "conda list" to check. If you have installed fairseq, you...
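As a rough alternative to reading through the "conda list" output, the check can be sketched from inside Python. This is not the same as `conda list`: it only inspects the Python environment that runs the script, and the helper name `installed_version` is made up for illustration.

```python
from importlib import metadata

def installed_version(package):
    """Return the installed version string of `package`, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None

version = installed_version("fairseq")
if version is None:
    print("fairseq is not installed in this environment")
else:
    print(f"fairseq {version} is installed")
```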
> @liyunlongaaa I have all the AudioSet files on a netdisk; do you still need them? (About 100 GB, I don't remember exactly.)

Oh, thank you very much for your help, but I...
I think this kind of animation diarization is easier to do. You can use the code in this repo to achieve it, and you can also do multi-modal speaker diarization; this...
Unfortunately, as far as I know, open-source multimodal diarization doesn't work very well right now. And I think the premise that multimodal diarization is relatively easy is still...
But I can point you to the latest multimodal diarization SOTA: https://arxiv.org/html/2401.08052v2