dutchsing009
dutchsing009
oh yeah of course but as you see above i only tried the x3 one which supposedly should do from 16khz to 48khz , give me an email to send...
> @dutchsing009 [[email protected]](mailto:[email protected]) ok i did send the email , you should have it by now :)
Don't stress , it is not really useful , Actually it doesn't do the results you see in the demo at all , as the authors never released the original...
@junjun3518 Hi , No problems at all , a late response is better than no response :) ,, I saw and read the new amazing paper which is called Nu-Wave...
Oh wow ! , that's indeed way more better and smoother than the other one , so what is missing now for the implementation to be fully done ? can...
1- Does this variance temp solution link work for English or French datasets ? Ok Thanks , So if I understand this correctly , if I have ```ph_seq``` ```ph_dur``` ```ph_num```...
does this help ? https://github.com/colstone/ENG_dur_num
Thank you so much , i will try your code and let you know . but are there any suggestions for multi-modal speaker diarization? Like what's the best repo in...
It is ok thanks for all these info , btw i talked to the author of this https://arxiv.org/pdf/2312.05730.pdf and he said the most similar one to it is this https://github.com/showlab/AVA-AVD....