audio-webui icon indicating copy to clipboard operation
audio-webui copied to clipboard

[QUESTION] Speaker Diarization

Open slavakurilyak opened this issue 2 years ago • 1 comments

Speaker Diarization

Any plans to integrate a speaker diarization library, like pyannote-audio by @pyannote?

slavakurilyak avatar Jul 22 '23 21:07 slavakurilyak

Good idea! This doesn't seem like something that will take long to implement, however, outputs will be as files probably, because gradio audio elements does not support an unknown amount of outputs, and you cannot add elements in gradio after you've already launched your app. Sadly. So a file output will have to do, outputting one audio file per speaker, or one audio file per separated speaker piece. It will be an option.

gitmylo avatar Jul 22 '23 22:07 gitmylo