Feature Request: Diarization Support for Plugins
First, thanks for creating such an excellent tool!
I was wondering if there are any plans to add support for speaker diarization when using plugins (e.g., Google, Deepgram, Azure). This would be extremely helpful for distinguishing between multiple speakers during real-time audio processing.
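For illustration, something along these lines would cover the use case. This is only a sketch of what the plugin option could look like: the `diarize=True` flag and the per-word `speaker` index are hypothetical and do not exist in the plugins today.

```python
# Hypothetical sketch of how diarization could be exposed by the STT plugins.
# The `diarize=` option and the per-word `speaker` index are proposals, not
# existing API; they only illustrate the requested behavior.
from livekit.plugins import deepgram

stt = deepgram.STT(
    diarize=True,  # proposed option, forwarded to the provider's diarization setting
)

# On a final transcript event, each word (or alternative) would then carry a
# speaker index so multiple speakers can be told apart, e.g.:
#   word.speaker -> 0, 1, 2, ...
```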
Looking forward to any updates on this feature. Thanks again!
I really need this for Deepgram. It's definitely supported on their side: https://developers.deepgram.com/docs/diarization
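On Deepgram's side it's a single query parameter: with `diarize=true`, every word in the response carries a `speaker` index. A minimal sketch of a direct API call (outside the plugin) for reference, assuming a local WAV file and a `DEEPGRAM_API_KEY` environment variable:

```python
import os
import requests

# Direct call to Deepgram's prerecorded /listen endpoint with diarization enabled.
# The file name and env var are placeholders.
with open("call.wav", "rb") as f:
    resp = requests.post(
        "https://api.deepgram.com/v1/listen",
        params={"diarize": "true", "punctuate": "true"},
        headers={
            "Authorization": f"Token {os.environ['DEEPGRAM_API_KEY']}",
            "Content-Type": "audio/wav",
        },
        data=f,
    )
resp.raise_for_status()

words = resp.json()["results"]["channels"][0]["alternatives"][0]["words"]
for w in words:
    # With diarize=true, each word includes a "speaker" index (0, 1, ...).
    print(w["speaker"], w["word"], w["start"], w["end"])
```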
Hey, I think it should also support camera-based diarization, so you know not only what is being said by different people but also who is saying it.
Is it possible to add `start_time`, `end_time`, and `words` (when supported by the provider) to the agent's and the user's speech when it is committed? I had made this change in the voice pipeline agent, but it would be great to have this feature upstream in the voice agent.

Our platform displays transcripts alongside recordings, allowing users to click on a specific message to jump to that exact moment in the recording. Word-level timestamps also enable creating clips from any selected point in the transcript. Since these transcripts are used for usability research, we need timestamps at multiple points to serve as reference markers that support key research insights.

Example of the required format:
```json
[
  {
    "role": "assistant",
    "content": "Hey, how can I help you today?",
    "start": 1742134707.016703,
    "end": 1742134708.875511,
    "words": [
      { "text": "Hey,", "start": 1742134707.016781, "end": 1742134707.279758 },
      { "text": "how", "start": 1742134707.279854, "end": 1742134707.543107 },
      { "text": "can", "start": 1742134707.543137, "end": 1742134707.806495 },
      { "text": "I", "start": 1742134707.806515, "end": 1742134708.069907 },
      { "text": "help", "start": 1742134708.0699232, "end": 1742134708.332927 },
      { "text": "you", "start": 1742134708.332947, "end": 1742134708.5982761 },
      { "text": "today?", "start": 1742134708.598859, "end": 1742134708.875474 }
    ]
  }
]
```
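For context, this is roughly how we would consume it from the pipeline agent. The sketch below assumes `agent` is the `VoicePipelineAgent` instance created in the entrypoint and uses its speech-committed events; the `start`, `end`, and `words` fields on the committed message are the hypothetical additions this request is about.

```python
# Sketch only: msg.start, msg.end, and msg.words are the proposed fields and do
# not exist on the committed chat message today.
transcript: list[dict] = []

def on_speech_committed(msg, role: str) -> None:
    transcript.append({
        "role": role,
        "content": msg.content,
        "start": msg.start,  # proposed: segment start (epoch seconds)
        "end": msg.end,      # proposed: segment end (epoch seconds)
        "words": [           # proposed: word-level timings from the provider, when available
            {"text": w.text, "start": w.start, "end": w.end} for w in msg.words
        ],
    })

# `agent` is the VoicePipelineAgent instance created elsewhere in the entrypoint.
agent.on("user_speech_committed", lambda msg: on_speech_committed(msg, "user"))
agent.on("agent_speech_committed", lambda msg: on_speech_committed(msg, "assistant"))
```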