ai-sdk-provider icon indicating copy to clipboard operation
ai-sdk-provider copied to clipboard

FR: openai real time API + gemini live api + universal webrtc

Open thiswillbeyourgithub opened this issue 11 months ago • 0 comments

Hi,

In the documentation I don't see clearly mentioned wether the "voice" features are supported. I'm especially interested in the Gemini Live API and OpenAI's Realtime API.

If relevant: fastrtc is a fantastic python lib to get started with live API of pretty much all providers and their mother.

So:

  1. Is it supported currently?
  2. Is it planned?
  3. If yes: any ETA?
  4. Btw a notable feature that could be nice from openrouter would be, in a similar way that just adding :web to any LLM brings us web results, using :audio:TTS:SST with TTS being openai/whisper-1 and STT being elevenlabs/some_voice might be a really great feature: openrouter would get lots of credits and would provide a unified way to access "sort of real time" API like features from many models. Also reducing GAFAM's powers.

thiswillbeyourgithub avatar May 09 '25 17:05 thiswillbeyourgithub