videodb-python icon indicating copy to clipboard operation
videodb-python copied to clipboard

I hope transcript can support configuring language and prompt

Open desenmeng opened this issue 1 year ago • 1 comments

Confirm this is a new feature request

  • [x] Possible new feature in VideoDB Python Client
  • [X] Potential new feature in VideoDB API
  • [X] I've checked the current issues, and there's no record of this feature request

Describe the feature

I hope the transcript can support more configuration capabilities

Describe the solution you'd like

context_prompt
string
Context to feed the transcription model with for possible better performance

custom_vocabulary
string[]
Specific vocabulary list to feed the transcription model with

detect_language
boolean
default: true
Detect the language from the given audio

enable_code_switching
boolean
default: false
Detect multiple languages in the given audio

language
enum<string>
Set the spoken language for the given audio

Available options: af, sq, am, ar, hy, as, az, ba, eu, be, bn, bs, br, bg, ca, zh, hr, cs, da, nl, en, at, fo, fi, fr, gl, ka, de, el, gu, ht, ha, haw, he, hi, hu, is, id, it, jp, jv, kn, kk, km, ko, lo, la, lv, ln, lt, lb, mk, mg, ms, ml, mt, mi, mr, mn, mymr, ne, no, nn, oc, ps, fa, pl, pt, pa, ro, ru, sa, sr, sn, sd, si, sk, sl, so, es, su, sw, sv, tl, tg, ta, tt, te, th, bo, tr, tk, uk, ur, uz, vi, cy, yi, yo 

Describe alternatives you've considered

No response

Additional Context

https://docs.gladia.io/api-reference/api-v2/Transcription/post-v2transcription

desenmeng avatar Mar 19 '24 16:03 desenmeng

Priority for multiple languages and additional options is raised internally in our roadmap 👍

codeAshu avatar Mar 26 '24 11:03 codeAshu