Saad Kaleem
Saad Kaleem
Hi, I'm also looking to disable the KV cache completely as my use-case requires only the first token generation. The only work-around so far has been to set `max_tokens_in_paged_kv_cache` during...
Hi, I'm facing a similar issue with degradation of WER whilst running *batched* transcriptions of ~20-30 seconds of audios from the Common Voice 16_1 Dataset (Spanish subset). WER seems to...
> > Hi, I'm facing a similar issue with degradation of WER whilst running _batched_ transcriptions of ~20-30 seconds of audios from the Common Voice 16_1 Dataset. WER seems to...