Saad Kaleem

3 comments by Saad Kaleem

Hi, I'm also looking to disable the KV cache completely, as my use case only requires generating the first token. The only workaround so far has been to set `max_tokens_in_paged_kv_cache` during...

Hi, I'm facing a similar issue with WER degradation when running *batched* transcriptions of ~20-30 second audio clips from the Common Voice 16_1 dataset (Spanish subset). WER seems to...

> > Hi, I'm facing a similar issue with WER degradation when running _batched_ transcriptions of ~20-30 second audio clips from the Common Voice 16_1 dataset. WER seems to...