Levi Melamed

Chicago

Results 31 comments of


                                            Levi Melamed

OG whisper word level timestamps support

> would it be possible in theory to enable word_level timestamps through faster_whisper and patch them into the segments? Yes. The timestamp tokens are being [filtered out during decoding](https://github.com/m-bain/whisperX/blob/78dcfaab51005aa703ee21375f81ed31bc248560/whisperx/asr.py#L69). You...

Limit the length of my subtitles

> Maybe `--chunk_size` is what you need. > > https://github.com/m-bain/whisperX/blob/f8cc46c6f7fa3b8509bc6aa04cdf4a62a702bb42/whisperx/transcribe.py#L44 Wouldn't this cause worse transcription results, since the context is lost with smaller chunks of audio? Seems like it would...

429 Quota exceeded for quota metric 'Generate Content API requests per minute' and limit 'GenerateContent request limit per minute for a region' of service 'generativelanguage.googleapis.com' for consumer 'project_number:************'. [reason: "RATE_LIMIT_EXCEEDED"

I'm getting that error when I attempt to call `count_tokens()` with a large string.

Memory error

Instead of copying the file to the worker, you can transfer a reference to it like so: `await this.ffmpeg.mount("WORKERFS", { files: [file]}, '/some-dir');` [See here](https://emscripten.org/docs/api_reference/Filesystem-API.html#filesystem-api-workerfs)

Add a `hasColor` function

Some interesting behavior I found while testing scanForColors: ``` tinycolor.scanForColors("background-position: 100% 50%;"); // finds a match: ["100"] ``` The culprit is the hex3 matcher, which looks like this once you...

Add a `hasColor` function

A few more strings that are returning matches: ``` z-index: 100; margin: 0 !important; // finds the word "tan" ```

Add a `hasColor` function

I think requiring a preceding `#` makes sense. I see it as being no different than the `rgba?` which precedes rgb colors. Without it, you are prone to all sorts...

Add a `hasColor` function

How about an optional object parameter `scanForColors(text, { css: true })` which defaults to false? Based on the parameter, CSS syntax enforcements could be toggled on/off.

MPS prediction bug for segmentation + detection

Any updates on this? Experiencing the same issue on `8.2.32` on `M3 Max`. CPU validation matches the results of Cuda validation, MPS gives much worse results.

Benchmarks for whisperx, faster-whisper, and whispers2t!

It looks like `whispers2t` does not use the previous segment transcription as context. This is the same with `WhisperX`. Would be interesting to see WER benchmarks alongside the performance, especially...

‹
1
2
3
4
›