Levi Melamed

Results 31 comments of Levi Melamed

> would it be possible in theory to enable word_level timestamps through faster_whisper and patch them into the segments?

Yes. The timestamp tokens are being [filtered out during decoding](https://github.com/m-bain/whisperX/blob/78dcfaab51005aa703ee21375f81ed31bc248560/whisperx/asr.py#L69). You...

> Maybe `--chunk_size` is what you need.
>
> https://github.com/m-bain/whisperX/blob/f8cc46c6f7fa3b8509bc6aa04cdf4a62a702bb42/whisperx/transcribe.py#L44

Wouldn't this cause worse transcription results, since the context is lost with smaller chunks of audio? Seems like it would...

Instead of copying the file to the worker, you can transfer a reference to it like so: `await this.ffmpeg.mount("WORKERFS", { files: [file]}, '/some-dir');` [See here](https://emscripten.org/docs/api_reference/Filesystem-API.html#filesystem-api-workerfs)

Some interesting behavior I found while testing `scanForColors`:

```
tinycolor.scanForColors("background-position: 100% 50%;");
// finds a match: ["100"]
```

The culprit is the hex3 matcher, which looks like this once you...

A few more strings that are returning matches:

```
z-index: 100;
margin: 0 !important; // finds the word "tan"
```

I think requiring a preceding `#` makes sense. I see it as being no different than the `rgba?` which precedes rgb colors. Without it, you are prone to all sorts...

How about an optional object parameter `scanForColors(text, { css: true })` which defaults to false? Based on the parameter, CSS syntax enforcements could be toggled on/off.
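To make the proposal concrete, here is a rough sketch of how such a flag might behave. It is not tinycolor's actual implementation: the regular expressions, the three-name color list, and the idea that CSS mode also enforces word boundaries on named colors are all assumptions made for illustration.

```javascript
// Hypothetical sketch of scanForColors(text, { css }); not tinycolor's real code.
// Assumption: in CSS mode a hex color must start with "#" and a named color
// must be a whole word. The name list is shortened for illustration only.
function scanForColors(text, { css = false } = {}) {
  const hex = css
    ? /#(?:[0-9a-f]{6}|[0-9a-f]{3})\b/gi      // leading "#" required
    : /#?\b(?:[0-9a-f]{6}|[0-9a-f]{3})\b/gi;  // leading "#" optional
  const named = css
    ? /\b(?:tan|red|blue)\b/gi                // whole words only
    : /(?:tan|red|blue)/gi;                   // matches inside other words

  // Hex matches are listed first, then named matches (not document order).
  return [...(text.match(hex) || []), ...(text.match(named) || [])];
}

// Default (css: false) reproduces the false positives reported above:
scanForColors("background-position: 100% 50%;"); // ["100"]
scanForColors("margin: 0 !important;");          // ["tan"]

// With css: true the same strings produce no matches, real colors still do:
scanForColors("background-position: 100% 50%;", { css: true });  // []
scanForColors("color: #f00; border-color: tan;", { css: true }); // ["#f00", "tan"]
```

Defaulting `css` to false keeps existing callers unaffected, while callers scanning stylesheets can opt in to the stricter matching.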

Any updates on this? Experiencing the same issue on `8.2.32` on `M3 Max`. CPU validation matches the results of CUDA validation, but MPS gives much worse results.

It looks like `whispers2t` does not use the previous segment transcription as context. This is the same with `WhisperX`. Would be interesting to see WER benchmarks alongside the performance, especially...