Jérôme Louradour comments

Results 54 comments of


                                            Jérôme Louradour

RuntimeError: No SGEMM backend on CPU

Thank you! I fixed the MKL librairies folder, but the same runtime error occurs. With the installation procedure that you describe, it works like a charm.

When I tested word timestamps on a bunch of file, I saw this error happening on some corner case: ``` File "/usr/local/lib/python3.10/site-packages/faster_whisper/transcribe.py", line 531, in add_word_timestamps alignment = self.find_alignment(tokenizer, text_tokens,...

Repetitive Phrase Looping

This seems to be a duplicate of https://github.com/linto-ai/whisper-timestamped/issues/94 You have some suggestion there. Repetitions are due to model hallucination. Which model are you using?

Repetitive Phrase Looping

OK, when you say "I use the large model", you have to know there are several versions of the large model (now there are 3). So if you use `model...

Bad timestamp prediction with some finetuned Whisper models

Thank you @lumpidu Can you please also give the options you use to get the transcription with the bad (too short) second segment? If I just run ``` whisper_timestamped iceland.webm...

Bad timestamp prediction with some finetuned Whisper models

I could see some problems with option --accurate And here is my guess: That model was finetuned with segments of less than 30 seconds only, without the prediction of the...

Bad timestamp prediction with some finetuned Whisper models

Also another thing you could try is with the regular model `--model large-v3 --language is` instead of the finetuned model. Maybe the transcription won't be as accurate on some places,...

Bad timestamp prediction with some finetuned Whisper models

Have you tried that with the finetuned model? `whisper_timestamped.transcribe(..., trust_whisper_timestamps=False)` Concerning text normalization, you mean that there aredigits instead of numbers written with letters, upper case letters, and punctuation marks?...

Bad timestamp prediction with some finetuned Whisper models

@lumpidu have you tried option `trust_whisper_timestamps=False` (in python, or `--recompute_all_timestamps True` in the CLI) with the finetuned model?

Jérôme Louradour

Support for BLOOM

RuntimeError: No SGEMM backend on CPU

word-level timestamps

Repetitive Phrase Looping

Repetitive Phrase Looping

Bad timestamp prediction with some finetuned Whisper models

Bad timestamp prediction with some finetuned Whisper models

Bad timestamp prediction with some finetuned Whisper models

Bad timestamp prediction with some finetuned Whisper models

Bad timestamp prediction with some finetuned Whisper models