Jérôme Louradour

54 comments by Jérôme Louradour

Thank you! I fixed the MKL libraries folder, but the same runtime error occurs. With the installation procedure that you describe, it works like a charm.

When I tested word timestamps on a bunch of files, I saw this error happen in some corner cases: ``` File "/usr/local/lib/python3.10/site-packages/faster_whisper/transcribe.py", line 531, in add_word_timestamps alignment = self.find_alignment(tokenizer, text_tokens,...

This seems to be a duplicate of https://github.com/linto-ai/whisper-timestamped/issues/94 where you will find some suggestions. Repetitions are due to model hallucination. Which model are you using?

OK, when you say "I use the large model", you should know that there are several versions of the large model (there are now 3). So if you use `model...

Thank you @lumpidu. Can you please also give the options you used to get the transcription with the bad (too short) second segment? If I just run ``` whisper_timestamped iceland.webm...

I could see some problems with the option `--accurate`. Here is my guess: that model was finetuned with segments of less than 30 seconds only, without the prediction of the...

Another thing you could try is the regular model `--model large-v3 --language is` instead of the finetuned model. Maybe the transcription won't be as accurate in some places,...

Have you tried that with the finetuned model? `whisper_timestamped.transcribe(..., trust_whisper_timestamps=False)` Concerning text normalization, do you mean that there are digits instead of numbers written out in letters, upper case letters, and punctuation marks?...

@lumpidu, have you tried the option `trust_whisper_timestamps=False` (in Python, or `--recompute_all_timestamps True` in the CLI) with the finetuned model?
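
For reference, here is a minimal sketch of the two ways of passing that option mentioned in the comments above. The helper names `build_cli_args` and `transcribe_recomputing_timestamps` are hypothetical and only there for illustration. The model name `large-v3`, the language `is`, and the file `iceland.webm` are taken from the comments, and running on CPU is an assumption. Swap in the finetuned model as needed.

```python
def build_cli_args(audio_path, model="large-v3", language="is"):
    """Hypothetical helper: build the CLI call with timestamps recomputed."""
    return [
        "whisper_timestamped", audio_path,
        "--model", model,
        "--language", language,
        "--recompute_all_timestamps", "True",
    ]


def transcribe_recomputing_timestamps(audio_path, model="large-v3", language="is"):
    """Hypothetical helper: same thing through the Python API."""
    # Imported lazily so the CLI helper above works without the package installed.
    import whisper_timestamped as whisper

    loaded_model = whisper.load_model(model, device="cpu")  # device is an assumption
    audio = whisper.load_audio(audio_path)
    # trust_whisper_timestamps=False is the Python counterpart of
    # --recompute_all_timestamps True in the CLI.
    return whisper.transcribe(
        loaded_model,
        audio,
        language=language,
        trust_whisper_timestamps=False,
    )
```

The argument list can be passed to `subprocess.run(build_cli_args("iceland.webm"), check=True)`, or the Python helper can be called directly on the same file.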