trungkienbkhn

Results: 64 comments of trungkienbkhn

For more information, I ran some benchmarks for faster-whisper with FlashAttention [here](https://github.com/SYSTRAN/faster-whisper/issues/598#issuecomment-2158053951).

FYI, FW does not support changing the subtitle length. Sometimes the model creates 30s segments because it doesn't detect the audio well and can't split the segments; maybe your audio has a...

@Arche151, no. You need to convert it through Ctranslate2. Example:

```
ct2-transformers-converter --model sanchit-gandhi/distil-whisper-large-v3-de-kd --output_dir distil-whisper-large-v3-de-kd-ct2 --copy_files tokenizer.json preprocessor_config.json --quantization float16
```

Then, when initializing the Whisper model, you...
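For context, a minimal sketch of loading that converted directory in faster-whisper; the path `distil-whisper-large-v3-de-kd-ct2` is just the output directory from the conversion above, and the device/compute settings are assumptions for a CUDA setup:

```python
from faster_whisper import WhisperModel

# Point faster-whisper at the CTranslate2 output directory produced above.
# device/compute_type are assumptions for a GPU setup; adjust for your hardware.
model = WhisperModel(
    "distil-whisper-large-v3-de-kd-ct2",
    device="cuda",
    compute_type="float16",
)
```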

Since your model was converted from a Distil model, you should add the option `condition_on_previous_text=False` when transcribing. For more info, see this comment: https://github.com/SYSTRAN/faster-whisper/pull/557#issuecomment-1837394755
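A hedged sketch of that transcribe call, assuming a model loaded as in the previous sketch and an illustrative local file named `audio.wav`:

```python
# condition_on_previous_text=False avoids feeding the previous segment back
# as a prompt, which converted Distil models tend to handle poorly.
segments, info = model.transcribe("audio.wav", condition_on_previous_text=False)
for segment in segments:
    print(f"[{segment.start:.2f} -> {segment.end:.2f}] {segment.text}")
```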

@Arche151, could you try again with `compute_type="default"` (or remove this option when initializing the Whisper model)?
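For illustration, the same initialization with the default compute type (model path reused from the earlier sketch):

```python
# "default" lets CTranslate2 pick a compute type supported by the device.
model = WhisperModel("distil-whisper-large-v3-de-kd-ct2", device="cuda", compute_type="default")
```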

@tuocheng0824 I just downloaded this model and everything is still working properly. Please verify your Internet connection and try again.

```
config.json: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2.39k/2.39k [00:00
```

@prem1303, hello. FW 0.10.0 is a broken tag; could you update FW and Ctranslate2 to the latest versions (FW 1.0.1 + Ctranslate2 4.2.0 + CUDA 12)?

@prem1303 Could you show your code and attach an example audio file? I will try testing them to evaluate throughput. I think you could try using smaller models or distil models...

@George0828Zhang, hello. Thanks for your idea. I created a new [PR](https://github.com/SYSTRAN/faster-whisper/pull/807) to implement this.

@George0828Zhang, I think that if you want to handle the tokenizer and preprocessor with other initialization data, you could edit the tokenizer.json and preprocessor_config.json files in your custom FW model instead...