kittsil
There are outstanding issues with this PR: 1. I have not found the definition of the 224-token context length. 2. It prepends the `initial_prompt` to itself before enough tokens...
@ryanheise Thank you for your input; it was helpful. Do you mind providing any additional feedback? --- Aside: I did find the left-slice in the code, and it turns out...
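On the left-slice and the 224 number: in whisper's decoding code, the previous-context prompt is left-sliced to `n_text_ctx // 2 - 1` tokens, which is 223 for the models' 448-token text context and is likely where a figure around 224 comes from. A minimal sketch of that slice (the token values below are stand-ins, not real token ids):

```python
# Sketch of the left-slice applied to prompt tokens in whisper's decoding:
# only the most recent n_text_ctx // 2 - 1 tokens are kept.
n_text_ctx = 448                   # text context length of the Whisper models
max_prompt = n_text_ctx // 2 - 1   # 223 tokens survive the slice

prompt_tokens = list(range(300))   # stand-in for real token ids
trimmed = prompt_tokens[-max_prompt:]

print(len(trimmed))  # 223 -- the oldest tokens are dropped, not the newest
```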
> how errors like this can be fixed?

@FurkanGozukara, that's an issue with `whisper`, not with your prompt. You can try setting `compression_ratio_threshold` lower; I have found...
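For context on what that threshold does: as I understand it, whisper scores each decoded segment by how well its text compresses with zlib, and a segment whose ratio exceeds `compression_ratio_threshold` (default 2.4) is treated as a failed, likely-repetitive decode. A self-contained sketch of that check:

```python
import zlib

def compression_ratio(text: str) -> float:
    """Length of the UTF-8 text divided by its zlib-compressed length.
    Highly repetitive output compresses well and therefore scores high."""
    text_bytes = text.encode("utf-8")
    return len(text_bytes) / len(zlib.compress(text_bytes))

# A repetitive, hallucination-style string compresses far better than
# ordinary prose, so it blows past a threshold like the default 2.4.
repetitive = "the same words " * 50
normal = "A reasonably varied sentence with no obvious repetition."

print(compression_ratio(repetitive) > compression_ratio(normal))  # True
```

Lowering the threshold makes this check stricter, rejecting segments that are only moderately repetitive.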
@drake7707 's great Dockerfile worked for me, except that: 1. I am running `python 3.10`, whereas the base image has `python 3.11`. I explicitly installed `python3.10` with `apt-get` and then...
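The workaround can be sketched as a Dockerfile fragment. Everything here is an assumption for illustration: the base image tag, and that the base image's package sources provide `python3.10` via `apt-get`; it is not copied from drake7707's Dockerfile.

```dockerfile
# Sketch only: base image tag and package availability are assumptions.
FROM python:3.11-slim

# Install python3.10 alongside the image's default python3.11 so the
# container matches a local 3.10 environment.
RUN apt-get update \
    && apt-get install -y --no-install-recommends python3.10 python3.10-venv \
    && rm -rf /var/lib/apt/lists/*

# Create the virtualenv with python3.10 explicitly, not the default python3.
RUN python3.10 -m venv /opt/venv
ENV PATH="/opt/venv/bin:$PATH"
```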
I am not sure that this is a valuable change. While not a robust benchmark, I ran an experiment on my local machine: 10x `log_mel_spectrogram()` on a...
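The experiment above can be sketched as a small timing harness. The workload below is a stand-in placeholder; in the actual experiment it was `whisper.audio.log_mel_spectrogram` applied to real audio.

```python
import time
import statistics

def time_n(fn, n=10):
    """Run fn() n times and return the per-call wall-clock durations."""
    durations = []
    for _ in range(n):
        start = time.perf_counter()
        fn()
        durations.append(time.perf_counter() - start)
    return durations

# Stand-in workload; the real experiment timed log_mel_spectrogram().
workload = lambda: sum(i * i for i in range(100_000))

runs = time_n(workload, n=10)
print(len(runs), statistics.median(runs) > 0)
```

Reporting the median rather than the mean makes the result less sensitive to one-off cache or scheduler noise.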
@take0x, I was using my GPU to transcribe.

> What is important is that the device specified in `load_model()` should be used when transcribing, rather than the actual benchmark result....
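`whisper.load_model()` does accept a `device` argument, and one way to honor it with a CPU fallback is a small helper like the following. `pick_device` is a hypothetical name, not part of whisper:

```python
def pick_device(requested=None):
    """Resolve the device string to pass to whisper.load_model(device=...).
    Hypothetical helper, not part of whisper itself."""
    if requested is not None:
        return requested  # honor an explicit request as-is
    try:
        import torch
        return "cuda" if torch.cuda.is_available() else "cpu"
    except ImportError:
        return "cpu"

# e.g. model = whisper.load_model("base", device=pick_device())
print(pick_device("cpu"))  # cpu
```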