I have the same issue with `\\home\\mef\\Documents\\plate_detection_project\\best.pt`: the directory doesn't change even when I change it in `helper/params.py`. Is there any further information on where I have to set up the model?
I have the same issue on an A100 80GB. If I use `--backend vllm`, VRAM usage first goes up to 15 GB and then, after a few seconds, to 70 GB. I...
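One possible explanation for the jump from 15 GB to 70 GB, offered as an assumption rather than a diagnosis: vLLM pre-allocates GPU memory for its KV cache up to a fraction of total VRAM set by `gpu_memory_utilization` (0.9 by default), so usage is expected to climb to roughly that budget once the weights are loaded, regardless of model size. A minimal sketch of the arithmetic, where the function name is hypothetical:

```python
def expected_vllm_reservation_gb(total_vram_gb: float,
                                 gpu_memory_utilization: float = 0.9) -> float:
    """Rough VRAM budget vLLM may claim (weights + KV cache), in GB.

    Assumes vLLM's default gpu_memory_utilization of 0.9; the real
    figure reported by nvidia-smi can differ by a few GB.
    """
    if not 0.0 < gpu_memory_utilization <= 1.0:
        raise ValueError("gpu_memory_utilization must be in (0, 1]")
    return total_vram_gb * gpu_memory_utilization


# An 80 GB A100 with the default fraction gives a budget near 72 GB,
# which is in the same range as the ~70 GB reported above.
budget = expected_vllm_reservation_gb(80)
print(f"Expected reservation: about {budget:.0f} GB")
```

If this is the cause, lowering the fraction should shrink the reservation. vLLM itself exposes this as `--gpu-memory-utilization`; whether the tool behind `--backend vllm` forwards that option is not confirmed here.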
I also had the same thoughts about this, and I think you are right: the Flash Attention mechanism and the use of float16 or bfloat16 with transformers are the magic...
Hello, 1. In my benchmarking tests, SDPA wasn't slower than Flash Attention for short audio files. SDPA also supports word-level timestamps. 2. The implementation of the code seems right to...