BadGame

Results 3 issues of BadGame

Thanks for putting together such a cool project! I wonder what performance settings I can change to boost the translation speed with a higher-end GPU or even multiple GPUs. I...

enhancement

I tried to quantize the model into BF16 and FP16 to preserve a bit more precision than NF4/FP8 while still running fast on data center cards. However, when I tried...

What is the minimum vram to safely run the model in 99% of the cases? Thanks