bark icon indicating copy to clipboard operation
bark copied to clipboard

quantization

Open ZechenM opened this issue 2 years ago • 2 comments

Is there a way to quantize bark? That way the model would be even smaller and faster to load and hence takes less space on GPU and results in lower latency. Thanks!

ZechenM avatar Jun 07 '23 22:06 ZechenM

Related to #30 and this

thiswillbeyourgithub avatar Nov 02 '23 17:11 thiswillbeyourgithub

You can find working quantized Bark examples in https://github.com/PABannier/bark.cpp and OpenVino

JonathanFly avatar Nov 17 '23 15:11 JonathanFly