quantization topic

List quantization repositories

hicolor

193 stars, 5 forks

🎨 Convert images to 15/16-bit RGB color with dithering

rwkv.cpp

1.4k stars, 93 forks

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

gptq_for_langchain

40 stars, 9 forks

A guide to using GPTQ models with LangChain

BabyGPT

20 stars, 2 forks

Something in between Karpathy's minGPT model and video lectures, BabyGPT is an easy-to-use model on a much smaller scale (16 and 256 out channels, 5 heads, fine-tuned). To be made useful on l...

Lightweight-Low-Resource-NMT

16 stars, 3 forks

Official code for "Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Resource MT Models", to appear in WMT 2022.

autoencoder_based_image_compression

19 stars, 7 forks

Autoencoder-based image compression: can the learning be quantization-independent? https://arxiv.org/abs/1802.09371

image-optimizer

18 stars, 2 forks

Optimize any image with chroma subsampling and optimized Huffman coding in Python. Essentially, this is the JPEG algorithm.

takeoff-community

112 stars, 14 forks

TitanML Takeoff Server is an optimization, compression, and deployment platform that makes state-of-the-art machine learning models accessible to everyone.

PB-LLM

135 stars, 9 forks

PB-LLM: Partially Binarized Large Language Models

awesome-approximate-dnn

25 stars, 6 forks

Curated content for DNN approximation, acceleration ... with a focus on hardware accelerators and deployment