quantization topic

List quantization repositories

KIVI

Stars: 200, Forks: 16

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

TFMQ-DM

Stars: 53, Forks: 3

[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".

KGySoft.Drawing.Tools

Stars: 18, Forks: 4

Debugger visualizers and image editor apps built on KGy SOFT Drawing Libraries

fast_yolov7_pytorch

Stars: 17, Forks: 3

Uses pruning and quantization algorithms to accelerate YOLOv7 inference.

pratical-llms

Stars: 38, Forks: 9

A collection of hands-on notebooks for LLM practitioners

BitNetMCU

Stars: 225, Forks: 20

Neural networks with low-bit weights on low-end 32-bit microcontrollers, such as the CH32V003 RISC-V microcontroller

quantizr

Stars: 16, Forks: 3

Fast library for converting RGBA images to 8-bit palette images. Written in Rust; can be used in C programs

picollm

Stars: 273, Forks: 15, Watchers: 273

On-device LLM Inference Powered by X-Bit Quantization

MI-optimize

Stars: 18, Forks: 4

mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless integration of various quantization methods and evaluation techniqu...