quantization topic

List quantization repositories

xTuring

2.6k
Stars
206
Forks
Watchers

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJ...

AdaQP

17
Stars
0
Forks
Watchers

Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training

resnet50-quantization

17
Stars
2
Forks
Watchers

Resnet50 Quantization for Inference Speedup in PyTorch

sconce

33
Stars
2
Forks
Watchers

Model Compression Made Easy

AI-Engineering.academy

157
Stars
37
Forks
Watchers

Navigating the World of AI, One Step at a Time

qimera

30
Stars
5
Forks
Watchers

Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]

IntLLaMA

18
Stars
0
Forks
Watchers

IntLLaMA: A fast and light quantization solution for LLaMA

QuantEase

17
Stars
1
Forks
Watchers

QuantEase, a layer-wise quantization framework, frames the problem as discrete-structured non-convex optimization. Our work leverages Coordinate Descent techniques, offering high-quality solutions wit...

auto-round

222
Stars
19
Forks
Watchers

Advanced Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"

OmniQuant

595
Stars
45
Forks
Watchers

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.