quantization topic

List quantization repositories

oreilly-pytorch-dl

45
Stars
33
Forks
45
Watchers

Code for Deep Learning for Modern AI

flux-fp8-api

264
Stars
37
Forks
Watchers

Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.

ao

2.6k
Stars
390
Forks
2.6k
Watchers

PyTorch native quantization and sparsity for training and inference

Q-GaLore

164
Stars
13
Forks
Watchers

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.

RaBitQ

41
Stars
6
Forks
Watchers

[SIGMOD 2024] RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search

coursera-mlops-specialization

18
Stars
19
Forks
Watchers

Coursera Machine Learning Engineering for Production Specialization Course

nunchaku

3.6k
Stars
212
Forks
3.6k
Watchers

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

ComfyUI-nunchaku

2.0k
Stars
68
Forks
Watchers

ComfyUI Plugin of Nunchaku

GERM

17
Stars
2
Forks
17
Watchers

[ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-awar...