quantization topics

oreilly-pytorch-dl

45

Stars

33

Forks

45

Watchers

Code for Deep Learning for Modern AI

sinanuozdemir

bert

clip

deep-learning

diffusion

flux-fp8-api

264

Stars

37

Forks

Watchers

Flux diffusion model implementation using quantized fp8 matmul, with the remaining layers using faster half-precision accumulate, which is ~2x faster on consumer devices.

aredden

diffusion

fast-inference

flux

fp8

ao

2.6k

Stars

390

Forks

2.6k

Watchers

PyTorch native quantization and sparsity for training and inference

pytorch

brrr

cuda

dtypes

float8

Q-GaLore

164

Stars

13

Forks

Watchers

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.

VITA-Group

large-language-models

low-rank

memory-efficient-learning

quantization

RaBitQ

41

Stars

6

Forks

Watchers

[SIGMOD 2024] RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search

gaoj0017

high-dimensional-vectors

nearest-neighbor-search

quantization

coursera-mlops-specialization

18

Stars

19

Forks

Watchers

Coursera Machine Learning Engineering for Production Specialization Course

johnmoses

automl

data-modeling

deep-learning

feature-engineering

nunchaku

3.6k

Stars

212

Forks

3.6k

Watchers

[ICLR 2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

nunchaku-tech

comfyui

diffusion-models

flux

genai

ComfyUI-nunchaku

2.0k

Stars

68

Forks

Watchers

ComfyUI plugin for Nunchaku

nunchaku-tech

comfyui

diffusion

flux

genai

GERM

17

Stars

2

Forks

17

Watchers

[ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.

MAGICS-LAB

bioinformatics

cpu

dna

dna-sequences

Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

23

Stars

6

Forks

23

Watchers

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-awar...

shaheennabi

4bit-quantize

4bitprecision

anthropic-hh-golden

bitsandbytes