quantization topic

List quantization repositories

tensorflow-quantization-example

18
Stars
5
Forks
Watchers

TensorFlow Quantization Example, for TensorFlow Lite

instruct-finetune-mistral

31
Stars
6
Forks
Watchers

Fine-tune Mistral 7B to generate fashion style suggestions

QLLM

33
Stars
2
Forks
Watchers

[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"

TBN

16
Stars
3
Forks
Watchers

TBNv2: Convolutional Neural Network With Ternary Inputs and Binary Weights

CoarseHash

15
Stars
1
Forks
Watchers

Benchmark datasets used in ICRA 2020 paper: Fast, Compact and Highly Scalable Visual Place Recognition through Sequence-based Matching of Overloaded Representations

XNOR-popcount-GEMM-PyTorch-CPU-CUDA

17
Stars
1
Forks
Watchers

A PyTorch implemenation of real XNOR-popcount (1-bit op) GEMM Linear PyTorch extension support both CPU and CUDA

torch-bnb-fp4

28
Stars
1
Forks
Watchers

Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops