quantization topic
tensorflow-quantization-example
TensorFlow Quantization Example, for TensorFlow Lite
instruct-finetune-mistral
Fine-tune Mistral 7B to generate fashion style suggestions
QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"
TBN
TBNv2: Convolutional Neural Network With Ternary Inputs and Binary Weights
quantization-notes
Notes on quantization in neural networks
awesome-compression
模型压缩的小白入门教程
CoarseHash
Benchmark datasets used in ICRA 2020 paper: Fast, Compact and Highly Scalable Visual Place Recognition through Sequence-based Matching of Overloaded Representations
XNOR-popcount-GEMM-PyTorch-CPU-CUDA
A PyTorch implemenation of real XNOR-popcount (1-bit op) GEMM Linear PyTorch extension support both CPU and CUDA
communication-system-simulation
Communication system in Matlab ☎️
torch-bnb-fp4
Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops