quantization topic
xTuring
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our Discord community: https://discord.gg/TgHXuSJ...
AdaQP
Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training
resnet50-quantization
ResNet50 Quantization for Inference Speedup in PyTorch
sconce
Model Compression Made Easy
AI-Engineering.academy
Navigating the World of AI, One Step at a Time
qimera
Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]
IntLLaMA
IntLLaMA: A fast and light quantization solution for LLaMA
QuantEase
QuantEase, a layer-wise quantization framework, frames the problem as discrete-structured non-convex optimization. Our work leverages Coordinate Descent techniques, offering high-quality solutions wit...
auto-round
Advanced Quantization Algorithm for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"
OmniQuant
[ICLR 2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.