quantization topic

List quantization repositories

nlcli-wizard

Stars: 20 · Forks: 2 · Watchers: 20

Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)

GPTQModel

Stars: 902 · Forks: 130 · Watchers: 902

LLM quantization (compression) toolkit with hardware acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU, and Intel/AMD/Apple CPUs via HF, vLLM, and SGLang.

LLM-visuals

Stars: 19 · Forks: 3 · Watchers: 19

Over 60 figures and diagrams of LLMs, quantization, low-rank adapters (LoRA), and chat templates FREE TO USE in your blog posts, slides, presentations, or papers.

audiocodecs

Stars: 34 · Forks: 3 · Watchers: 34

A collection of audio codecs with a standardized API

LLM-Quantization

Stars: 52 · Forks: 6 · Watchers: 52

Notes and summaries on quantizing LLMs.

discrete-wavlm-codec

Stars: 24 · Forks: 3 · Watchers: 24

A neural speech codec based on discrete WavLM representations

Smart-Traffic-Monitoring-System

Stars: 24 · Forks: 7 · Watchers: 24

An intelligent traffic monitoring system that collects traffic flow and metrics including average speed and vehicle counts for each road. Features real-time data visualization with interactive dashboa...

QKV-Core

Stars: 16 · Forks: 0 · Watchers: 16

Adaptive Hybrid Quantization Framework for deploying 7B+ LLMs on low-VRAM devices (e.g., GTX 1050). Features surgical block alignment and Numba-accelerated inference.