quantization topic
nlcli-wizard
Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)
GPTQModel
LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
LLM-visuals
Over 60 figures and diagrams of LLMs, quantization, low-rank adapters (LoRA), and chat templates FREE TO USE in your blog posts, slides, presentations, or papers.
audiocodecs
A collections of audio codecs with a standardized API
discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
Smart-Traffic-Monitoring-System
An intelligent traffic monitoring system that collects traffic flow and metrics including average speed and vehicle counts for each road. Features real-time data visualization with interactive dashboa...