quantization topic

List quantization repositories

mmrazor

1.4k
Stars
218
Forks
Watchers

OpenMMLab Model Compression Toolbox and Benchmark.

Easy-Translate

181
Stars
287
Forks
Watchers

Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible for beginners and as seamlesscustomizable and as possible for a...

faster-whisper

20.1k
Stars
1.7k
Forks
20.1k
Watchers

Faster Whisper transcription with CTranslate2

Chinese-LLaMA-Alpaca

18.2k
Stars
1.9k
Forks
Watchers

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Qbot

7.0k
Stars
939
Forks
Watchers

[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Cha...

AutoGPTQ

4.3k
Stars
457
Forks
Watchers

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

LLaMA-Factory

62.8k
Stars
7.6k
Forks
62.8k
Watchers

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

minigpt4.cpp

556
Stars
28
Forks
Watchers

Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)

TinyChatEngine

569
Stars
55
Forks
Watchers

TinyChatEngine: On-Device LLM Inference Library

gpu_poor

781
Stars
38
Forks
Watchers

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization