quantization topic

List quantization repositories

FedPAQ-MNIST-implemenation

23
Stars
5
Forks
23
Watchers

An implementation of FedPAQ under different experimental parameters. It examines variations of r (number of clients selected), t (local epochs), and s (quantizer levels).
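
To make the three parameters concrete, here is a minimal sketch of one FedPAQ-style round in plain Python. It is not taken from the repository: the function names (`quantize`, `fedpaq_round`), the `local_step` callback, and the choice of an unbiased stochastic quantizer with s levels are all assumptions made for illustration.

```python
import math
import random


def quantize(vec, s, rng=random):
    """Stochastically round each |x| / ||vec|| to one of s uniform levels.

    Hypothetical helper: an unbiased low-precision quantizer, assumed here
    as the meaning of "quantizer levels".
    """
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return [0.0 for _ in vec]
    out = []
    for x in vec:
        scaled = abs(x) / norm * s           # position in [0, s]
        lower = math.floor(scaled)
        # Round up with probability equal to the fractional part.
        level = lower + (1 if rng.random() < scaled - lower else 0)
        out.append(math.copysign(norm * level / s, x))
    return out


def fedpaq_round(global_model, clients, r, t, s, local_step, rng=random):
    """One round: sample r clients, run t local steps, average quantized deltas.

    `local_step(model, client)` is a hypothetical callback that returns the
    model after one local update on that client's data.
    """
    chosen = rng.sample(clients, r)
    total = [0.0] * len(global_model)
    for client in chosen:
        local = list(global_model)
        for _ in range(t):
            local = local_step(local, client)
        delta = [l - g for l, g in zip(local, global_model)]
        total = [a + q for a, q in zip(total, quantize(delta, s, rng))]
    return [g + a / r for g, a in zip(global_model, total)]
```

Under these assumptions, a larger s gives a finer (less noisy) quantizer at the cost of more bits per coordinate, while r and t trade communication against local computation.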

Wav2vec2-Pretraining

26
Stars
2
Forks

Wav2vec 2.0 Self-Supervised Pretraining

fewbit

40
Stars
4
Forks

Compression scheme for gradients of activations in the backward pass

IntraQ

31
Stars
1
Forks

PyTorch implementation of our paper accepted by CVPR 2022 -- IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization

SSD-Pruning-and-quantization

28
Stars
6
Forks

Pruning and quantization for SSD. Model compression.

BEVFormer_tensorrt

417
Stars
68
Forks

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

stable-diffusion-streamlit

54
Stars
7
Forks

Quantized Stable Diffusion that cuts memory use by 75%, tested in Streamlit and deployed in a container

BitNet-Transformers

241
Stars
31
Forks

0️⃣1️⃣🤗 BitNet-Transformers: Hugging Face Transformers implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch with Llama(2) architecture

SAI

28
Stars
4
Forks

SDK for TEE AI Stick (includes model training script, inference library, examples)