quantization topic
FedPAQ-MNIST-implemenation
An implementation of FedPAQ using different experimental parameters. We will be looking at different variations of how, r(number of clients to be selected), t (local epochs) and s (Quantizer levels))
Wav2vec2-Pretraining
Wav2vec 2.0 Self-Supervised Pretraining
fewbit
Compression schema for gradients of activations in backward pass
IntraQ
Pytorch implementation of our paper accepted by CVPR 2022 -- IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization
SSD-Pruning-and-quantization
Pruning and quantization for SSD. Model compression.
Yolo-compression-and-deployment-in-FPGA
基于FPGA量化的人脸口罩检测
BEVFormer_tensorrt
BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).
stable-diffusion-streamlit
Quantized stable-diffusion cutting down memory 75%, testing in streamlit, deploying in container
BitNet-Transformers
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture
SAI
SDK for TEE AI Stick (includes model training script, inference library, examples)