Compressed-Transformers

In this repository, we explore model compression for transformer architectures via quantization. Specifically, we explore quantization-aware training of the linear layers and demonstrate the performanc...
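
To make the idea of quantization-aware training of a linear layer concrete, here is a minimal pure-Python sketch. It is not taken from the repository: the function names (`fake_quantize`, `linear_forward`, `sgd_step`), the symmetric per-tensor quantization scheme, and the use of a straight-through estimator for the gradient are all assumptions chosen for illustration.

```python
def fake_quantize(weights, num_bits=8):
    """Symmetric uniform fake quantization (assumed scheme):
    snap each weight to an integer grid, then map it back to a float."""
    qmax = 2 ** (num_bits - 1) - 1  # e.g. 127 for 8 bits
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return list(weights)
    scale = max_abs / qmax
    return [max(-qmax, min(qmax, round(w / scale))) * scale for w in weights]


def linear_forward(x, weights, bias, num_bits=8):
    """Forward pass of a single output unit using fake-quantized weights."""
    q_weights = fake_quantize(weights, num_bits)
    return sum(xi * wi for xi, wi in zip(x, q_weights)) + bias


def sgd_step(x, weights, bias, target, lr=0.1, num_bits=8):
    """One SGD step on the loss 0.5 * (prediction - target)^2.

    Straight-through estimator (assumption): the rounding step is treated
    as the identity in the backward pass, so the gradient computed through
    the quantized forward pass is applied to the full-precision weights.
    """
    error = linear_forward(x, weights, bias, num_bits) - target
    new_weights = [w - lr * error * xi for w, xi in zip(weights, x)]
    new_bias = bias - lr * error
    return new_weights, new_bias
```

The full-precision weights are kept and updated throughout training, while every forward pass sees only their quantized values, so the model learns weights that still work after rounding.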
