Compressed-Transformers
In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization-aware training of the linear layers and demonstrate the performanc...
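As a rough illustration of what quantization-aware training of a linear layer involves, the sketch below applies symmetric per-tensor "fake quantization" to the weights in the forward pass: each weight is rounded to a signed integer grid and mapped back to a float, so the layer sees the rounding error it would incur after compression. The function names (`symmetric_scale`, `fake_quantize`, `quantized_linear`), the 8-bit default, and the per-tensor scaling are assumptions made for this example and are not taken from the repository. In an actual training loop the rounding step is usually paired with a straight-through estimator in the backward pass, which is omitted here.

```python
def symmetric_scale(values, num_bits=8):
    """Step size that maps the largest magnitude onto the top integer level."""
    qmax = 2 ** (num_bits - 1) - 1  # e.g. 127 for 8 bits
    max_abs = max(abs(v) for v in values)
    return max_abs / qmax if max_abs > 0 else 1.0


def fake_quantize(value, scale, num_bits=8):
    """Round to the nearest integer level, clamp, then map back to a float."""
    qmax = 2 ** (num_bits - 1) - 1
    level = max(-qmax, min(qmax, round(value / scale)))
    return level * scale


def quantized_linear(x, weight, bias, num_bits=8):
    """Compute y = W_q x + b, where W_q is the fake-quantized weight matrix.

    Hypothetical helper: `weight` is a list of rows, `x` and `bias` are lists.
    """
    scale = symmetric_scale([w for row in weight for w in row], num_bits)
    w_q = [[fake_quantize(w, scale, num_bits) for w in row] for row in weight]
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(w_q, bias)]


if __name__ == "__main__":
    weight = [[0.5, -1.0], [0.25, 1.0]]
    x = [1.0, 2.0]
    bias = [0.0, 0.0]
    # Compare the quantized output against the full-precision result [-1.5, 2.25].
    print(quantized_linear(x, weight, bias, num_bits=8))
```

Lowering `num_bits` makes the grid coarser, which is one simple way to observe how the output drifts from the full-precision result as compression becomes more aggressive.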