auto-round
simplify what's new and add publication_list
Here I have collected all the blogs and publications I am familiar with. If any publication or blog is missing, please feel free to comment here. @WeiweiZhang1 @n1ck-guo @wenhuach21 @hshen14 @thuang6
1. Add the enhanced SmoothQuant blog post: it is the earliest work to search for the optimal alpha, predating AWQ, and it has already been deployed in AMD's quantization tool. (Link: https://medium.com/intel-analytics-software/effective-post-training-quantization-for-large-language-models-with-enhanced-smoothquant-approach-93e9d104fb98 )
2. Add TEQ: it is the first method that learns (trains) alpha, predating OmniQuant.