nncf icon indicating copy to clipboard operation
nncf copied to clipboard

Andreyan/awq extention

Open andreyanufr opened this issue 1 year ago • 0 comments

Changes

Extended AWQ algorithms for patterns Act->MatMul and Act->Multiply->MatMul with insertion for extra scales after activation.

Reason for changes

Support AWQ for wider family of LLMs

Related tickets

CVS-141131

Tests

Added unit tests

andreyanufr avatar May 14 '24 07:05 andreyanufr