nncf
nncf copied to clipboard
Andreyan/awq extention
Changes
Extended AWQ algorithms for patterns Act->MatMul and Act->Multiply->MatMul with insertion for extra scales after activation.
Reason for changes
Support AWQ for wider family of LLMs
Related tickets
CVS-141131
Tests
Added unit tests