oneDNN icon indicating copy to clipboard operation
oneDNN copied to clipboard

Support weight-compressed date type s8

Open usstq opened this issue 1 year ago • 0 comments

Description

Inner-product with symmetrically quantized/compressed weight may have s8 as weight data type (it saves the zero-point subtraction cost), this change added support to such weight dt.

Fixes # (github issue)

Checklist

General

  • [ ] Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?
  • [ ] Have you formatted the code using clang-format?

Performance improvements

  • [ ] Have you submitted performance data that demonstrates performance improvements?

New features

  • [ ] Have you published an RFC for the new feature?
  • [ ] Was the RFC approved?
  • [ ] Have you added relevant tests?

Bug fixes

  • [ ] Have you included information on how to reproduce the issue (either in a github issue or in this PR)?
  • [ ] Have you added relevant regression tests?

RFC PR

  • [ ] Does RFC document follow the template?
  • [ ] Have you added a link to the rendered document?

usstq avatar May 10 '24 01:05 usstq