oneDNN
Support weight-compressed data type s8
Description
An inner product with symmetrically quantized (compressed) weights may use s8 as the weight data type, which saves the cost of the zero-point subtraction. This change adds support for that weight data type.
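To illustrate why a symmetric s8 weight avoids the zero-point subtraction, here is a minimal standalone sketch. It is not the oneDNN kernel and uses no oneDNN API; the function names `dot_asymmetric_u8` and `dot_symmetric_s8`, the per-tensor `scale`, and the f32 source are assumptions made only for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical reference for asymmetric u8 weights: each weight is
// decompressed as scale * (w - zero_point), so every element pays for
// one extra subtraction before the multiply-accumulate.
float dot_asymmetric_u8(const std::vector<float>& src,
                        const std::vector<std::uint8_t>& wei,
                        float scale, std::int32_t zero_point) {
    assert(src.size() == wei.size());
    float acc = 0.0f;
    for (std::size_t i = 0; i < src.size(); ++i) {
        const std::int32_t centered = static_cast<std::int32_t>(wei[i]) - zero_point;
        acc += src[i] * static_cast<float>(centered);
    }
    return scale * acc;
}

// Hypothetical reference for symmetric s8 weights: the zero point is
// implicitly 0, so decompression is just scale * w and the subtraction
// disappears from the inner loop.
float dot_symmetric_s8(const std::vector<float>& src,
                       const std::vector<std::int8_t>& wei,
                       float scale) {
    assert(src.size() == wei.size());
    float acc = 0.0f;
    for (std::size_t i = 0; i < src.size(); ++i) {
        acc += src[i] * static_cast<float>(wei[i]);
    }
    return scale * acc;
}
```

Under these assumptions, the s8 weights `{-2, 0, 5}` and the u8 weights `{126, 128, 133}` with zero point 128 encode the same values, so both functions return the same result for the same source and scale; only the s8 path skips the per-element subtraction.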
Fixes # (github issue)
Checklist
General
- [ ] Do all unit and benchdnn tests (`make test` and `make test_benchdnn_*`) pass locally for each commit?
- [ ] Have you formatted the code using clang-format?
Performance improvements
- [ ] Have you submitted performance data that demonstrates performance improvements?
New features
- [ ] Have you published an RFC for the new feature?
- [ ] Was the RFC approved?
- [ ] Have you added relevant tests?
Bug fixes
- [ ] Have you included information on how to reproduce the issue (either in a github issue or in this PR)?
- [ ] Have you added relevant regression tests?
RFC PR
- [ ] Does RFC document follow the template?
- [ ] Have you added a link to the rendered document?