dash-infer
dash-infer copied to clipboard
Add Support for mac armv9 support, with A16W8 quantization.
- need port to mac (M4 + ) ARMv9
- disable openMP (because default xcode clang not support openMP)
- enable a16w8 quantization support to reduce RAM requirement.