Lingching

Results 1 issues of Lingching

- Add complete quantized_matmul_impl_typed template function for CPU (float16, float32, and bfloat16). - Add fp32 test cases for quantized_matmul. - Relax float32 tolerance in test utils.