Ken Unger

Results: 3 issues by Ken Unger

- Adding these QU8 kernels for completeness, although I recognize that QU8 will be deprecated at some point.
- Small change to qs8-gemm/rvv.in for qu8 support; however, qs8/qd8 generated kernels...

- Supersedes #7126, which was not committed. I implemented this similarly to the neon and other implementations so we can get to u4v.

While implementing https://github.com/google/XNNPACK/pull/7638 and attempting to run qs8-dwconv-bench with xnn_qs8_dwconv_minmax_fp32_ukernel_25p8vc, the benchmark encounters a malloc error after several tests. Running with 9p8vc is fine. I noticed that the current qs8-dwconv-bench...