Tao Lv
Tao Lv
This proposal aims to introduce a set of graph API into oneDNN. Rendered version: [link](https://github.com/TaoLv/mkl-dnn/blob/lvtao/rfcs/graph-api/rfcs/20220711-graph-api/README.md) cc @jianhui-li @igorsafo @mgouicem
Great benchmark for evaluting RNN units with different frameworks. But the world is changing fast and TensorFlow has evolved to version 1.4.0rc0. The benchmark for TenforFlow depends on some deprecated...
This is to propose adding OpenCL GPU runtime support in oneDNN Graph API. It addresses the API limitations mentioned RFC https://github.com/oneapi-src/oneDNN/pull/1745 and also paves the way for OpenVINO integration.
The RFC proposes adding support for Swish operation in Graph API. Rendered version: [link](https://github.com/TaoLv/mkl-dnn/blob/lvtao/rfcs/swish/rfcs/20241008-graph-api-swish/README.md) .
This is to propose supporting safe softmax for sdpa in graph API. A rendered version can be found at: [link](https://github.com/uxlfoundation/oneDNN/blob/lvtao/rfcs/safe-softmax/rfcs/20250321-graph-api-safe-softmax/README.md).
Draft an example for int8 SDPA. MFDNN-13484.
- Added a fp8 (specifically f8_e4m3) sdpa example, with static quantizations. - Exported the JSON file from the example and added into benchdnn inputs as test cases. - Renamed the...
1. Pass the accumulation mode attribute to matmul driver in reference path. 2. Add test cases where both QK and VS are in f16 accumulation mode. Previously only QK in...
Fixes MFDNN-13011 The op kind is not intended for public usage. But with it in the list, we have seen ABI compatibility issue every time when a new op kind...