executorch
executorch copied to clipboard
On-device AI across mobile, embedded and edge for PyTorch
Summary: This diff adds support for multi query attention for sdpa with kv cache Reviewed By: iseeyuan Differential Revision: D56212419
Summary: Handle multiple memory IDs by dumping them into different processes in trace view. This solution seemed the simplest, and since the time stamps match between processes it should be...
Differential Revision: D53747744
Summary: LLVM-15 has a warning `-Wunused-but-set-variable` which we treat as an error because it's so often diagnostic of a code issue. Unused variables can compromise readability or, worse, performance. This...
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: We have optimizations to remove view_copy operations, so we covert view_copy-like operations (squeeze_copy, unsqueeze_copy, select_copy) to view_copy operations where possible. Differential Revision: D54866539
I follow the instruction in [https://pytorch.org/executorch/main/build-run-qualcomm-ai-engine-direct-backend.html](url) and finish the following steps: `cd $EXECUTORCH_ROOT` `mkdir build_android` `cd build_android` `cmake .. \ -DBUCK2=buck2 \ -DCMAKE_INSTALL_PREFIX=$PWD \ -DEXECUTORCH_BUILD_QNN=ON \ -DQNN_SDK_ROOT=$QNN_SDK_ROOT \ -DCMAKE_TOOLCHAIN_FILE=$ANDROID_NDK/build/cmake/android.toolchain.cmake \...
Summary: Method name update Differential Revision: D56072265