executorch issues

[executorch][llama] support mqa

2

Summary: This diff adds support for multi-query attention (MQA) in SDPA with KV cache. Reviewed By: iseeyuan. Differential Revision: D56212419

larryliu0820

CLA Signed

fb-exported
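
For readers unfamiliar with the term used in the entry above: in multi-query attention, all query heads attend over a single shared key/value head, so the KV cache holds one head instead of one per query head. The following is a minimal pure-Python sketch of that idea for a single decoding step. It is not the code from the diff; the function names, argument layout, and list-based tensors are all assumptions made for illustration.

```python
import math


def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def kv_head_for(q_head, n_heads, n_kv_heads):
    # Each group of n_heads // n_kv_heads query heads shares one KV head.
    # With n_kv_heads == 1 (MQA), every query head maps to KV head 0.
    return q_head // (n_heads // n_kv_heads)


def sdpa_with_kv_cache(q, k_cache, v_cache, n_kv_heads):
    """Hypothetical helper, for illustration only.

    q:        [n_heads][head_dim], queries for the current token
    k_cache:  [n_kv_heads][seq_len][head_dim], cached keys
    v_cache:  [n_kv_heads][seq_len][head_dim], cached values
    """
    n_heads = len(q)
    head_dim = len(q[0])
    scale = 1.0 / math.sqrt(head_dim)
    out = []
    for h in range(n_heads):
        kv = kv_head_for(h, n_heads, n_kv_heads)
        # Scaled dot product of this head's query with every cached key.
        scores = [scale * sum(a * b for a, b in zip(q[h], k)) for k in k_cache[kv]]
        weights = softmax(scores)
        # Weighted sum of the cached values.
        out.append([
            sum(w * v[d] for w, v in zip(weights, v_cache[kv]))
            for d in range(head_dim)
        ])
    return out
```

With `n_kv_heads=1`, the caches above contain a single head regardless of how many query heads there are, which is where the memory saving comes from.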

Handle multiple memory IDs using pid

13

Summary: Handle multiple memory IDs by dumping them into different processes in trace view. This solution seemed the simplest, and since the time stamps match between processes it should be...

skrtskrtfb

CLA Signed

fb-exported
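
The entry above describes mapping each memory ID to its own process in the trace view. A rough sketch of that mapping follows, assuming the trace uses the Chrome Trace Event JSON format (the summary does not say which format is used). The input record keys (`name`, `memory_id`, `start_us`, `dur_us`) and the function name are hypothetical.

```python
import json


def to_trace_events(allocations):
    """Build a Chrome-trace-style dict from hypothetical allocation records."""
    events = []
    for a in allocations:
        events.append({
            "name": a["name"],
            "ph": "X",               # "complete" event: has a start and a duration
            "ts": a["start_us"],     # start timestamp in microseconds
            "dur": a["dur_us"],      # duration in microseconds
            "pid": a["memory_id"],   # one "process" row per memory ID
            "tid": 0,
        })
    return {"traceEvents": events}


if __name__ == "__main__":
    sample = [
        {"name": "tensor_a", "memory_id": 1, "start_us": 0, "dur_us": 10},
        {"name": "tensor_b", "memory_id": 2, "start_us": 5, "dur_us": 10},
    ]
    print(json.dumps(to_trace_events(sample), indent=2))
```

Because every event keeps its original timestamp, events from different memory IDs stay aligned on the shared time axis even though they appear under separate processes.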

Prepare the iOS app to run test on Device Farm

1

WIP, not ready for review yet.

huydhn

CLA Signed

introduce _to_dim_order_copy op to runtime

35

Differential Revision: D53747744

Gasoonjia

CLA Signed

fb-exported

Remove unused variables in eki/builder/offloading/BlockAllocator.cpp

3

Summary: LLVM-15 has a warning `-Wunused-but-set-variable` which we treat as an error because it's so often diagnostic of a code issue. Unused variables can compromise readability or, worse, performance. This...

r-barnes

CLA Signed

fb-exported

Add quantized op support to llama runner

1

larryliu0820

CLA Signed

Replace view-like ops with view ops

14

Summary: We have optimizations to remove view_copy operations, so we convert view_copy-like operations (squeeze_copy, unsqueeze_copy, select_copy) to view_copy operations where possible. Differential Revision: D54866539

metascroy

CLA Signed

fb-exported
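
To illustrate the "where possible" condition in the entry above: a view_copy keeps every element and only changes the shape, so an op can be rewritten as a view_copy only when it keeps every element too. The sketch below computes the target shape for each of the three ops under that reasoning, assuming static shapes. It is not the pass from the diff; the function name and the string op names are hypothetical.

```python
def view_shape_for(op, shape, dim):
    """Return the equivalent view_copy target shape, or None if the op
    is not expressible as a plain view (hypothetical helper)."""
    shape = list(shape)
    if op == "unsqueeze_copy":
        # Valid dims are in [-(rank + 1), rank]; normalize negatives.
        if dim < 0:
            dim += len(shape) + 1
        return shape[:dim] + [1] + shape[dim:]
    if dim < 0:
        dim += len(shape)
    if op == "squeeze_copy":
        # Squeezing a dim whose size is not 1 leaves the shape unchanged.
        if shape[dim] == 1:
            return shape[:dim] + shape[dim + 1:]
        return shape
    if op == "select_copy":
        # select drops all but one slice along dim, so it preserves the
        # element count (and can be a view) only when that dim has size 1.
        if shape[dim] == 1:
            return shape[:dim] + shape[dim + 1:]
        return None
    raise ValueError(f"unsupported op: {op}")
```

For example, `select_copy` on a `[1, 4]` tensor along dim 0 maps to a view with shape `[4]`, while the same op on a `[2, 4]` tensor discards half the elements and has to stay as it is.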

No such file or directory: #include "HTP/QnnHtpContext.h"

2

I followed the instructions at https://pytorch.org/executorch/main/build-run-qualcomm-ai-engine-direct-backend.html and completed the following steps: `cd $EXECUTORCH_ROOT` `mkdir build_android` `cd build_android` `cmake .. \ -DBUCK2=buck2 \ -DCMAKE_INSTALL_PREFIX=$PWD \ -DEXECUTORCH_BUILD_QNN=ON \ -DQNN_SDK_ROOT=$QNN_SDK_ROOT \ -DCMAKE_TOOLCHAIN_FILE=$ANDROID_NDK/build/cmake/android.toolchain.cmake \...

Wangbk-dl