PaddleCustomDevice
PaddleCustomDevice copied to clipboard
PaddlePaddle custom device implementaion. (『飞桨』自定义硬件接入实现)
1. make range kernel a function, which uses cnnlArange_v2 2. use range function in flash_attn
/opt/py39/lib/python3.9/site-packages/paddle/include/paddle/phi/kernels/funcs/eigen/extensions.h:23:10: fatal error: unsupported/Eigen/CXX11/Tensor: No such file or directory #include "unsupported/Eigen/CXX11/Tensor" ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ compilation terminated. CMakeFiles/paddle-custom-npu.dir/build.make:257: recipe for target 'CMakeFiles/paddle-custom-npu.dir/kernels/argsort_kernel.cc.o' failed make[2]: *** [CMakeFiles/paddle-custom-npu.dir/kernels/argsort_kernel.cc.o] Error 1 CMakeFiles/paddle-custom-npu.dir/build.make:453: recipe for target 'CMakeFiles/paddle-custom-npu.dir/kernels/contiguous_kernel.cc.o'...
aclnn log_softmax
Fix bn_infer bug, resize before allocate mem