Varun Nawathey comments



Why is 1D convolution on CPU via NEConvolutionLayer so slow?

Hi, @morgolock

## 1. I tried the warmup call, but it is still 10x slower than my implementation. I got 0.017 seconds.

## 2. I think I am using NHWC....
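For readers following the warmup discussion, here is a minimal sketch of timing a call only after some untimed warmup runs. The helper `mean_seconds_after_warmup` and its parameters are hypothetical names introduced for illustration; they are not part of Arm Compute Library, and this is not necessarily how the numbers quoted in the comment were measured.

```c++
#include <chrono>
#include <cstddef>

// Hypothetical helper (an assumption, not an Arm Compute Library API):
// runs `fn` `warmup_runs` times without timing, then returns the mean
// wall-clock time in seconds over `timed_runs` further calls.
template <typename Fn>
double mean_seconds_after_warmup(Fn&& fn, std::size_t warmup_runs, std::size_t timed_runs)
{
    // Untimed calls, so one-off setup costs are paid before measuring.
    for (std::size_t i = 0; i < warmup_runs; ++i) {
        fn();
    }
    if (timed_runs == 0) {
        return 0.0; // nothing to average
    }
    // steady_clock is monotonic, so the interval cannot go negative.
    const auto start = std::chrono::steady_clock::now();
    for (std::size_t i = 0; i < timed_runs; ++i) {
        fn();
    }
    const auto stop = std::chrono::steady_clock::now();
    const std::chrono::duration<double> elapsed = stop - start;
    return elapsed.count() / static_cast<double>(timed_runs);
}
```

One possible use is to pass a lambda that invokes the configured layer's `run()` method, for example `mean_seconds_after_warmup([&] { conv.run(); }, 5, 100)`, where `conv` is assumed to be an already configured layer. Averaging over many timed runs reduces the influence of a single slow call, but the result still includes any scheduler overhead inside the call itself.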

Why is 1D convolution on CPU via NEConvolutionLayer so slow?

Hi, @morgolock I have 7 questions (see numbers).

# 1. High Level Question: How do I get the exact inference time of ARM Compute Library's convolution implementations minus any runtime/scheduler...

Why is 1D convolution on CPU via NEConvolutionLayer so slow?

Hi, @morgolock I am targeting Android API 26. I found out how to call the direct convolution kernels without going through the runtime:

```c++
#include "src/cpu/kernels/directconv2d/nhwc/neon/fp32.cpp"
...
Window win = calculate_max_window(*conv_output.info(), Steps());
arm_compute::cpu::kernels::neon_fp32_nhwc_directconv2d(win,...
```