Mobile_ConvNet icon indicating copy to clipboard operation
Mobile_ConvNet copied to clipboard

RS finish is not used before collecting end time/power

Open sakridge opened this issue 9 years ago • 0 comments

forEach is non-blocking. After calling forEach_conv_() then rs.finish() api should be called for accurate per-stage timing, otherwise the kernel may be not be completed when collecting the end time.

The program should avoid rs.finish if precision per-stage timing is not needed as it can slow down execution.

sakridge avatar Nov 29 '16 04:11 sakridge