w1005444804
> OK, Sir, check which operations your net uses; for some hardware-sensitive operations like convolution, ncnn may use packing, which means changing the memory layout to fit the hardware's L1 cache, or other...
Testing shows that ConvolutionDepthWise takes noticeably longer than Convolution???
@lyogavin
```cpp
void convolution_node::backward2data(const dnnl::memory& diff_dst) {
    m_src_diff_md = dnnl::memory::desc(m_src_dims, dt::f32, tag::any);
    m_weights_diff_md = dnnl::memory::desc({ m_weights_dims }, dt::f32, tag::any);
    m_dst_diff_md = dnnl::memory::desc({ m_dst_dims }, dt::f32, tag::any);
    // ...
    // std::cout
```
dnnl::convolution_backward_data is quite time-consuming:
- infer cost (ms): 10
- backward2data cost (ms): 232 (however, PyTorch/LibTorch costs 30~50 ms)
- backward2weights cost (ms): 12
@igorsafo thanks, activating ONEDNN_VERBOSE does have some effect, but the result is very unstable: the time has changed from the previous 230 ms to a dynamic range of...
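For reference, oneDNN's verbose tracing is controlled by an environment variable set before launching the program; a minimal sketch (the binary name `./conv_benchmark` is a placeholder, not from this thread):

```shell
# 1 = trace primitive execution; 2 = also trace primitive creation
export ONEDNN_VERBOSE=1
./conv_benchmark   # placeholder for the benchmark binary
```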
Hi @igorsafo, is the problem caused by something on my side?
@igorsafo Yes, it is the first layer; my model is a single conv layer. I just wanted to test the speed of forward and backward propagation of convolutions, and then found this...
The code is roughly as follows:

```cpp
// ...
dnnl::memory::dims conv1_src_tz = { 10, 3, 160, 160 };
auto conv1_src_memory = dnnl::memory({ {conv1_src_tz}, dt::f32, tag::nchw }, engine);
convolution_node conv1(engine, 3, 6, 5, ...
```
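For comparison, here is a minimal PyTorch timing sketch with the same shapes as above (10×3×160×160 input, 3→6 channels, 5×5 kernel). The warm-up pass and the `sum()` loss are my additions, not from the thread; single-run `perf_counter` timings are only a rough proxy for a proper benchmark.

```python
import time
import torch

x = torch.randn(10, 3, 160, 160, requires_grad=True)
conv = torch.nn.Conv2d(3, 6, 5)  # in=3, out=6, kernel=5x5

# Warm-up pass so lazy initialization doesn't pollute the timing
conv(x).sum().backward()
x.grad = None

t0 = time.perf_counter()
y = conv(x)
fwd_ms = (time.perf_counter() - t0) * 1000

t0 = time.perf_counter()
y.sum().backward()
bwd_ms = (time.perf_counter() - t0) * 1000

print(f"forward: {fwd_ms:.2f} ms, backward: {bwd_ms:.2f} ms")
```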
@heyoeyo You're right, but I want to know whether it's possible to choose different shapes for the input of the ViT (encoder).