Thierry Moreau

Results 35 comments of Thierry Moreau

Thanks @zhanghaohit, for catching this error. I agree with the fact that the dependence should not be hardcoded. However in order to not to add too many parameters in the...

Thank you for the explanation. Do you mind breaking down what happens in the 32x32 configuration that causes correctness to fail (since the dependence analysis waive is unbounded)? I agree...

@remotego I just read your comment that must have been posted as I wrote my reply. Your explanation makes sense, ultimately the critical path in the accumulation is READ ->...

I assume that if you have an FPGA that needs to be clocked at a high frequency, you'll likely end up with more cycles.

So to conclude, I agree with you that we may want this parameter to be larger than 2 cycles if the hardware pipeline becomes more complicated. So the big question...

Perhaps one way to look at this is to start with DISTANCE=3 by default. And if the II>1 for the GEMM, we issue a warning telling the user to increase...

I'll think some more about this problem overnight, thanks for bringing attention to this issue @zhanghaohit @remotego

@zhanghaohit I think that the warning approach is a reasonable compromise between user-defined flexibility and making sure that they set the value correctly. To echo @liangfu , it would be...