Jinhyeok Kim
Results
1
comments of
Jinhyeok Kim
I think this method(therefore the kernel) only focus on single-batch setting in decoding stage, as other activation sparsity also only focus on them. The advantage of activation sparsity is latency,...