Jinhyeok Kim

Results 1 comments of Jinhyeok Kim

I think this method(therefore the kernel) only focus on single-batch setting in decoding stage, as other activation sparsity also only focus on them. The advantage of activation sparsity is latency,...