Dahun Kim

Results 1 issues of Dahun Kim

If the span-size `K` is smaller than the width `W`, then do we have the size of `(C,W,K)` for the relative position encoding matrix `r^q`? So that it's `einsum`ed with...