yshli

Results 2 issues of yshli

``` def relu_linear_att(self, qkv: torch.Tensor) -> torch.Tensor: B, _, H, W = list(qkv.size()) if qkv.dtype == torch.float16: qkv = qkv.float() qkv = torch.reshape( qkv, ( B, -1, 3 * self.dim,...