lgstd

Results 1 issues of lgstd

An error occurred while run in class DisentangledSelfAttention.forward() where query_states.size(1) > hidden_states.size(1): https://github.com/microsoft/DeBERTa/blob/master/DeBERTa/deberta/disentangled_attention.py line 165: p2c_att = torch.gather(p2c_att, dim=-2, index=pos_index.expand(p2c_att.size()[:2] + (pos_index.size(-2), key_layer.size(-2))))