Gregory (Gabriel) Barello

Results 1 issues of Gregory (Gabriel) Barello

Previously, the `Pix2StructTextModel` was configured with `is_decoder=False` by default causing the attention mask used for self-attention to be non-causal and causing fine-tuning to fail. As a fix, this PR adds...