Gregory (Gabriel) Barello
Results
1
issues of
Gregory (Gabriel) Barello
Previously, the `Pix2StructTextModel` was configured with `is_decoder=False` by default causing the attention mask used for self-attention to be non-causal and causing fine-tuning to fail. As a fix, this PR adds...