Will
Results
2
comments of
Will
Hi @stevhliu, may I work on GPT-2?
What up @ZOUHAN1. The paper released with the repo mentions on page five that "Masked self-attention was used in the text encoder to preserve the ability to initialize with a...