Will

Results 2 comments of Will

Hi @stevhliu, may I work on GPT-2?

What up @ZOUHAN1. The paper released with the repo mentions on page five that "Masked self-attention was used in the text encoder to preserve the ability to initialize with a...