cookielee77
cookielee77
OK, I see. Thanks! I'm think the connection between GPT2 and TransformerXL. I feel like if the GPT2 uses "past" during training, then they are almost the same (except the...
Hi, Thanks for your valuable suggestions. I will update the README refer to your post. Unfortunately, It's difficult for me to update the dependencies as I already lost access to...
Got the same problem. Did you already solve it?
Nope. I didn't debug too many details. I'm not familiar with the author's dataloader type. Maybe the author could fix this at some time.