CapDec icon indicating copy to clipboard operation
CapDec copied to clipboard

About the type of gpt2

Open zjr2000 opened this issue 2 years ago • 1 comments

Hi,

Thanks for your amazing work! I have a question about the type of GPT2. You have mentioned that you use gpt2 large as your langauge model (In section A.1), But I found your code actually load the GPT2 base model:

self.gpt = GPT2LMHeadModel.from_pretrained('gpt2')

Is there any mistake?

zjr2000 avatar Feb 20 '23 10:02 zjr2000

Hi, thanks for paying attention to that. It is a mistake in the paper, we used the base model. Please let me know if you tried both and got better results.

DavidHuji avatar Apr 04 '23 07:04 DavidHuji