CapDec
CapDec copied to clipboard
About the type of gpt2
Hi,
Thanks for your amazing work! I have a question about the type of GPT2. You have mentioned that you use gpt2 large as your langauge model (In section A.1), But I found your code actually load the GPT2 base model:
self.gpt = GPT2LMHeadModel.from_pretrained('gpt2')
Is there any mistake?
Hi, thanks for paying attention to that. It is a mistake in the paper, we used the base model. Please let me know if you tried both and got better results.