katitizhou
Results
1
comments of
katitizhou
What is the right way to set injection_policy to split wte and wpe for GPT2? I found no relevant examples for this question