katitizhou

Results 1 comments of katitizhou

What is the right way to set injection_policy to split wte and wpe for GPT2? I found no relevant examples for this question