Asad Ahsan

Results 2 comments of Asad Ahsan

Hey, i think the swiglu issue is when you try to use GPT2LMHead for loading the model, nonetheless i shifted to load the model using this snippet: import torch from...

@simran-arora @seyuboglu Really sorry for pinging you guys here, but can you guide me a little bit on this?