BitNet
BitNet copied to clipboard
Encountering Size Mismatch Error in Updated Code
While running the updated code, I encountered an issue as follows:
I would greatly appreciate any guidance or assistance you can provide to help resolve this issue.
it seems self.gamma and self.beta all need squeeze, i.e. self.gamma.squeeze()
Thank you so much for your reply. May I ask if you have encountered the following error message?? Because when I add squeeze to self.gamma and self.beta, it reports an error:
Additionally, does training with the latest version of the code still generate gibberish messages like in issue #23?
Stale issue message