Results 4 comments of Jackson

Hey @danielhanchen, thanks for the amazing work! However, I tried saving 4-bits model but encounter following error: ``` NotImplementedError: You are calling `save_pretrained` on a 4-bit converted model. This is...

Ah its okay! I was planning to save it in 16-bit but getting errors, that's why I tried to use 4bit saving. But now I am able to save it...

You could simply change the ceil mode of maxpool layer from last sequential layer of VGG networks to **True** if you are running on different pytorch version from author's. ![image](https://user-images.githubusercontent.com/51978507/199374709-a88beb92-d916-4a8b-91fc-f6b385806359.png)

Hi, I have a question. Since we are accepting the tokens whenever `q > p`, what happens when there is actually other token that has higher probability than `q`?