malcolmsharpe
I ran into this issue as well, in my case with a different 8-bit quant, `TheBloke/Mistral-7B-Instruct-v0.2-GPTQ:gptq-8bit-32g-actorder_True`. An additional clue, in case it's helpful: when the input contains =128 tokens, a...