honuvo

Results 2 comments of honuvo

Hi, to add to this: My observations are the same. Random tokens after a few messages. I may have some new information. I don't have those problems when using a...

> Looks like Q5km works ok in latest koboldcpp if I use OpenBLAS instead of CuBLAS for prompt processing. It's slow, though. EDIT: CLBlast works too on GPU. Wow, indeed,...