Marc Päpper
Awesome, thanks for adding this, @Zetaphor. I tested out the llama agent, but noticed that it was much slower with that Python binding than when I ran llama.cpp directly. There is...
For me, llama.cpp runs for a few seconds, while with the binding it takes more like 30 seconds.
Thanks @MikeBeller, I updated it in the README now.
Hi @bibhas2, do I understand correctly that you are referring to the missing ability to parse sitemaps that in turn link to other sitemaps?
Hi Adam, the error indicates that the model is loaded in half precision (torch.cuda.HalfTensor), so you also need to pass the input in half precision. Presumably, you can just change...
Did the error change after you converted the input with `.half()`?
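For reference, here is a minimal sketch of the dtype-matching idea, assuming a plain PyTorch model. The helper name `match_input_to_model` and the toy `nn.Linear` layer are made up for illustration and are not from the code discussed above. Instead of hard-coding `.half()`, it casts the input to whatever dtype and device the model's parameters use:

```python
import torch
import torch.nn as nn


def match_input_to_model(model: nn.Module, x: torch.Tensor) -> torch.Tensor:
    """Move/cast x to the device and dtype of the model's first parameter.

    Hypothetical helper for illustration only.
    """
    param = next(model.parameters())
    return x.to(device=param.device, dtype=param.dtype)


# Toy stand-in for the real model (assumption: any nn.Module behaves the same way).
model = nn.Linear(4, 2)

# Only switch to half precision on a GPU, mirroring the torch.cuda.HalfTensor case.
if torch.cuda.is_available():
    model = model.cuda().half()

x = torch.randn(3, 4)                # float32 on the CPU by default
x = match_input_to_model(model, x)   # now matches the model's dtype and device
out = model(x)
```

If the model is in half precision on the GPU, this has the same effect as calling `.half()` on the input and moving it to the GPU; on a CPU-only machine the input simply stays in float32.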
Yes, sure, that is a well-known fact. For my use case it doesn't matter, but if this is important to you, feel free to submit a pull request.