ollama-python
Ollama Python: why can't I use all the threads on the CPU?
I analyzed the problem in depth. I get faster responses when I run Ollama from the terminal, so something is wrong on the Python side. From Python it seems to use only the E-cores, and it's too slow.
I have:
Intel Core i9-13980HX (24 cores, 32 threads)
When I test with the GPU, there is also a 30% speed difference between running from Python and running from the terminal. Why is this happening?
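
For anyone who wants to reproduce the comparison, here is a minimal sketch that sends the same request with an explicit thread count and reports tokens per second. It assumes the Ollama server is running at its default local address and uses the `num_thread` option plus the `eval_count` and `eval_duration` response fields from the Ollama API docs. The helper names (`build_payload`, `generate`, `tokens_per_second`) are made up for this example, and I can't say whether setting `num_thread` actually fixes the E-core behavior.

```python
import json
import os
import urllib.request

# Assumption: the Ollama server listens on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(model, prompt, num_thread=None):
    """Build a non-streaming /api/generate request body with an explicit thread count."""
    if num_thread is None:
        # Fall back to the logical CPU count reported by the OS.
        num_thread = os.cpu_count() or 1
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "options": {"num_thread": num_thread},
    }


def generate(payload, url=OLLAMA_URL):
    """POST the payload to the server and return the decoded JSON response."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def tokens_per_second(result):
    """Generation speed from eval_count and eval_duration (nanoseconds)."""
    return result["eval_count"] / (result["eval_duration"] / 1e9)
```

Running `tokens_per_second(generate(build_payload("llama3", "Hello", num_thread=8)))` for a few `num_thread` values (for example 8, 16, 32) should show whether the thread count is what differs between the two setups. `llama3` is only a placeholder model name.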