Luigi Marangon

Results 1 comments of Luigi Marangon

My solution: set n_threads_batch with the cpus's number I set n_threads_batch to cpus's system number minus one import multiprocessing llm = LlamaCpp(... n_threads_batch=multiprocessing.cpu_count()-1, ... ) At line 234 https://llama-cpp-python.readthedocs.io/en/latest/api-reference/ you...