fenixlam issues

Repositories
Issues
Comments

Results 2 issues of


                                            fenixlam

How to increase the threads in inference?

How to increase the threads used in inference? I notice CPU usage in privateGPT.py running is 4 threads. I guess we can increase the number of threads to speed up...

Run on koboldcpp...

I copy the example python and run it. The only thing I change is comment the openai API and release local server setting: ```` os.environ["OPENAI_API_BASE"] = 'https://xxxxxxx-5000.proxy.runpod.net/v1' os.environ["OPENAI_MODEL_NAME"] ='openhermes' #...