Carlos F. Enguix
Carlos F. Enguix
[get_user_data_parallel2.zip](https://github.com/taspinar/twitterscraper/files/2544182/get_user_data_parallel2.zip)
Dear FastChat Developers, I am part of a research group working with integrating Semantic Web-based Knowledge Graphs and LLMs such as Vicuna. We are working on an open-source research project,...
Hi, I am running the api-example.py python script loading TheBloke_Llama-2-13B-chat-GGML model: llama-2-13b-chat.ggmlv3.q2_K On a RAM 64GB and Nvidia GPU 4GB VRAM token throughput per second is really slow under Windows...