dred0n
Results
2
issues of
dred0n
exLlama saved GPTQ, I've gone from 6 token/s to over 40, thank you! Currently it's only supports Llama based models. Here's a few other promising architectures such as: MPT Falcon...
Issue: The following endpoint used `json.dumps` as a return for the endpoints /refresh-files /select-file This conflicts with FastAPI's ability to assign the proper response headers causing CORS issues. Also, the...