geray comments

Results 13 comments of


                                            geray

Support x86_64

I met the same problem，what is the final solution?

Processing bug with CDN and local files

I also meet same problem.

Clarification on API Endpoint: /v1/completions vs /v1/chat/completions

The reference of documentation is that [llama_server](https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md). And I try to rewrtire gguf.py like follow: ```python import logging import time import requests from requests.exceptions import RequestException from tqdm import tqdm...

Clarification on API Endpoint: /v1/completions vs /v1/chat/completions

> What model here is being evaluated? I will try to look into this soon. Qwen1.8B I discovered that the original script gguf.py, when matched with llama-cpp-python to start the...

Clarification on API Endpoint: /v1/completions vs /v1/chat/completions

> hi, I met the same problem as yours, have you solved this problem? try to use llama-cpp-python replace llama.cpp start server

Clarification on API Endpoint: /v1/completions vs /v1/chat/completions

> In GGUF it should determine based on the offsets which tokens are part of the continuation and which are not (the `while` loop in the screenshot skips any context...

llama / gguf interface broken?

I meet same error at [#1637](https://github.com/EleutherAI/lm-evaluation-harness/issues/1637)

[Enhancement Request] Fine-Grained GTID Support for Improved Read-After-Write Performance

Thank you for your interest in this topic. If you would like to proceed, please feel free to send an email to [email protected]. Your understanding is correct, and we can...

[BUG] mysql scale cluster bench error: DDL command denied to user 'root'

fixed by https://github.com/apecloud/kubeblocks-addons/pull/341