vinvcn
Please refer to the description of this feature request for how to use Redis async. Also see this issue for an explanation: https://github.com/zilliztech/GPTCache/issues/415
Seems like a precision issue? Try setting the dtype to match your GPU:

```python
import jax.numpy as jnp
from whisper_jax import FlaxWhisperPipline

pipeline = FlaxWhisperPipline("openai/whisper-large-v2", dtype=jnp.bfloat16, batch_size=16)
```
> We use an A10 for inference; it also looks slow.

Could you provide a brief measurement of that, e.g. tokens/words per second?
> You can use other fast inference libraries like FasterTransformer. We will also soon release a high-throughput batching backend.

Thanks for your work to improve this. Would you please...
It seems the VSCode extension uses a combination of ChatGPT-like models and the CodeGeeX models. CodeGeeX appears to have been trained on code data only, which is not good enough...
I had to set up JAX by following the harder "install locally" option. That is: install the CUDA drivers and cuDNN, then run the above command to install jax[cuda11_local]. The caveat is you...
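For reference, the local-CUDA install path described above can be sketched roughly as below. The extras name `jax[cuda11_local]` and the wheel index URL follow the JAX install docs at the time; verify against the current instructions before copying:

```shell
# Sketch of the "install locally" option for JAX.
# Assumes the NVIDIA driver, a CUDA 11.x toolkit, and cuDNN are already
# installed system-wide (that is the point of the *_local extras).
pip install --upgrade pip
pip install --upgrade "jax[cuda11_local]" \
    -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
```

After installation, `python -c "import jax; print(jax.devices())"` should list your GPU rather than only CPU devices.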
Does chatpdf use a fine-tuned model? I'd really like to know how they did it.