Thanks! I am currently working on a new version of the inference runtime that eliminates waiting time in some CPU-bound cases, and multi-device inference is on its roadmap!
> Well, that's a frontend bug; I'll fix it today. As for Apple, it should be the Metal backend being filtered out by the web frontend.

The frontend has been updated in v0.5.14; please try again.
@cgisky1980
Actually the prefill chunk size maxes out at about 256; going higher isn't worth it. It is not a limit on the total token length. It...
I see. There is a 4k limit in the backend for a single request. I can remove it anyway.
The limit has been removed.
Thanks! There is a C FFI ([here](https://github.com/cryscan/web-rwkv-ffi)). It's not as flexible, but it is simple to use and extend. Feel free to extend it for your own usage, or reach...
Ah, I've made it public now.
> The link https://github.com/cryscan/web-rwkv-ffi seems to be dead (404) Is that helpful to your application?