PAMD.LA
> I had a similar issue. Turns out I had to **open all the ports for my server in the network firewall** as Client tries to connect to assigned port...
Slow inference has many possible causes; one is simply model size (too many parameters). For comparison, the CPU build of gpt4all runs at an acceptable speed. So I am wondering whether MOSS could reach that level too. For companies that only have large CPU clusters, that would be valuable.
> You just need to modify model_inference.py and remove the `.cuda()` and `.to("cuda")` calls.

OK, I'll go look for them.
```bash
# wandb local --upgrade
wandb: WARNING `wandb local` has been replaced with `wandb server start`.
```

My OS: MacBook Pro, macOS 12.6.3; wandb: 0.17.1. Not working here...
Great, I made it work. However, reranker models are really helpful for RAG; please make a plan to support them.
Has this been solved? I hit the same problem when trying to register the GGUF of qwq32b.
I installed the Xinference image via `docker pull xprobe/xinference:v1.4.1`. Does this image not include CUDA?
Yeah, but I did not find a way out... it seems kind of complicated to modify the frontend code...
No, I am not good at frontend frameworks like this.