67lc

Results 2 comments of 67lc

Thank you for you answer @mmdbhs ,but i want to change C++ and CUDA code.This mode is unsuitable with precompiled wheel.

**Now,I try to use PD disaggregated with 2 nodes. Every node have 8 V100 with NVlinks** **Commands:** master:`CUDA_VISIBLE_DEVICES=0 python -m lightllm.server.api_server \ --model_dir /share/models/meta-llama/Llama-3.2-1B-Instruct \ --run_mode "pd_master" \ --host 10.0.0.103...