lora-scripts
lora-scripts copied to clipboard
多卡训练FLUX失败
我在提交任务后,终端报错如下:
ConnectionError: Tried to launch distributed communication on port 29500, but another process is utilizing it. Please specify a different port (such as using the --main_process_port flag or specifying a different main_process_port in your config file) and rerun your script. To automatically use the next open port (on a single node), you can set this to 0.
20:29:38-869927 ERROR Training failed / 训练失败
我的显卡型号是4090