Oruli comments

Results: 17 comments of Oruli

How to train Flux Lora on multiple GPUs?

Still nothing? Why is MultiGPU ignored in so many of these projects?

How to train Flux Lora on multiple GPUs?

> > > Still nothing? Why is MultiGPU ignored in so many of these projects?
> > >
> > > Most likely, the impossibility of simple writing and debugging...

How to train Flux Lora on multiple GPUs?

This has been open for a year. Is there actually anyone on the project who can at least acknowledge it and add it to a todo list or something?

[BUG] - Multiple 5090s failing on deepspeed.initialize()

> Hi [@Oruli](https://github.com/Oruli) - we don't have any 5090s so we cannot test this but I do not see this on a machine with 2 A6000s. Could you perhaps share...

[BUG] - Multiple 5090s failing on deepspeed.initialize()

@loadams Still no reply even after my offer to debug and get this fixed? I'm not the only person with 2 x 5090s.

[BUG] - Multiple 5090s failing on deepspeed.initialize()

> [@Oruli](https://github.com/Oruli) - I re-read the thread. Are you still seeing this with the latest DeepSpeed version? Just so we can narrow our search to the current commits. > >...

[BUG] - Multiple 5090s failing on deepspeed.initialize()

@loadams Thanks for the help. To be clear, my setup uses Conda. I'm installing as follows to be able to run your script: ``` conda create -n deepspeed...

[BUG] - Multiple 5090s failing on deepspeed.initialize()

@loadams This is still an issue. I would really like to use the other 5090 I paid for. Any chance we can get it resolved?

[BUG] - Multiple 5090s failing on deepspeed.initialize()

> [@Oruli](https://github.com/Oruli), I noticed in your OP that the failure occurs during a `send/recv` operation. Can you also try the p2p tests in the communication benchmark suite? https://github.com/deepspeedai/DeepSpeedExamples/tree/master/benchmarks/communication As I'm...

[BUG] - Multiple 5090s failing on deepspeed.initialize()

@loadams @tdrussell thank you for the help. Here is the output: ``` deepspeed --num_gpus=2 test.py --deepspeed_config ds_config.json [2025-07-17 09:15:53,939] [INFO] [real_accelerator.py:254:get_accelerator] Setting ds_accelerator to cuda (auto detect) [2025-07-17 09:15:56,497] [WARNING]...