Oruli

17 comments by Oruli

> [@Oruli](https://github.com/Oruli) - do you have a deepspeed config in that folder? I think the default of not having one is what is causing this error. @tdrussell I'm running your...

@loadams @tdrussell Error below. It differs from the one I posted above because I've now included your ds_config.json ``` [2025-07-28 11:04:04,813] [INFO] [config.py:944:print_user_config] json = { "train_batch_size": 8, "train_micro_batch_size_per_gpu": 1, "gradient_accumulation_steps":...

With @tdrussell's updated script from here: https://github.com/tdrussell/diffusion-pipe/issues/235 I get this output ``` [2025-07-28 11:08:30,568] [INFO] [config.py:944:print_user_config] json = { "train_batch_size": 8, "train_micro_batch_size_per_gpu": 1, "gradient_accumulation_steps": 4, "fp16": { "enabled": true...
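For readers checking a config like the one logged above: DeepSpeed expects `train_batch_size` to equal `train_micro_batch_size_per_gpu` × `gradient_accumulation_steps` × the number of data-parallel ranks. The sketch below is only an illustration using the three values visible in the log. The helper name `implied_data_parallel_size` is hypothetical (not a DeepSpeed API), and the result of 2 ranks is an inference from those numbers, not something stated in the thread.

```python
def implied_data_parallel_size(train_batch_size: int,
                               micro_batch_per_gpu: int,
                               grad_accum_steps: int) -> int:
    """Return the data-parallel rank count implied by the three batch settings.

    Hypothetical helper for illustration; it is not part of DeepSpeed.
    """
    samples_per_rank = micro_batch_per_gpu * grad_accum_steps
    if samples_per_rank <= 0:
        raise ValueError("micro batch size and accumulation steps must be positive")
    if train_batch_size % samples_per_rank != 0:
        raise ValueError(
            "train_batch_size is not a multiple of "
            "micro_batch_per_gpu * gradient_accumulation_steps"
        )
    return train_batch_size // samples_per_rank


# Values copied from the logged config; everything else in the log is elided.
ds_config = {
    "train_batch_size": 8,
    "train_micro_batch_size_per_gpu": 1,
    "gradient_accumulation_steps": 4,
}

ranks = implied_data_parallel_size(
    ds_config["train_batch_size"],
    ds_config["train_micro_batch_size_per_gpu"],
    ds_config["gradient_accumulation_steps"],
)
print(ranks)  # 2
```

If the number of data-parallel ranks actually launched differs from this value, DeepSpeed's own config validation would be expected to reject the combination, so this is one quick thing to rule out when comparing runs.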

@loadams I've provided all the info you asked for. What else do you need? The discussion spilled over to the original thread, but the developer cannot test any further: https://github.com/tdrussell/diffusion-pipe/issues/235

> [@Oruli](https://github.com/Oruli) - this recent one looks to be working (agree with [this comment](https://github.com/tdrussell/diffusion-pipe/issues/235#issuecomment-3138290598)). So DeepSpeed is able to load the model onto the GPU just fine. > > I'm...

@ChuckNovice It's been fixed for me for a while now. I thought it was an update here (assuming you've updated everything), but maybe another package. Make sure you are running...

> Not using single switches to control multiple pathways as I had originally asked about. But I am using the work around that I summarized in the "EDIT" section. >...