
Multi-GPU parallel training for DreamBooth without CUDA out-of-memory

Open loboere opened this issue 3 years ago • 4 comments

I have 2 GPUs and I would like to use both to train DreamBooth without running out of CUDA memory.

They say that I should use nn.DataParallel, but I don't know where to put it.

loboere avatar Nov 19 '22 06:11 loboere

@loboere are you referring to this pytorch documentation?

import torch
import torch.nn as nn

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
model = ...  # your model

if torch.cuda.device_count() > 1:
    print("Let's use", torch.cuda.device_count(), "GPUs!")
    # DataParallel splits the batch along dim 0:
    # [30, xxx] -> [10, ...], [10, ...], [10, ...] on 3 GPUs
    model = nn.DataParallel(model)

model.to(device)

averad avatar Nov 19 '22 19:11 averad

I am also curious 🤔 @loboere please try https://github.com/huggingface/accelerate

pip install accelerate
accelerate config
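
Once configured for multi-GPU, you launch the training script through `accelerate launch`. A rough sketch, assuming the `train_dreambooth.py` example script from the diffusers repo; the model name, paths, and hyperparameters below are placeholders, and flags like `--gradient_checkpointing` (offered by that script) can further reduce per-GPU memory:

```shell
# After `accelerate config` (choose multi-GPU and the number of GPUs),
# launch the diffusers DreamBooth example script on all configured GPUs.
# All paths and values below are placeholders -- adjust for your setup.
accelerate launch train_dreambooth.py \
  --pretrained_model_name_or_path="runwayml/stable-diffusion-v1-5" \
  --instance_data_dir="./my_instance_images" \
  --instance_prompt="a photo of sks dog" \
  --output_dir="./dreambooth-out" \
  --resolution=512 \
  --train_batch_size=1 \
  --gradient_accumulation_steps=1 \
  --max_train_steps=400
```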

camenduru avatar Nov 19 '22 22:11 camenduru

I strongly advise against using nn.DataParallel; even PyTorch no longer recommends it. One should use https://pytorch.org/docs/stable/generated/torch.nn.parallel.DistributedDataParallel.html#torch.nn.parallel.DistributedDataParallel instead.
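
Note that neither DataParallel nor DistributedDataParallel by itself lowers per-GPU memory use: each GPU still holds a full model replica, so OOM relief comes mainly from splitting the batch across processes. A minimal DDP sketch (not the actual DreamBooth training loop; the `nn.Linear` stands in for the real model) that you would normally start with `torchrun --nproc_per_node=2 train.py`:

```python
import os
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    # torchrun sets RANK, LOCAL_RANK, WORLD_SIZE, MASTER_ADDR, MASTER_PORT
    # for each process; the defaults below also allow a single-process run.
    os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
    os.environ.setdefault("MASTER_PORT", "29500")
    os.environ.setdefault("RANK", "0")
    os.environ.setdefault("WORLD_SIZE", "1")

    use_cuda = torch.cuda.is_available()
    dist.init_process_group(backend="nccl" if use_cuda else "gloo")
    local_rank = int(os.environ.get("LOCAL_RANK", "0"))
    device = torch.device(f"cuda:{local_rank}" if use_cuda else "cpu")

    model = nn.Linear(10, 2).to(device)  # placeholder for the real model
    model = DDP(model, device_ids=[local_rank] if use_cuda else None)

    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
    x = torch.randn(8, 10, device=device)  # placeholder batch
    loss = model(x).sum()
    optimizer.zero_grad()
    loss.backward()  # gradients are averaged across all processes here
    optimizer.step()

    dist.destroy_process_group()
    return loss.item()

if __name__ == "__main__":
    print(f"loss: {main():.4f}")
```

Each process sees its own shard of the data (normally via a DistributedSampler), and DDP synchronizes gradients during `backward()`, which is why PyTorch recommends it over DataParallel even on a single machine.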

patrickvonplaten avatar Nov 21 '22 09:11 patrickvonplaten

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

github-actions[bot] avatar Dec 19 '22 15:12 github-actions[bot]