examples icon indicating copy to clipboard operation
examples copied to clipboard

Example Tensor Parallelism Optimizer Bug

Open nrothGIT opened this issue 9 months ago • 0 comments

📚 Documentation

I believe the optimizer in this example should be declared after the parallelize module call, as in sequence parallelism. Without this, in latest torch, the example seems to not update the weights and thus not truly train. Please lmk if im missing anything and thanks so much for all your work!

Tiny fix PR below: https://github.com/pytorch/examples/pull/1324

nrothGIT avatar Apr 06 '25 22:04 nrothGIT