Neuron support in Axlearn
This PR enables use of neuron devices in Axlearn for model training.
- Chooses correct mesh for TRN devices for Fuji 7B with the mesh selector flag
--mesh_selector=neuron-trn1.32xlarge-64
@apoorvtintin I see this PR is quite stale for sometime. If no objection, I'd like to have @Ruixuan who is working on Trn from our end to port your change and continue iterate it?
@apoorvtintin I see this PR is quite stale for sometime. If no objection, I'd like to have @Ruixuan who is working on Trn from our end to port your change and continue iterate it?
Apoorv is on PTO right now. I am OK with you all taking over this PR. Can you add us as a reviewer when you finish? Thanks
Thanks for all the reviews, I fixed most of the comments on the PR.
Is this PR still needed?
Not needed, closing this