arjkesh

Results 13 comments of arjkesh

@shantanutrip - can you merge in master - there are some changes that need to be made, importantly that "ec2" should be used in place of "e3"

@philschmid there are 10 tests currently failing. Can you help debug the tests that are not "test_repo_anaconda_present"? Three of the failing tests are "test_repo_anaconda_not_present" - this one @kevinyang8 can help...

@saimidu please also include a description of the dockerfile changes made - right now this PR only references the test

outside of the comment about mamba version, LGTM - the merge strategy will be as follows - revert buildspec changes - revert toml changes - merge - rebase the PT...

Hi @dkey-amazon - we are changing the name from "e3" to "ec2" going forward - please take a look at TF 2.9 images, which have been updated to reflect this,...

Hi @ProxJ - I know this issue was opened long ago - are you still having issues with this? We have mpi4py installed in latest PT 2.0 training containers https://github.com/aws/deep-learning-containers/blob/master/pytorch/training/docker/2.0/py3/Dockerfile.cpu#L122