Dick Carter

Results 5 comments of Dick Carter

With cuda 11.0 of course comes support for A100 and TensorFloat-32 (TF32). FYI, I'm preparing a PR that refactors how our unittests handle tolerances, so that the proper tolerances are...

Before I dive into this more, could you check if the suggestions here for rebooting are helpful in this case: https://stackoverflow.com/questions/65721900/failed-to-initialize-nvml-driver-library-version-mismatch-is-ubuntu-server

Let me suggest a few things that may be involved in these results: - The BatchNorm implementations may not update the moving mean and variance at the same time. Some...

I'm seeing download failures broadly for this site, including other download requests of the mxnet CI like: ``` # wget http://data.mxnet.io/models/imagenet/resnet/18-layers/resnet-18-symbol.json --2024-02-08 00:55:10-- http://data.mxnet.io/models/imagenet/resnet/18-layers/resnet-18-symbol.json Resolving data.mxnet.io (data.mxnet.io)... 142.251.46.243, 2607:f8b0:4005:810::2013 Connecting...

The above-posted URL's from https://apache-mxnet.s3-accelerate.dualstack.amazonaws.com now work. The one I posted from https://data.mxnet.io.s3-website-us-west-1.amazonaws.com does not.