Hao Wu
It looks like the default ml20-all sample doesn't call sparse kernels. All I see is cublasSgemm. The log says "NNDataSet::CalculateSparseDatapointCounts: Maximum sparse datapoints (9254) per example in dataset gl_input too large for...
When I run `bazel run --cuda //examples:hello_cuda`, it fails with the following error: gcc-8: error: unrecognized command line option '-Wthread-safety'; did you mean '-fthread-jumps'? gcc-8: error: unrecognized command line...
# What does this PR do?
Improve the partition_dim checker, i.e. explicitly check whether it "is None". Get rid of some out-of-date fallback. pg_collection.use_mpu_process_group now has all the...
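As a rough illustration of why an explicit "is None" check can matter for something like partition_dim: if 0 is a legitimate dimension index (an assumption here, not stated in the PR excerpt), a truthiness test silently treats it as missing. The function names and the `default` value below are hypothetical and are not taken from the PR.

```python
from typing import Optional


def resolve_partition_dim_truthy(partition_dim: Optional[int], default: int = -1) -> int:
    """Hypothetical buggy variant: relies on truthiness, so 0 counts as 'unset'."""
    return partition_dim if partition_dim else default


def resolve_partition_dim(partition_dim: Optional[int], default: int = -1) -> int:
    """Hypothetical fixed variant: only None counts as 'unset'."""
    return default if partition_dim is None else partition_dim


if __name__ == "__main__":
    # A dimension index of 0 is preserved only by the explicit None check.
    print(resolve_partition_dim_truthy(0))  # falls back to the default
    print(resolve_partition_dim(0))         # keeps 0
    print(resolve_partition_dim(None))      # falls back to the default
```

The two variants differ only for falsy non-None inputs such as 0, which is exactly the case a truthiness check gets wrong.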