Max H. Gerlach
# tensorflow-1.13.1-ubuntu16.04-py27-no_avx-westmere

TensorFlow 1.13.1 wheel built for Python 2.7 without GPU on Ubuntu 16.04 running on a Westmere CPU that does not support AVX instructions (one generation older than Sandy...
Hi, I'm not very savvy with Node development, but I ran into an issue trying to build the standalone Electron version, following the instructions at https://github.com/Joxit/docker-registry-ui/tree/main/examples/electron/README.md.

## Bug description

`npm...
Seen in https://github.com/horovod/horovod/runs/7771432718?check_suite_focus=true

Appears to be caused by this TF commit: https://github.com/tensorflow/tensorflow/commit/399e4071c471f6dc47bf245f8aeb8ab0c0374fce

```
2022-08-10T16:38:11.6788601Z #44 157.3 [ 66%] Building CXX object horovod/tensorflow/CMakeFiles/tensorflow.dir/mpi_ops.cc.o
2022-08-10T16:38:11.6792823Z #44 157.3 cd /tmp/pip-req-build-ggufvm1f/build/temp.linux-x86_64-3.8/RelWithDebInfo/horovod/tensorflow && /usr/bin/c++ -DEIGEN_MPL2_ONLY=1...
```
This PR should not have any functional consequences.

1. There was some unused code in `horovod/torch` remaining from an earlier implementation change (from before `mpi_ops_v2.cc`). Its presence caused some confusion...
## Checklist before submitting

- [x] Did you read the [contributor guide](https://github.com/horovod/horovod/blob/master/CONTRIBUTING.md)?
- [ ] Did you update the docs?
- [x] Did you write any tests to validate this...
Just observed on master: https://github.com/horovod/horovod/runs/7232373749?check_suite_focus=true

```
...
[1,0]:test_torch.py::TorchTests::test_horovod_allreduce_average
[1,1]:
[1,1]:test_torch.py::TorchTests::test_horovod_allreduce_average
[1,1]:PASSED[1,0]:PASSED[1,1]:
[1,1]:test_torch.py::TorchTests::test_horovod_allreduce_cpu_gpu_error
[1,0]:
[1,0]:test_torch.py::TorchTests::test_horovod_allreduce_cpu_gpu_error
[1,1]:SKIPPED[1,0]:SKIPPED[1,1]:
[1,1]:test_torch.py::TorchTests::test_horovod_allreduce_duplicate_name_error
[1,0]:
[1,0]:test_torch.py::TorchTests::test_horovod_allreduce_duplicate_name_error
[1,1]:FAILED[1,1]:
[1,0]:[2022-07-07 12:11:45.233659: W[1,0]:
[1,0]:/Users/runner/work/horovod/horovod/horovod/common/stall_inspector.cc:107]
[1,0]:One or more tensors were submitted to...
```
## Checklist before submitting

- [x] Did you read the [contributor guide](https://github.com/horovod/horovod/blob/master/CONTRIBUTING.md)?
- [ ] Did you update the docs?
- [ ] Did you write any tests to validate...
In our CMake scripts we use `execute_process` in a couple of places, for example to infer build flags with a short Python script. When such a script fails, for example...
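To illustrate the failure mode in general terms, here is a minimal Python sketch of running a short flag-inferring snippet in a child process and checking its exit code rather than trusting whatever it printed. The helper name `run_flag_script` and the example snippets are hypothetical and not taken from the Horovod build; in CMake itself, the comparable check would inspect the variable named by the `RESULT_VARIABLE` option of `execute_process`.

```python
import subprocess
import sys


def run_flag_script(source):
    """Run a short Python snippet and return its stdout.

    Hypothetical helper for illustration only. It raises if the snippet
    exits with a non-zero code, so a failure is not mistaken for an
    empty set of flags.
    """
    result = subprocess.run(
        [sys.executable, "-c", source],
        capture_output=True,
        text=True,
    )
    if result.returncode != 0:
        # Surface the child's stderr instead of silently continuing.
        raise RuntimeError(
            "flag script failed with exit code %d:\n%s"
            % (result.returncode, result.stderr)
        )
    return result.stdout.strip()


if __name__ == "__main__":
    # A snippet that succeeds and prints a (made-up) flag.
    print(run_flag_script("print('-O2')"))
    # A snippet that fails; the error is reported rather than ignored.
    try:
        run_flag_script("import sys; sys.exit(3)")
    except RuntimeError as err:
        print(err)
```

The point of the sketch is only that the caller checks the exit status explicitly; without such a check, a failing script can look like one that produced no flags.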
Analogous to #3660 for Allreduce.
They should have the same effect as the arguments to Allreduce, and the implementation should be equally efficient.