Brian Ko
Brian Ko
### Willingness to contribute The MLflow Community encourages bug fix contributions. Would you or another member of your organization be willing to contribute a fix for this bug to the...
I notice that so far there are JAX, TensorFlow, and NumPy backends. Are there any plans to have a PyTorch backend in the future?
### Is this a feature request or a bug? I updated my manifest version to version 3 and I can't use `web-ext` any longer. ### What is the current behavior?...
Is there a way to specify a template for each daily plan?
Implemented a basic script that demonstrates distributed processing with Hydra, as mentioned in #42. The command to run the script is: ```shell python ddp_00.py -m rank=... init_method=... ``` where `rank`...
Implements [ImageNet DDP](https://github.com/pytorch/examples/blob/master/imagenet/main.py), as mentioned in #33. Most of the code is the same, and the major differences are the handling of distributed processes and the configuration. One can use...
Distributed processing with Hydra in single-node multi-GPU setting, as mentioned [here](https://github.com/pytorch/hydra-torch/pull/38#issuecomment-738991461). - [ ] Explain PyTorch's distributed processing/training. - [ ] Simple demonstration of various distributed communication primitives. - [...
An example of DDP for ImageNet using multirun, as discussed with @omry in the Hydra repo.
Hi @abohmeed, I noticed that you have the file `kubernetes/node_startup.tmpl` that you aren't including in your Terraform setup right now. I read through the file and it seems that it's...