dask-cloudprovider icon indicating copy to clipboard operation
dask-cloudprovider copied to clipboard

Add support for AWS Batch

Open AlJohri opened this issue 5 years ago • 7 comments

It seems like AWS Batch can handle a lot of the logistics of bringing up and down the cluster automatically given a Docker image. I'm curious if it would make sense to integrate this into cloudprovider in some way: https://docs.aws.amazon.com/batch/latest/userguide/multi-node-parallel-jobs.html

AlJohri avatar Mar 04 '20 21:03 AlJohri

I would be supportive of this. It would be good to add a BatchCluster as another option.

I don't currently have time to work on this, but would be happy to review a PR.

jacobtomlinson avatar Mar 16 '20 15:03 jacobtomlinson

I wonder if there are any lessons to be learned or integration opportunities with https://github.com/nrdg/cloudknot?

hammer avatar May 19 '20 14:05 hammer

Thanks for highlighting this @hammer.

I'm not sure the two are trying to achieve the same goals. Cloudknot seems to be trying to execute functions remotely on Batch from within some Python code.

Dask would be attempting to bootstrap a distributed cluster on Batch and then run the function on the distributed cluster.

jacobtomlinson avatar May 19 '20 15:05 jacobtomlinson

Hi! Haven't contributed to Dask before but I'd be willing to look into this issue. I'm guessing the idea is to write a class similar to the Fargate one in https://github.com/dask/dask-cloudprovider/blob/master/dask_cloudprovider/providers/aws/ecs.py#L1167 but targeting AWS Batch?

samuelsinayoko avatar Jun 02 '20 11:06 samuelsinayoko

@samuelsinayoko that's right!

jacobtomlinson avatar Jun 04 '20 12:06 jacobtomlinson

Any news?

imanebosch avatar Dec 13 '20 10:12 imanebosch

This would be great feature if supported

vkarunarthne avatar Oct 31 '22 18:10 vkarunarthne