
How to achieve autoscaling when running MMS on Fargate?

Open • sunilkumarmohanty opened this issue 4 years ago • 1 comment

Hi,

I would like to autoscale my model workers based on the number of requests they receive, but I cannot find any documentation on how to do that. Could somebody please help me configure autoscaling?

I am running MMS on Fargate with autoscaling enabled at the task level, based on CPU. However, I don't know how to manage the scaling of model workers inside a task.

Br, Sunil

sunilkumarmohanty, May 18 '21 13:05

I'm facing the same issue!

sandruskyi, Jun 23 '22 10:06
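
For anyone landing here with the same question: one possible approach, offered as a rough sketch rather than a confirmed answer, is to adjust the worker count from inside the task through the MMS management API (`PUT /models/{model_name}` with `min_worker` / `max_worker`). The sketch below assumes the management API listens on its default address `http://localhost:8081`. The sizing policy `desired_workers`, its parameters, and the source of the pending-request count are all hypothetical; you would need to supply that number from your own metrics.

```python
import math
import urllib.parse
import urllib.request


def desired_workers(pending_requests, per_worker_capacity,
                    min_workers=1, max_workers=4):
    """Hypothetical policy: one worker per `per_worker_capacity`
    pending requests, clamped to [min_workers, max_workers]."""
    needed = math.ceil(pending_requests / per_worker_capacity)
    return max(min_workers, min(max_workers, needed))


def scale_url(model_name, workers, base="http://localhost:8081"):
    """Build the management API URL that sets the worker count.
    The base address is an assumption (MMS default management port)."""
    query = urllib.parse.urlencode({
        "min_worker": workers,
        "max_worker": workers,
        "synchronous": "true",  # wait until the workers are actually up
    })
    return f"{base}/models/{urllib.parse.quote(model_name)}?{query}"


def scale_workers(model_name, workers, base="http://localhost:8081"):
    """Send the scale request and return the HTTP status code."""
    request = urllib.request.Request(
        scale_url(model_name, workers, base), method="PUT")
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.status


if __name__ == "__main__":
    # "my_model" and the numbers are placeholders, not values from this thread.
    target = desired_workers(pending_requests=10, per_worker_capacity=4)
    print(scale_url("my_model", target))
```

Keep in mind that the CPU and memory allocated to the Fargate task cap how many workers are actually useful, so `max_workers` should probably be chosen with the task size in mind; scaling beyond that would still have to happen at the task level.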