amazon-cloudwatch-agent icon indicating copy to clipboard operation
amazon-cloudwatch-agent copied to clipboard

NVIDIA GPU metrics in CloudWatch Explorer for AutoScaling Group

Open thawro opened this issue 9 months ago • 2 comments

Is it possible to view the GPU metrics in CloudWatch Dashboard explorer? I would like to plot GPU metrics across my AutoScaling Group. I set up the GPU monitoring on my EC2 instances according to: tutorial, but all I can plot is a single instance gpu metrics (as normal line plot) and not in an explorer for the whole ASG.

thawro avatar Apr 14 '25 21:04 thawro

Hi @thawro, thanks for reaching out! The agent is responsible for delivering the metrics to CloudWatch console, for dashboard question I suggest reaching out to AWS support for more clarifications. If you are missing GPU metrics, feel free to let us know and we can provide the debug process

zhihonl avatar Apr 17 '25 18:04 zhihonl

It sounds like you want to have the metrics sent with the AutoScaling Group as a dimension so you can see them in aggregate. I'm not sure what the configuration in the tutorial you followed is, but you'll need to add the AutoScalingGroupName dimension to your append_dimensions and aggregation_dimensions. See https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/CloudWatch-Agent-Configuration-File-Details.html for more details.

{
  "metrics": {
    "aggregation_dimensions" : [["AutoScalingGroupName"]],
    "append_dimensions": {
      "AutoScalingGroupName": "${aws:AutoScalingGroupName}"
    },
    ...
  }
}

jefchien avatar Apr 21 '25 19:04 jefchien

Please re-open if you require further assistance.

sky333999 avatar Jul 19 '25 03:07 sky333999