
Benchmarks: Micro benchmark - collect per-snapshot per-GPU flops/temp in gpu burn

Open yukirora opened this issue 5 months ago • 1 comment

Description gpu burn: collect per-snapshot per-GPU flops/temp and add summary metrics

Major Revision

  • Parse all performance snapshot lines containing "Gflop/s" and record per-snapshot, per-GPU metrics: gpu_<snap_idx>_gflops:<gpu_index> and gpu_<snap_idx>_temp:<gpu_index>
  • Aggregate per-GPU statistics across snapshots:
      • Per-GPU average flops: gpu_avg_gflops:<gpu_index>
      • Per-GPU flops variability metric: gpu_var_gflops:<gpu_index> (simple max-min based metric)
      • Per-GPU max temperature: gpu_max_temp:<gpu_index>
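
The parsing and aggregation described above can be sketched roughly as follows. This is an illustrative sketch, not the code in this PR: the function name `parse_snapshots`, the regular expressions, and the assumed shape of a gpu-burn snapshot line are all assumptions, and the metric keys assume the underscore-separated form gpu_<snap_idx>_gflops:<gpu_index>. The description only says the variability metric is "max-min based", so plain `max - min` is used here as a stand-in.

```python
import re
from collections import defaultdict

# Assumed snapshot line shape (not confirmed by the PR), for example:
#   50.0%  proc'd: 13 (5866 Gflop/s) - 13 (5870 Gflop/s)   errors: 0 - 0   temps: 45 C - 46 C
GFLOPS_RE = re.compile(r'\(\s*(\d+(?:\.\d+)?)\s*Gflop/s\)')
TEMP_RE = re.compile(r'(\d+)\s*C\b')


def parse_snapshots(raw_output):
    """Hypothetical helper: return a flat dict of per-snapshot and per-GPU summary metrics."""
    result = {}
    flops_by_gpu = defaultdict(list)
    temps_by_gpu = defaultdict(list)
    snap_idx = 0

    for line in raw_output.splitlines():
        if 'Gflop/s' not in line:
            continue
        gflops = [float(v) for v in GFLOPS_RE.findall(line)]
        if not gflops:
            continue
        # Only read temperatures that follow the "temps:" label, if it is present.
        temps_part = line.split('temps:', 1)[1] if 'temps:' in line else ''
        temps = [int(v) for v in TEMP_RE.findall(temps_part)]

        # Per-snapshot, per-GPU metrics; GPU index is the position within the line.
        for gpu, value in enumerate(gflops):
            result[f'gpu_{snap_idx}_gflops:{gpu}'] = value
            flops_by_gpu[gpu].append(value)
        for gpu, value in enumerate(temps):
            result[f'gpu_{snap_idx}_temp:{gpu}'] = value
            temps_by_gpu[gpu].append(value)
        snap_idx += 1

    # Per-GPU aggregates across all snapshots.
    for gpu, values in flops_by_gpu.items():
        result[f'gpu_avg_gflops:{gpu}'] = sum(values) / len(values)
        # Assumption: variability is taken as max - min; the PR may normalize it differently.
        result[f'gpu_var_gflops:{gpu}'] = max(values) - min(values)
    for gpu, values in temps_by_gpu.items():
        result[f'gpu_max_temp:{gpu}'] = max(values)
    return result
```

Keeping the per-snapshot lists keyed by GPU index means the summary metrics fall out of a single pass over the output, and lines without "Gflop/s" are ignored.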

yukirora, Sep 08 '25 11:09

Codecov Report

:x: Patch coverage is 95.23810% with 2 lines in your changes missing coverage. Please review.
:white_check_mark: Project coverage is 85.88%. Comparing base (0b4311c) to head (80eecae).
:warning: Report is 13 commits behind head on main.

Files with missing lines                                 Patch %   Lines
...bench/benchmarks/micro_benchmarks/gpu_burn_test.py    95.23%    2 Missing :warning:

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #735      +/-   ##
==========================================
- Coverage   86.47%   85.88%   -0.59%     
==========================================
  Files         102      102              
  Lines        7541     8204     +663     
==========================================
+ Hits         6521     7046     +525     
- Misses       1020     1158     +138     

Flag                       Coverage Δ
cpu-python3.10-unit-test   71.09% <95.23%> (-0.51%) :arrow_down:
cpu-python3.12-unit-test   71.09% <95.23%> (-0.51%) :arrow_down:
cpu-python3.7-unit-test    70.54% <95.23%> (-0.12%) :arrow_down:
cuda-unit-test             83.67% <95.23%> (-0.32%) :arrow_down:


codecov[bot], Sep 09 '25 17:09