Infra improvements for Helix

Open agocke opened this issue 3 years ago • 6 comments

List of requests from Helix to reduce failure overhead from PayloadGroup0, which seems to be returning a non-zero exit code very often.

  • Investigate potentially reducing/combining files. There is significant overhead “per file”, and this run produces a lot of files that are fairly low value, from what I can see in them. Maybe it would be possible to only output a single, combined file? Or maybe only generate the files when something “interesting happens”?
  • Potentially name the PayloadGroup0 different things in different scenarios, so that it’s easier to identify where problems are coming from.
  • Investigate why this item is failing, on average, 1600 times a day. If it's failing this often, its value might be low: it seems unlikely anyone has the time to investigate 1600 failures every day. Maybe specific tests inside it can be disabled if we have some way to dig a bit more into what's going on inside them. (We might be able to help with the investigation here if needed.)
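
The first two requests above could be approached roughly as follows. This is a minimal sketch only, not how the runtime test scripts or Helix actually work: the function names `work_item_name` and `combine_logs`, the `*.log` file pattern, and the name format are all hypothetical and chosen purely for illustration. It builds a descriptive work item name from the scenario, and writes a single combined log only when the run returned a non-zero exit code.

```python
from pathlib import Path


def work_item_name(runtime: str, os_name: str, arch: str, config: str) -> str:
    """Hypothetical helper: a per-scenario name instead of a generic 'PayloadGroup0'."""
    parts = [runtime, os_name, arch, config]
    return "-".join(p.strip().replace(" ", "_") for p in parts if p.strip())


def combine_logs(log_dir: Path, combined: Path, exit_code: int) -> bool:
    """Hypothetical helper: write one combined log, and only when the run failed.

    Returns True if a combined file was written.
    """
    if exit_code == 0:
        # Nothing "interesting" happened, so produce no file at all.
        return False

    # Collect inputs before opening the output, and never read the output itself.
    logs = sorted(
        p for p in log_dir.glob("*.log") if p.resolve() != combined.resolve()
    )
    with combined.open("w", encoding="utf-8") as out:
        for log in logs:
            out.write(f"===== {log.name} =====\n")
            out.write(log.read_text(encoding="utf-8", errors="replace"))
            out.write("\n")
    return True
```

With something like this, a passing run would upload no log files, and a failing run would upload one file whose section headers still show which per-test log each part came from.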

Runfo Tracking Issue: payloadgroup0 work item

Build Definition Kind Run Name Console Core Dump Test Results Run Client
1975461 runtime PR 74744 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20220824230426-06f234f runclient.py
1975446 runtime PR 74736 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20220824230426-06f234f runclient.py
1974819 runtime PR 74623 mono windows x64 Release @ Windows.10.Amd64.Open console.log core dump runclient.py
1974819 runtime PR 74623 coreclr windows x64 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1972465 runtime Rolling coreclr windows arm64 Checked @ Windows.10.Arm64v8.Open console.log core dump runclient.py
1972160 runtime PR 74719 coreclr OSX arm64 Checked no_tiered_compilation @ OSX.1200.ARM64.Open console.log runclient.py
1972160 runtime PR 74719 coreclr OSX x64 Checked no_tiered_compilation @ OSX.1200.Amd64.Open console.log runclient.py
1972160 runtime PR 74719 coreclr Linux x64 Checked no_tiered_compilation @ Ubuntu.1804.Amd64.Open console.log core dump runclient.py
1972160 runtime PR 74719 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20220824230426-06f234f console.log core dump runclient.py
1972160 runtime PR 74719 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log core dump runclient.py
1972160 runtime PR 74719 coreclr windows arm64 Checked no_tiered_compilation @ Windows.10.Arm64v8.Open console.log
1972160 runtime PR 74719 coreclr windows x64 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1970957 runtime PR 74623 coreclr windows x64 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1970957 runtime PR 74623 mono windows x64 Release @ Windows.10.Amd64.Open console.log core dump runclient.py
1969744 runtime PR 74623 coreclr Linux x64 Checked no_tiered_compilation @ Ubuntu.1804.Amd64.Open console.log runclient.py
1969744 runtime PR 74623 mono windows x64 Release @ Windows.10.Amd64.Open console.log core dump runclient.py
1969744 runtime PR 74623 coreclr windows x64 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1965145 runtime PR 74532 coreclr windows x86 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1965145 runtime PR 74532 coreclr windows x64 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1965145 runtime PR 74532 coreclr windows arm64 Checked no_tiered_compilation @ Windows.10.Arm64v8.Open console.log core dump runclient.py
1964132 runtime PR 74428 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1964010 runtime PR 74501 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1964010 runtime PR 74501 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1964002 runtime PR 74499 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1964002 runtime PR 74499 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1963992 runtime PR 74379 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1963992 runtime PR 74379 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1963985 runtime PR 74500 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1963985 runtime PR 74500 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1963947 runtime PR 73060 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1963947 runtime PR 73060 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1963889 runtime PR 74498 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1963889 runtime PR 74498 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1963822 runtime PR 74238 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1963792 runtime PR 74495 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1963792 runtime PR 74495 mono Linux arm64 Release @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1963188 runtime PR 74379 Mono Browser wasm Release @ Ubuntu.1804.Amd64.Open console.log runclient.py
1963188 runtime PR 74379 mono windows x64 Release @ Windows.10.Amd64.Open console.log runclient.py
1963188 runtime PR 74379 mono OSX x64 Release @ OSX.1200.Amd64.Open console.log runclient.py
1963188 runtime PR 74379 mono OSX x64 Release @ OSX.1200.Amd64.Open console.log runclient.py
1963188 runtime PR 74379 mono Linux x64 Release @ Ubuntu.1804.Amd64.Open console.log runclient.py
1963188 runtime PR 74379 mono Linux x64 Release @ Ubuntu.1804.Amd64.Open console.log runclient.py
1963188 runtime PR 74379 mono Linux arm64 Release @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log runclient.py
1962479 runtime PR 74379 Mono Browser wasm Release @ Ubuntu.1804.Amd64.Open console.log runclient.py
1962479 runtime PR 74379 mono windows x64 Release @ Windows.10.Amd64.Open console.log core dump runclient.py
1962479 runtime PR 74379 mono OSX x64 Release @ OSX.1200.Amd64.Open console.log runclient.py
1962479 runtime PR 74379 mono OSX x64 Release @ OSX.1200.Amd64.Open console.log runclient.py
1959643 runtime PR 74363 mono windows x64 Release @ Windows.10.Amd64.Open console.log core dump runclient.py
1959643 runtime PR 74363 coreclr windows x86 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1959643 runtime PR 74363 coreclr windows arm64 Checked no_tiered_compilation @ Windows.10.Arm64v8.Open console.log runclient.py
1959643 runtime PR 74363 coreclr windows x64 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1959643 runtime PR 74363 mono Linux x64 Release @ Ubuntu.1804.Amd64.Open console.log core dump runclient.py
1959643 runtime PR 74363 mono OSX x64 Release @ OSX.1200.Amd64.Open console.log runclient.py
1959643 runtime PR 74363 mono OSX x64 Release @ OSX.1200.Amd64.Open console.log runclient.py
1959643 runtime PR 74363 mono Linux x64 Release @ Ubuntu.1804.Amd64.Open console.log core dump runclient.py
1959643 runtime PR 74363 mono Linux arm64 Release @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log core dump runclient.py
1959487 runtime Rolling coreclr windows x86 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1958648 runtime PR 74343 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1958648 runtime PR 74343 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1958619 runtime PR 72717 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1958538 runtime PR 74341 coreclr Linux arm Checked no_tiered_compilation @ (Ubuntu.1804.Arm32.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm32v7-bfcd90a-20200121150440 console.log
1958538 runtime PR 74341 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1958433 runtime PR 73968 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1958356 runtime PR 73289 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
1958356 runtime PR 73289 mono Linux arm64 Release @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log
22223 runtime PR 68610 coreclr OSX x64 Checked no_tiered_compilation @ OSX.1200.Amd64.Open console.log runclient.py
21112 runtime PR 75675 Mono Browser wasm Release @ Ubuntu.1804.Amd64.Open console.log core dump runclient.py
20317 runtime PR 74623 coreclr windows x86 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
20317 runtime PR 74623 coreclr windows x64 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
19745 runtime PR 75748 mono OSX x64 Release @ OSX.1200.Amd64.Open console.log runclient.py
19745 runtime PR 75748 mono windows x64 Release @ Windows.10.Amd64.Open console.log core dump runclient.py
19745 runtime PR 75748 mono Linux x64 Release @ Ubuntu.1804.Amd64.Open console.log core dump runclient.py

Build Result Summary

Day Hit Count | Week Hit Count | Month Hit Count
0 | 4 | 30

agocke, Apr 18 '22 20:04

Tagging subscribers to this area: @dotnet/runtime-infrastructure See info in area-owners.md if you want to be subscribed.

Issue Details

Author: agocke
Assignees: -
Labels:

area-Infrastructure

Milestone: -

ghost, Apr 18 '22 20:04

cc @MattGal @ChadNedzlek

agocke, Apr 18 '22 20:04

Updated the 1600 -> 100 (the 1600 was from a bad query I made). 100 is still a lot per day though.

ChadNedzlek, Apr 18 '22 20:04

I'm just following along for now; please ping me if you want me to participate in any investigations or check whether improvements work (though if the work items start passing, that's a very good sign)

MattGal, Apr 18 '22 23:04

Tagging subscribers to this area: @hoyosjs See info in area-owners.md if you want to be subscribed.

Issue Details

Runfo Tracking Issue: payloadgroup0 work item

Build Definition Kind Run Name Console Core Dump Test Results Run Client
1902122 runtime Rolling coreclr windows x64 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1897450 runtime PR 72021 Mono Browser wasm Release @ Ubuntu.1804.Amd64.Open console.log runclient.py
1895796 runtime PR 72229 coreclr Linux x64 Checked no_tiered_compilation @ Ubuntu.1804.Amd64.Open console.log core dump runclient.py
1895796 runtime PR 72229 coreclr Linux arm64 Checked no_tiered_compilation @ (Ubuntu.1804.Arm64.Open)[email protected]/dotnet-buildtools/prereqs:ubuntu-18.04-helix-arm64v8-20210531091519-97d8652 console.log core dump runclient.py
1895796 runtime PR 72229 coreclr windows arm64 Checked no_tiered_compilation @ Windows.10.Arm64v8.Open console.log
1895796 runtime PR 72229 coreclr windows x64 Checked no_tiered_compilation @ Windows.10.Amd64.Open console.log core dump runclient.py
1892945 runtime PR 72529 mono windows x64 Release @ Windows.10.Amd64.Open console.log runclient.py
1892535 runtime PR 72517 Mono Browser wasm Release @ Ubuntu.1804.Amd64.Open console.log runclient.py
1892303 runtime Rolling coreclr windows x64 Checked @ Windows.10.Amd64.Open console.log core dump runclient.py
1889847 runtime PR 72021 Mono Browser wasm Release @ Ubuntu.1804.Amd64.Open console.log runclient.py
1886956 runtime Rolling coreclr windows x64 Checked @ Windows.10.Amd64.Open console.log core dump runclient.py
1883558 runtime PR 62863 Mono Browser wasm Release @ Ubuntu.1804.Amd64.Open console.log runclient.py

Build Result Summary

Day Hit Count | Week Hit Count | Month Hit Count
1 | 7 | 9
Author: agocke
Assignees: -
Labels:

area-Infrastructure-coreclr, blocking-clean-ci

Milestone: 7.0.0

ghost, Jul 26 '22 04:07

@agocke Is it a good idea to have a runfo tracking issue for something this broad? (That is, whenever the common coreclr tests fail, the failure immediately gets added here.)

hoyosjs, Aug 05 '22 18:08

I think this is no longer interesting -- we're tracking things at a more granular level now and this is just catching everything in coreclr Pri0. Closing.

agocke, Jul 10 '23 16:07