DAOS-14631 test: Adding WAL/Meta SSD fault/reintegration tests

Open phender opened this issue 2 years ago • 20 comments

Also fixing test_dmg_storage_query_device_state to handle setting SysXS devices faulty.

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium-md-on-ssd: false
Allow-unstable-test: true
Test-tag: test_dmg_storage_query_device_state

Required-githooks: true
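
The pragma lines above are plain `Key: value` trailers in the commit message. As a rough illustration only, here is a minimal sketch of how such trailers might be collected, assuming one pragma per line. The `parse_pragmas` helper and its regular expression are hypothetical and are not taken from the DAOS CI scripts.

```python
import re

# Hypothetical pattern: a key made of letters, digits and hyphens,
# followed by a colon and the value. Not the DAOS CI implementation.
PRAGMA_RE = re.compile(r"^([A-Za-z][A-Za-z0-9-]*):\s*(.*)$")


def parse_pragmas(message):
    """Return a dict of 'Key: value' pragmas found one per line in message."""
    pragmas = {}
    for line in message.splitlines():
        match = PRAGMA_RE.match(line.strip())
        if match:
            pragmas[match.group(1)] = match.group(2).strip()
    return pragmas


# Commit message text copied from this PR description.
message = """DAOS-14631 test: Adding WAL/Meta SSD fault/reintegration tests

Also fixing test_dmg_storage_query_device_state to handle setting SysXS devices faulty.

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-medium-md-on-ssd: false
Allow-unstable-test: true
Test-tag: test_dmg_storage_query_device_state

Required-githooks: true
"""

pragmas = parse_pragmas(message)
print(pragmas["Test-tag"])
```

The title line is not picked up as a pragma because its text before the first colon contains a space, which the hypothetical key pattern does not allow.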

Before requesting gatekeeper:

  • [ ] Two review approvals and any prior change requests have been resolved.
  • [ ] Testing is complete and all tests passed or there is a reason documented in the PR why it should be force landed and forced-landing tag is set.
  • [ ] Features: (or Test-tag*) commit pragma was used or there is a reason documented that there are no appropriate tags for this PR.
  • [ ] Commit messages follow the guidelines outlined here.
  • [ ] Any tests skipped by the ticket being addressed have been run and passed in the PR.

Gatekeeper:

  • [ ] You are the appropriate gatekeeper to be landing the patch.
  • [ ] The PR has 2 reviews by people familiar with the code, including appropriate owners.
  • [ ] Githooks were used. If not, request that user install them and check copyright dates.
  • [ ] Checkpatch issues are resolved. Pay particular attention to ones that will show up on future PRs.
  • [ ] All builds have passed. Check non-required builds for any new compiler warnings.
  • [ ] Sufficient testing is done. Check feature pragmas and test tags and that tests skipped for the ticket are run and now pass with the changes.
  • [ ] If applicable, the PR has addressed any potential version compatibility issues.
  • [ ] Check the target branch. If it is master branch, should the PR go to a feature branch? If it is a release branch, does it have merge approval in the JIRA ticket?
  • [ ] Extra checks if forced landing is requested
    • [ ] Review comments are sufficiently resolved, particularly by prior reviewers that requested changes.
    • [ ] No new NLT or valgrind warnings. Check the classic view.
    • [ ] Quick-build or Quick-functional is not used.
  • [ ] Fix the commit message upon landing. Check the standard here. Edit it to create a single commit. If necessary, ask submitter for a new summary.

phender avatar Feb 06 '24 23:02 phender

Bug-tracker data: Ticket title is 'Test WAL/Meta SSD faulty & reintegration' Status is 'In Progress' Labels: 'md_on_ssd,test_2.6' https://daosio.atlassian.net/browse/DAOS-14631

github-actions[bot] avatar Feb 06 '24 23:02 github-actions[bot]

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/1/execution/node/919/log

daosbuild1 avatar Feb 07 '24 05:02 daosbuild1

Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/3/execution/node/1060/log

daosbuild1 avatar Feb 12 '24 23:02 daosbuild1

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/3/execution/node/1041/log

daosbuild1 avatar Feb 12 '24 23:02 daosbuild1

Bug-tracker data: Ticket title is 'Test WAL/Meta SSD faulty & reintegration' Status is 'In Progress' Labels: 'md_on_ssd,test_2.6' https://daosio.atlassian.net/browse/DAOS-14631

github-actions[bot] avatar Feb 21 '24 04:02 github-actions[bot]

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/4/execution/node/897/log

daosbuild1 avatar Feb 21 '24 12:02 daosbuild1

Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/4/execution/node/995/log

daosbuild1 avatar Feb 21 '24 13:02 daosbuild1

Bug-tracker data: Ticket title is 'Test WAL/Meta SSD faulty & reintegration' Status is 'In Progress' Labels: 'md_on_ssd,test_2.6' https://daosio.atlassian.net/browse/DAOS-14631

github-actions[bot] avatar Feb 21 '24 18:02 github-actions[bot]

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/5/execution/node/897/log

daosbuild1 avatar Feb 22 '24 06:02 daosbuild1

Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/5/execution/node/994/log

daosbuild1 avatar Feb 22 '24 07:02 daosbuild1

Bug-tracker data: Ticket title is 'Test WAL/Meta SSD faulty & reintegration' Status is 'In Progress' Labels: 'md_on_ssd,test_2.6' https://daosio.atlassian.net/browse/DAOS-14631

github-actions[bot] avatar Feb 22 '24 20:02 github-actions[bot]

Test stage Functional Hardware Medium MD on SSD completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-13763/6/testReport/

daosbuild1 avatar Feb 23 '24 11:02 daosbuild1

Ticket title is 'Test WAL/Meta SSD faulty & reintegration' Status is 'In Progress' Labels: '2.6.0,md_on_ssd,test_2.6' https://daosio.atlassian.net/browse/DAOS-14631

github-actions[bot] avatar Feb 27 '24 22:02 github-actions[bot]

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/7/execution/node/922/log

daosbuild1 avatar Feb 28 '24 12:02 daosbuild1

Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/7/execution/node/943/log

daosbuild1 avatar Feb 28 '24 13:02 daosbuild1

Test stage Functional Hardware Medium MD on SSD completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-13763/9/testReport/

daosbuild1 avatar Mar 03 '24 16:03 daosbuild1

Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/9/execution/node/941/log

daosbuild1 avatar Mar 03 '24 17:03 daosbuild1

Test stage Functional Hardware Large completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/10/execution/node/1073/log

daosbuild1 avatar Mar 06 '24 01:03 daosbuild1

Test stage Functional Hardware Large completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/11/execution/node/1075/log

daosbuild1 avatar Mar 08 '24 16:03 daosbuild1

Test stage Functional Hardware Large completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-13763/12/execution/node/901/log

daosbuild1 avatar Mar 11 '24 19:03 daosbuild1

Test stage Functional Hardware Large MD on SSD completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-13763/15/testReport/

daosbuild1 avatar Mar 27 '24 01:03 daosbuild1

All tests passed in https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-13763/16/testReport/

phender avatar Mar 28 '24 14:03 phender

Force landing requested because testing was limited to the modified functional tests, all of which passed:

  • src/tests/ftest/control/dmg_storage_query.py
    • https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-13763/16/testReport/FTEST_control/DmgStorageQuery/
  • src/tests/ftest/nvme/fault.py
    • https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-13763/16/testReport/FTEST_nvme/NvmeFault/
  • src/tests/ftest/nvme/health.py
    • https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-13763/16/testReport/FTEST_nvme/NvmeHealth/

phender avatar Mar 28 '24 14:03 phender