ozone icon indicating copy to clipboard operation
ozone copied to clipboard

HDDS-11367. Improve ozone balancing status command output

Open juncevich opened this issue 1 year ago • 9 comments

What changes were proposed in this pull request?

Minor improvements to command output:

  1. job task time consumption
  2. iteration time consumption
  3. unfinished iteration shouldn't be in iteration history section
  4. show data units, even it less than gb show it in mb

What is the link to the Apache JIRA

HDDS-11367

How was this patch tested?

Tested by robot test

juncevich avatar Aug 30 '24 13:08 juncevich

@juncevich can you please elaborate on what these points mean?

job task time consumption iteration time consumption

It will help reviewers understand what you're proposing.

Hi, @siddhantsangwan. Sure, i can explain. Balancing job constists from some iterations. Job task time - the whole balancing time, iteration time - particular iteration time. Job time = iteration time + iteration time ...

juncevich avatar Sep 11 '24 13:09 juncevich

Hi siddhantsangwan. Can you review my PR?

juncevich avatar Sep 23 '24 19:09 juncevich

@juncevich can you please check test timeouts / failures?

org.apache.hadoop.hdds.scm.container.balancer.TestContainerBalancerDatanodeNodeLimit
org.apache.hadoop.hdds.scm.container.balancer.TestContainerBalancerStatusInfo
org.apache.hadoop.hdds.scm.container.balancer.TestContainerBalancerTask

https://github.com/juncevich/ozone/actions/runs/11380961580/artifacts/2067990610

adoroszlai avatar Oct 17 '24 15:10 adoroszlai

@juncevich can you please check test timeouts / failures?

org.apache.hadoop.hdds.scm.container.balancer.TestContainerBalancerDatanodeNodeLimit
org.apache.hadoop.hdds.scm.container.balancer.TestContainerBalancerStatusInfo
org.apache.hadoop.hdds.scm.container.balancer.TestContainerBalancerTask

https://github.com/juncevich/ozone/actions/runs/11380961580/artifacts/2067990610

Thanks for the comment. Tests fixed.

juncevich avatar Oct 17 '24 19:10 juncevich

@siddhantsangwan can you please take another look?

adoroszlai avatar Oct 18 '24 05:10 adoroszlai

@juncevich Is this ready for another round of reviews?

siddhantsangwan avatar Oct 22 '24 08:10 siddhantsangwan

@juncevich Is this ready for another round of reviews?

Yes, I try to fix the review notices ASAP because they are blocking another fix. I didn’t bring it up because it’s a different task

juncevich avatar Oct 22 '24 09:10 juncevich

@ivanzlenko No need to use Atomic for OffsetDateTime, because it thread-safe by design. image

juncevich avatar Oct 23 '24 21:10 juncevich

@siddhantsangwan, can you take another look at this PR?

juncevich avatar Oct 26 '24 09:10 juncevich

@juncevich please let me know when you've addressed comments and this is ready for another review.

siddhantsangwan avatar Nov 05 '24 12:11 siddhantsangwan

please let me know when you've addressed comments and this is ready for another review.

@siddhantsangwan I've not understood about comments, can you explain about? PR is ready for another review.

juncevich avatar Nov 05 '24 13:11 juncevich

@siddhantsangwan, can you take another look at this PR?

juncevich avatar Nov 11 '24 07:11 juncevich

it seems issues @siddhantsangwan mentioned have been addressed. @siddhantsangwan could you take look again, please?

myskov avatar Nov 22 '24 11:11 myskov

@juncevich if you've addressed all comments and fixed tests, I can start the CI workflow.

siddhantsangwan avatar Nov 22 '24 12:11 siddhantsangwan

@juncevich if you've addressed all comments and fixed tests, I can start the CI workflow.

Great, thanks! Just fixed the flaky test. Everything should be OK.

juncevich avatar Nov 22 '24 12:11 juncevich

@siddhantsangwan I've fixed your review notices. Could you look through the changed code? I have to fix the robot test and PR will be ready.

juncevich avatar Nov 25 '24 21:11 juncevich

@siddhantsangwan robot test fixed. PR is ready for another review round.

juncevich avatar Nov 26 '24 21:11 juncevich

@juncevich Thanks for updating. However, please try not to force push changes as that makes it hard to review the updates.

siddhantsangwan avatar Dec 02 '24 06:12 siddhantsangwan

@juncevich Thanks for updating. However, please try not to force push changes as that makes it hard to review the updates.

@siddhantsangwan I respect your point, but I can't entirely agree with it. You can choose the last commit with all fixes and without 'dirty' changes(commits). Force push makes the commit history more readable. https://github.com/apache/ozone/pull/7139/commits/2e60bf3a340438607dd706ef0cad052853ae56be

juncevich avatar Dec 02 '24 09:12 juncevich

While trying to resolve merge conflicts almost broke everything. Sorry.

juncevich avatar Dec 04 '24 08:12 juncevich

@juncevich thanks for updating the patch. LGTM. @siddhantsangwan do you have any other comments?

myskov avatar Dec 11 '24 12:12 myskov

TestContainerBalancerStatusInfo.testGetCurrentStatisticsWhileBalancingInProgress seems failing. https://github.com/apache/ozone/actions/runs/12313619015/job/34368477497?pr=7551

ashishkumar50 avatar Dec 13 '24 11:12 ashishkumar50

https://github.com/apache/ozone/actions/runs/12313619015/job/34368477497?pr=7551

Thanks. It is still flaky. Will check.

juncevich avatar Dec 13 '24 11:12 juncevich

Created HDDS-11927 for the flaky test.

adoroszlai avatar Dec 13 '24 13:12 adoroszlai

Added PR with fix HDDS-11927: https://github.com/apache/ozone/pull/7579

juncevich avatar Dec 14 '24 10:12 juncevich