tez icon indicating copy to clipboard operation
tez copied to clipboard

TEZ-4245: Optimise split grouping when locality information is set to…

Open rbalamohan opened this issue 5 years ago • 1 comments

https://issues.apache.org/jira/browse/TEZ-4245

Split information without any locality information (localhost/null/empty) should be treated equally, so that split grouping can do meaningful grouping based on cluster size. This is to avoid creating small split groups, which can significantly increase runtime due to sequential processing (i.e same map task getting lots of inputs and system ends up spending time in open/seek/close on objectstores).

rbalamohan avatar Oct 27 '20 05:10 rbalamohan

:confetti_ball: +1 overall

Vote Subsystem Runtime Comment
+0 :ok: reexec 17m 11s Docker mode activated.
_ Prechecks _
+1 :green_heart: dupname 0m 0s No case conflicting files found.
+1 :green_heart: @author 0m 0s The patch does not contain any @author tags.
+1 :green_heart: test4tests 0m 0s The patch appears to include 1 new or modified test files.
_ master Compile Tests _
+1 :green_heart: mvninstall 13m 20s master passed
+1 :green_heart: compile 0m 29s master passed with JDK Ubuntu-11.0.11+9-Ubuntu-0ubuntu2.20.04
+1 :green_heart: compile 0m 27s master passed with JDK Private Build-1.8.0_292-8u292-b10-0ubuntu1~20.04-b10
+1 :green_heart: checkstyle 0m 57s master passed
+1 :green_heart: javadoc 0m 39s master passed with JDK Ubuntu-11.0.11+9-Ubuntu-0ubuntu2.20.04
+1 :green_heart: javadoc 0m 25s master passed with JDK Private Build-1.8.0_292-8u292-b10-0ubuntu1~20.04-b10
+0 :ok: spotbugs 1m 23s Used deprecated FindBugs config; considering switching to SpotBugs.
+1 :green_heart: findbugs 1m 19s master passed
_ Patch Compile Tests _
+1 :green_heart: mvninstall 0m 25s the patch passed
+1 :green_heart: compile 0m 25s the patch passed with JDK Ubuntu-11.0.11+9-Ubuntu-0ubuntu2.20.04
+1 :green_heart: javac 0m 25s the patch passed
+1 :green_heart: compile 0m 23s the patch passed with JDK Private Build-1.8.0_292-8u292-b10-0ubuntu1~20.04-b10
+1 :green_heart: javac 0m 23s the patch passed
+1 :green_heart: checkstyle 0m 15s the patch passed
+1 :green_heart: whitespace 0m 0s The patch has no whitespace issues.
+1 :green_heart: javadoc 0m 22s the patch passed with JDK Ubuntu-11.0.11+9-Ubuntu-0ubuntu2.20.04
+1 :green_heart: javadoc 0m 20s the patch passed with JDK Private Build-1.8.0_292-8u292-b10-0ubuntu1~20.04-b10
+1 :green_heart: findbugs 1m 0s the patch passed
_ Other Tests _
+1 :green_heart: unit 1m 32s tez-mapreduce in the patch passed.
+1 :green_heart: asflicense 0m 13s The patch does not generate ASF License warnings.
40m 37s
Subsystem Report/Notes
Docker ClientAPI=1.41 ServerAPI=1.41 base: https://ci-hadoop.apache.org/job/tez-multibranch/job/PR-78/1/artifact/out/Dockerfile
GITHUB PR https://github.com/apache/tez/pull/78
JIRA Issue TEZ-4245
Optional Tests dupname asflicense javac javadoc unit spotbugs findbugs checkstyle compile
uname Linux 56b2059ccbd2 4.15.0-147-generic #151-Ubuntu SMP Fri Jun 18 19:21:19 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality personality/tez.sh
git revision master / c875b8216
Default Java Private Build-1.8.0_292-8u292-b10-0ubuntu1~20.04-b10
Multi-JDK versions /usr/lib/jvm/java-11-openjdk-amd64:Ubuntu-11.0.11+9-Ubuntu-0ubuntu2.20.04 /usr/lib/jvm/java-8-openjdk-amd64:Private Build-1.8.0_292-8u292-b10-0ubuntu1~20.04-b10
Test Results https://ci-hadoop.apache.org/job/tez-multibranch/job/PR-78/1/testReport/
Max. process+thread count 210 (vs. ulimit of 5500)
modules C: tez-mapreduce U: tez-mapreduce
Console output https://ci-hadoop.apache.org/job/tez-multibranch/job/PR-78/1/console
versions git=2.25.1 maven=3.6.3 findbugs=3.0.1
Powered by Apache Yetus 0.12.0 https://yetus.apache.org

This message was automatically generated.

tez-yetus avatar Sep 08 '21 21:09 tez-yetus