MAPREDUCE-7470: multi-thread mapreduce committer #6469
base: trunk
Conversation
💔 -1 overall
This message was automatically generated.
@lastbus Thanks for the contribution! We need to fix the checkstyle issue.
```java
@@ -404,6 +423,37 @@ protected void commitJobInternal(JobContext context) throws IOException {
      for (FileStatus stat: getAllCommittedTaskPaths(context)) {
        mergePaths(fs, stat, finalOutput, context);
      }
    } else if (algorithmVersion == 3) {
      ExecutorService pool = Executors.newFixedThreadPool(commitJobThreadIONum);
```
Should we have a common executorService per object? That would save the overhead of creating new threads each time this method is called.
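A minimal sketch of what this suggestion could look like: one lazily created pool held by the committer instance and reused across `commitJobInternal()` calls. The class and field names here are hypothetical, not the patch's actual code.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical sketch: share one pool per committer instance instead of
// creating a new pool on every commitJobInternal() invocation.
public class SharedPoolExample {
    private ExecutorService pool; // lazily created, then reused
    private final int threads;

    public SharedPoolExample(int threads) {
        this.threads = threads;
    }

    synchronized ExecutorService getPool() {
        if (pool == null) {
            pool = Executors.newFixedThreadPool(threads);
        }
        return pool;
    }

    // would be called from the committer's cleanup path
    synchronized void shutdown() {
        if (pool != null) {
            pool.shutdown();
        }
    }
}
```

The trade-off is that the pool's lifetime must then be tied to the committer's cleanup path rather than to a single method call.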
```java
@@ -100,12 +105,22 @@ public class FileOutputCommitter extends PathOutputCommitter {
  public static final boolean
      FILEOUTPUTCOMMITTER_TASK_CLEANUP_ENABLED_DEFAULT = false;

  public static final String FILEOUTPUTCOMMITTER_COMMIT_JOB_THREAD_IO_NUM =
      "mapreduce.fileoutputcommitter.jobcommit.thread.io.num";

  public static final int FILEOUTPUTCOMMITTER_COMMIT_JOB_THREAD_IO_NUM_DEFAULT = 50;
```
Should we make this default a factor of the JVM's availableProcessors? That would keep excessive context switching in check.
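One way this could be sketched: cap the configured thread count by a multiple of the core count. The oversubscription factor of 4 below is an arbitrary illustration (commit is IO-bound, so some oversubscription is reasonable), not a value from the patch.

```java
// Hypothetical sketch: bound the IO thread count by the available cores
// so a large configured value cannot cause excessive context switching.
public class ThreadCountExample {
    static int ioThreads(int configured, int cores) {
        // assumed oversubscription factor for IO-bound work; tune as needed
        int suggested = cores * 4;
        return Math.max(1, Math.min(configured, suggested));
    }
}
```

For example, with the default of 50 on an 8-core JVM this would use 32 threads rather than 50.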
```java
if (!fs.exists(to)) {
  commitJobLock.lock();
  try {
    if (!fs.exists(to)) { // double check
```
I was thinking whether we should drop this double check. Between line 567 and 568, some other parallel operation could create `to`, so we cannot really ensure that `to` will not be there while renaming. At the instant of renaming, three things can happen:

- the `to` path is not there
- the `to` path is there and is a directory
- the `to` path is there and is a file

Now, rename will return false if the `to` path is there and is a file (ref: https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-common/filesystem/filesystem.html#boolean_rename.28Path_src.2C_Path_d.29:~:text=HDFS%20%3A%20The%20rename%20fails%2C%20no%20exception%20is%20raised.%20Instead%20the%20method%20call%20simply%20returns%20false), or it will return true. When it returns true, two things can happen:

- `to` did not exist before the rename -> after the rename, the `to` path will be made
- `to` was an existing dir before the rename -> after the rename, `to/from` will get created (ref: https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-common/filesystem/filesystem.html#boolean_rename.28Path_src.2C_Path_d.29:~:text=If%20the%20destination%20exists%20and%20is%20a%20directory%2C%20the%20final%20destination%20of%20the%20rename%20becomes%20the%20destination%20%2B%20the%20filename%20of%20the%20source%20path.)

Proposal is to:

- let the rename happen
- if the rename succeeds, don't do anything more, because either `to` initially was not there or `to` was a directory, in which case the rename did something similar to what is done on lines 578 to 582
Like I said on the JIRA, I don't want this. It has the same scale issues encountered on abfs as #6399 and #6378, and the same correctness problems on GCS as v2, i.e. "incorrect task commit semantics", unless v1 commit can be made to rely not on atomic directory rename but on atomic file rename, which does work there.
Even if the store meets the v1 correctness prerequisites, I would like to see a comparison of the same job run through the manifest committer, ideally with profiling to highlight where it could be improved.
💔 -1 overall
This message was automatically generated.
Description of PR
In cloud environments such as AWS, Aliyun, etc., network latency is non-trivial when we commit thousands of files.
In our situation, the ping latency is about 0.03 ms in the IDC, but after moving to the cloud it is about 3 ms, roughly 100x slower. We found that committing tens of thousands of files can cost tens of minutes; the more files there are, the longer it takes.
So we propose a new committer algorithm, a variant of committer algorithm version 1, called version 3. To decrease the commit time, algorithm 3 uses a thread pool to commit the job's final output.
Our test results in cloud production show that the new algorithm 3 decreases the commit time by several tens of times.
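The core idea, submitting each committed task path to a fixed-size pool and waiting for all merges, can be sketched with plain `java.util.concurrent` primitives. The names `commitAll` and `mergeOne` are hypothetical stand-ins for the patch's loop over `getAllCommittedTaskPaths()` and its `mergePaths()` call.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.atomic.AtomicInteger;

// Illustrative sketch of a parallel job commit: overlap the per-file
// round trips that dominate commit time on high-latency cloud stores.
class ParallelCommitSketch {
    static final AtomicInteger merged = new AtomicInteger();

    static void commitAll(List<String> taskPaths, int ioThreads) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(ioThreads);
        try {
            List<Future<?>> futures = new ArrayList<>();
            for (String p : taskPaths) {
                futures.add(pool.submit(() -> mergeOne(p)));
            }
            for (Future<?> f : futures) {
                f.get(); // propagate any merge failure to the job commit
            }
        } finally {
            pool.shutdown();
        }
    }

    static void mergeOne(String path) {
        // stand-in for mergePaths(fs, stat, finalOutput, context)
        merged.incrementAndGet();
    }
}
```

With per-call latency around 3 ms, N sequential renames cost roughly N x 3 ms, while a pool of T threads reduces the wall-clock cost toward N/T x 3 ms, which matches the speedups reported above.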
How was this patch tested?
For code changes: