HADOOP-18257. Merging and Parsing S3A audit logs into Avro format for analysis. #6000
base: trunk
Conversation
💔 -1 overall
This message was automatically generated.
Looking good. Some minor comments.
Have you tested this on actual files? Specifically at scale: many files, with a total size in the GBs?
I don't see any memory issues as data gets written after being processed.
If there are many files, e.g. 1000, does it launch multiple mappers, with each mapper processing x files based on the splits?
private static final int INVALID_ARGUMENT = EXIT_COMMAND_ARGUMENT_ERROR;

private static final String USAGE =
    "bin/hadoop " + "Class" + " DestinationPath" + " SourcePath" + "\n" +
Give an example of a command. Why bin/hadoop twice?
This is essentially defining the command verbosely rather than via an example. This is what we'll see:
❯ bin/hadoop org.apache.hadoop.fs.s3a.audit.AuditTool
bin/hadoop Class DestinationPath SourcePath
bin/hadoop org.apache.hadoop.fs.s3a.audit.AuditTool s3a://BUCKET s3a://BUCKET
I'll simplify this; it may be confusing.
Okay, change one BUCKET to source_bucket and the other to destination_bucket.
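A simplified usage string along those lines might look like the following sketch; the class name, constant value, and bucket paths are illustrative, not the actual patch:

```java
// Hypothetical sketch of the simplified USAGE string suggested in review.
// AUDIT_TOOL value and the example bucket paths are assumptions.
public class UsageSketch {
  static final String AUDIT_TOOL = "org.apache.hadoop.fs.s3a.audit.AuditTool";
  static final String USAGE =
      "hadoop " + AUDIT_TOOL + " DestinationPath SourcePath\n"
          + "e.g. hadoop " + AUDIT_TOOL
          + " s3a://destination_bucket/AuditLogsDestination"
          + " s3a://source_bucket/AuditLogsSource";

  public static void main(String[] args) {
    // Print the usage text, verbose form followed by a concrete example.
    System.out.println(USAGE);
  }
}
```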
try {
  uri = new URI(s3Path);
} catch (URISyntaxException e) {
  throw invalidArgs("Not a valid fileystem path: %s", s3Path);
this is not an invalidArgs exception.
Hmm, interesting. I don't see this being used anywhere; maybe it is stale code from the last PR. I'll remove this.
Have tested this but not at scale. Will do that.
Since we're reading each file serially, line by line, I would assume this would be a lot slower in that scenario. Optimisation can be a follow-up patch.
Not currently. Is that something we would have to write the logic for? I'll have to check the code. Specifically for the number of mappers, maybe we could have a threshold on the number of files and then paginate them based on that.
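The threshold-and-paginate idea from this thread could be sketched roughly as follows; the class name, method, and per-mapper threshold are illustrative assumptions, not code from the patch:

```java
import java.util.ArrayList;
import java.util.List;

// Sketch of the batching idea discussed above: given N input log files and a
// per-mapper threshold, split the file list into batches, one per mapper.
public class FileBatcher {
  static List<List<String>> batch(List<String> files, int perMapper) {
    List<List<String>> batches = new ArrayList<>();
    for (int i = 0; i < files.size(); i += perMapper) {
      // Each batch holds at most perMapper files; the last one may be smaller.
      batches.add(new ArrayList<>(
          files.subList(i, Math.min(i + perMapper, files.size()))));
    }
    return batches;
  }

  public static void main(String[] args) {
    List<String> files = new ArrayList<>();
    for (int i = 0; i < 10; i++) {
      files.add("s3a://bucket/logs/file-" + i + ".log");
    }
    // 10 files at 4 per mapper -> batches of 4, 4, and 2
    System.out.println(batch(files, 4).size()); // prints 3
  }
}
```

In the real tool the equivalent decision is normally made by the input format's split calculation rather than hand-rolled batching.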
💔 -1 overall
This message was automatically generated.
I'm going to propose having a small log file in test/resources so that the test can actually take a larger file and work through it, with a broader set of requests. A few KB of test run logs is enough.
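As a sketch of that test setup: a couple of sample audit log lines, kept small, that the test iterates over. The example writes the sample to a temp file to stay self-contained; in the real test the file would live under src/test/resources (the directory name and log line shape here are assumptions):

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

// Sketch: a tiny sample audit log a test can work through line by line.
public class SampleLogResource {
  // Count the non-blank entries in a log file.
  static long countEntries(Path log) throws IOException {
    return Files.readAllLines(log).stream()
        .filter(line -> !line.isBlank())
        .count();
  }

  public static void main(String[] args) throws IOException {
    Path sample = Files.createTempFile("sample-audit", ".log");
    // Illustrative log lines only; real S3 access logs have many more fields.
    Files.write(sample, List.of(
        "owner bucket [12/Aug/2023:10:00:00 +0000] 1.2.3.4 REST.GET.OBJECT key",
        "owner bucket [12/Aug/2023:10:00:01 +0000] 1.2.3.4 REST.PUT.OBJECT key"));
    System.out.println(countEntries(sample)); // prints 2
    Files.delete(sample);
  }
}
```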
 * Its functionality is to parse the audit log files
 * and generate avro file.
 */
public class AuditTool extends Configured implements Tool, Closeable {
What about making this something the hadoop s3guard tool can invoke?
I think we initially went with that, but then changed it to a standalone audit log tool; not quite sure why we didn't go that route. Were there any plans to remove the s3guard tool in the future? We would have to separate it out then.
private static final int INVALID_ARGUMENT = EXIT_COMMAND_ARGUMENT_ERROR;

private static final String USAGE =
    "bin/hadoop " + AUDIT_TOOL +
Cut the "bin/"; that's only because we use it in our local builds. Real deployments have hadoop on the path.
public void testParseAuditLogEmptyAndNull() {
  Map<String, String> parseAuditLogResultEmpty =
      s3AAuditLogMergerAndParser.parseAuditLog("");
  assertTrue("the returned list should be empty for this test",
Use AssertJ asserts, as they will include the map in the failure message if an assertion fails.
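For illustration, the benefit of a map-aware assert is that the failure message carries the actual map contents; AssertJ's assertThat(map).isEmpty() does this out of the box. This plain-Java stand-in (names are illustrative) shows the equivalent behaviour without the third-party dependency:

```java
import java.util.Map;

// Demonstrates a failure message that includes the offending map contents,
// which is what AssertJ's map assertions provide automatically.
public class MapAssertDemo {
  static void assertEmpty(Map<String, String> map) {
    if (!map.isEmpty()) {
      // The map itself is embedded in the error, so a CI log shows
      // exactly which entries were unexpectedly parsed.
      throw new AssertionError("expected empty map but was: " + map);
    }
  }

  public static void main(String[] args) {
    assertEmpty(Map.of()); // passes silently
    try {
      assertEmpty(Map.of("key", "GET /bucket/object"));
    } catch (AssertionError e) {
      System.out.println(e.getMessage()); // message includes the map entries
    }
  }
}
```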
s3AAuditLogMergerAndParser.mergeAndParseAuditLogFiles(fileSystem,
    logsPath, destPath);
assertTrue("the result should be true", mergeAndParseResult);
}
Should explain what "the result" is; better: "the merge and parse failed".
- L250 and 251 are duplicates
- comment doesn't seem to match the assert
- propose: cut the comment, and state in assertEquals what you are checking: "count of header logs parsed across both files"
+1 on assert message.
Looks like this is completely serial now, but you can treat parallelism as a follow-up and maybe add support for it in the future once this gets used. Just create a JIRA for now.
💔 -1 overall
This message was automatically generated.
Addressed the review comments. CC: @mukund-thakur @steveloughran
Minor test tweak; the main blocker is getting rat to stop complaining about licenses.
You need to add an excludes entry in the apache-rat-plugin in the hadoop-main pom;
if you name the logs with a .log suffix then you can use a wildcard, hadoop-tools/hadoop-aws/src/test/resources/TestAuditLogs/*.log, for future expansion.
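The excludes entry might look roughly like this fragment; the surrounding apache-rat-plugin configuration is assumed, not copied from the hadoop-main pom:

```xml
<!-- Hypothetical sketch of the rat excludes entry; only the <exclude>
     pattern comes from the review comment above. -->
<plugin>
  <groupId>org.apache.rat</groupId>
  <artifactId>apache-rat-plugin</artifactId>
  <configuration>
    <excludes>
      <exclude>hadoop-tools/hadoop-aws/src/test/resources/TestAuditLogs/*.log</exclude>
    </excludes>
  </configuration>
</plugin>
```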
Looks good overall. Let's fix the Yetus issues and we are good to merge.
💔 -1 overall
This message was automatically generated.
/patch-unit-root.txt says the build was a success but Yetus still gave a -1. Going to put an empty patch up for Yetus again.
💔 -1 overall
This message was automatically generated.
Description of PR
This is a follow-up to PR #4383, with most of the code from that PR already in place.
Adds support for an audit tool to merge and parse audit logs into an Avro file.
How was this patch tested?
mvn clean verify -Dparallel-tests -DtestsThreadCount=4 -Dscale
For code changes:
If applicable, have you updated the LICENSE, LICENSE-binary, NOTICE-binary files?