
HADOOP-18013. ABFS: add cloud trash policy with per-schema policy selection #4729

Draft: wants to merge 4 commits into trunk from azure/HADOOP-18013-resilient-trash-policy
Conversation

@steveloughran (Contributor) commented Aug 10, 2022

New trash policies, and a schema-specific trash policy set by `fs.SCHEMA.trash.policy`.

This lets clusters declare different policies for different stores
in the same cluster.

  • CloudTrashPolicy: for abfs with rename failure resilience and auto cleanup of old checkpoints.
  • DeleteFilesTrashPolicy: for versioned s3 buckets; delete the files.
  • Maybe for s3 we should get an enumeration of all the files + versions (i.e. a deep list) and save that to trash, so we know what to restore. list() gives version info if you cast; we could build and save a manifest (Avro?) so that restore is a matter of using the relevant recovery API or explicitly copying somewhere else.
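The per-schema selection described above can be sketched in plain Java. This is a minimal illustration of the `fs.SCHEMA.trash.policy` key shape only; the class and method names here are hypothetical and are not the patch's actual API, and the real patch resolves policies through Hadoop's Configuration rather than a bare Map.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Hypothetical sketch of per-schema trash policy selection,
 * assuming the fs.SCHEMA.trash.policy key format from the PR text.
 */
public class TrashPolicySelector {

  /**
   * Pick the policy class name for a filesystem schema, falling back
   * to a cluster-wide default when no schema-specific key is set.
   */
  static String policyFor(Map<String, String> conf, String schema,
      String defaultPolicy) {
    // e.g. schema "abfs" -> key "fs.abfs.trash.policy"
    String key = "fs." + schema + ".trash.policy";
    return conf.getOrDefault(key, defaultPolicy);
  }

  public static void main(String[] args) {
    Map<String, String> conf = new HashMap<>();
    conf.put("fs.abfs.trash.policy", "CloudTrashPolicy");
    conf.put("fs.s3a.trash.policy", "DeleteFilesTrashPolicy");

    // Different stores in the same cluster get different policies.
    System.out.println(policyFor(conf, "abfs", "TrashPolicyDefault"));
    System.out.println(policyFor(conf, "s3a", "TrashPolicyDefault"));
    // No schema-specific key for hdfs, so the default applies.
    System.out.println(policyFor(conf, "hdfs", "TrashPolicyDefault"));
  }
}
```

This is what lets one cluster declare, say, rename-resilient trash for abfs and plain deletion for versioned s3 buckets, while hdfs keeps the default policy.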

How was this patch tested?

what do you mean, tested?

For code changes:

  • Does the title of this PR start with the corresponding JIRA issue id (e.g. 'HADOOP-17799. Your PR title ...')?
  • Object storage: have the integration tests been executed and the endpoint declared according to the connector-specific documentation?
  • If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under ASF 2.0?
  • If applicable, have you updated the LICENSE, LICENSE-binary, NOTICE-binary files?

@steveloughran steveloughran marked this pull request as draft August 10, 2022 18:45
New trash policies, and a schema specific trash policy set by
fs.SCHEMA.trash.policy.

This lets clusters declare different policies for different stores
in the same cluster.

Change-Id: I8f4c478ca4d7b763a4499e80b2fe76f4777af054
ResilientTrashPolicy: for abfs with rename failure resilience
DeleteFilesTrashPolicy: for versioned s3 buckets; delete the files.
...as RawLocalFS doesn't have a schema

Change-Id: I2f0983e7ea67f6cef71e24ebe666e0ff652a85b4
@steveloughran steveloughran force-pushed the azure/HADOOP-18013-resilient-trash-policy branch from 054a651 to 4cd392b Compare September 7, 2022 16:48
Change-Id: Ie3e676ed023f573162c2de9dd9518d1273ab1215
Stats collection; option to cleanup.
ABFS configured to collect the matching stats.

no tests/docs

Change-Id: I324bf687da1841354748a2d479c287594486dc58
@steveloughran steveloughran changed the title HADOOP-18013. ABFS: add resilient trash policy. HADOOP-18013. ABFS: add cloud trash policy with per-schema policy selection Sep 8, 2022
@steveloughran (Contributor, Author) commented:

Still a WiP but should interest people working with abfs/gcs and to a lesser degree s3a

@hadoop-yetus

💔 -1 overall

Vote Subsystem Runtime Logfile Comment
+0 🆗 reexec 0m 45s Docker mode activated.
_ Prechecks _
+1 💚 dupname 0m 1s No case conflicting files found.
+0 🆗 codespell 0m 0s codespell was not available.
+0 🆗 detsecrets 0m 0s detect-secrets was not available.
+1 💚 @author 0m 0s The patch does not contain any @author tags.
-1 ❌ test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
_ trunk Compile Tests _
+0 🆗 mvndep 15m 15s Maven dependency ordering for branch
+1 💚 mvninstall 29m 18s trunk passed
+1 💚 compile 25m 25s trunk passed with JDK Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04
+1 💚 compile 21m 37s trunk passed with JDK Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
+1 💚 checkstyle 4m 14s trunk passed
+1 💚 mvnsite 3m 17s trunk passed
+1 💚 javadoc 2m 44s trunk passed with JDK Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04
+1 💚 javadoc 2m 6s trunk passed with JDK Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
+1 💚 spotbugs 4m 55s trunk passed
+1 💚 shadedclient 22m 7s branch has no errors when building and testing our client artifacts.
_ Patch Compile Tests _
+0 🆗 mvndep 0m 31s Maven dependency ordering for patch
+1 💚 mvninstall 1m 44s the patch passed
+1 💚 compile 24m 9s the patch passed with JDK Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04
-1 ❌ javac 24m 9s /results-compile-javac-root-jdkUbuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04.txt root-jdkUbuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04 with JDK Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04 generated 5 new + 2850 unchanged - 4 fixed = 2855 total (was 2854)
+1 💚 compile 21m 23s the patch passed with JDK Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
-1 ❌ javac 21m 23s /results-compile-javac-root-jdkPrivateBuild-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07.txt root-jdkPrivateBuild-1.8.0_342-8u342-b07-0ubuntu120.04-b07 with JDK Private Build-1.8.0_342-8u342-b07-0ubuntu120.04-b07 generated 4 new + 2649 unchanged - 3 fixed = 2653 total (was 2652)
+1 💚 blanks 0m 0s The patch has no blanks issues.
-0 ⚠️ checkstyle 4m 6s /results-checkstyle-root.txt root: The patch generated 5 new + 73 unchanged - 4 fixed = 78 total (was 77)
+1 💚 mvnsite 3m 8s the patch passed
-1 ❌ javadoc 1m 23s /results-javadoc-javadoc-hadoop-common-project_hadoop-common-jdkUbuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04.txt hadoop-common-project_hadoop-common-jdkUbuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04 with JDK Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04 generated 3 new + 0 unchanged - 0 fixed = 3 total (was 0)
+1 💚 javadoc 2m 8s the patch passed with JDK Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
-1 ❌ spotbugs 3m 3s /new-spotbugs-hadoop-common-project_hadoop-common.html hadoop-common-project/hadoop-common generated 1 new + 0 unchanged - 0 fixed = 1 total (was 0)
+1 💚 shadedclient 22m 20s patch has no errors when building and testing our client artifacts.
_ Other Tests _
+1 💚 unit 19m 12s hadoop-common in the patch passed.
+1 💚 unit 2m 34s hadoop-azure in the patch passed.
+1 💚 asflicense 1m 24s The patch does not generate ASF License warnings.
246m 20s
Reason Tests
SpotBugs module:hadoop-common-project/hadoop-common
Dead store to dir in org.apache.hadoop.fs.TrashPolicyDefault.deleteCheckpoint(Path) At TrashPolicyDefault.java:org.apache.hadoop.fs.TrashPolicyDefault.deleteCheckpoint(Path) At TrashPolicyDefault.java:[line 412]
Subsystem Report/Notes
Docker ClientAPI=1.41 ServerAPI=1.41 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4729/3/artifact/out/Dockerfile
GITHUB PR #4729
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient spotbugs checkstyle codespell detsecrets
uname Linux 0cebf7eb06c2 4.15.0-191-generic #202-Ubuntu SMP Thu Aug 4 01:49:29 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality dev-support/bin/hadoop.sh
git revision trunk / 5a16dea
Default Java Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
Multi-JDK versions /usr/lib/jvm/java-11-openjdk-amd64:Ubuntu-11.0.16+8-post-Ubuntu-0ubuntu120.04 /usr/lib/jvm/java-8-openjdk-amd64:Private Build-1.8.0_342-8u342-b07-0ubuntu1~20.04-b07
Test Results https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4729/3/testReport/
Max. process+thread count 2928 (vs. ulimit of 5500)
modules C: hadoop-common-project/hadoop-common hadoop-tools/hadoop-azure U: .
Console output https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4729/3/console
versions git=2.25.1 maven=3.6.3 spotbugs=4.2.2
Powered by Apache Yetus 0.14.0 https://yetus.apache.org

This message was automatically generated.
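The SpotBugs finding above ("Dead store to dir" at TrashPolicyDefault.java line 412) is the DLS_DEAD_LOCAL_STORE pattern: a local variable is assigned a value that is never read. A minimal, hypothetical illustration of what SpotBugs is flagging (the method and values here are invented, not the patch's code):

```java
/**
 * Hypothetical demo of the dead-store pattern SpotBugs reports.
 */
public class DeadStoreDemo {

  static int checkpointAgeDays(String name) {
    int age = 0;           // dead store: overwritten before it is ever read
    age = name.length();   // SpotBugs would flag the assignment above
    return age;
  }

  public static void main(String[] args) {
    System.out.println(checkpointAgeDays("2022-09-07"));
  }
}
```

The usual fix is to delete the unused assignment (here, initialize `age` directly from `name.length()`), which is why such findings are typically cheap to clear before merge.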

```java
trashRoot -> {
  try {
    count.addAndGet(deleteCheckpoint(trashRoot.getPath(), false));
    createCheckpoint(trashRoot.getPath(), new Date(now));
```
A Contributor commented:
moveToTrash() will be called by thousands of clients. IIRC, a new snapshot will be created as long as the CURRENT dir exists, and super.moveToTrash() will create the CURRENT dir if it does not exist. So I'd imagine every moveToTrash() would create a new checkpoint, which is probably not ideal.

Each client will also try to delete the same set of snapshots. I'd imagine some clients will fail with a FILE_NOT_FOUND exception, because a checkpoint dir has been removed by another client.

Cleaning is something we need to handle for trash, and if we can make this approach work, I think that would be great.

@steveloughran (Author) replied:

Hmmm, good explanation of the problems you see.

That auto cleanup was added because of the reported problem of user home dirs being full. Maybe we need to think of better strategies here, even if it is just hadoop fs -expunge updated to work better in this world.
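The race the reviewer describes, where many clients delete the same checkpoints and the losers hit FILE_NOT_FOUND, is usually handled by making the delete tolerate an already-gone path. A self-contained sketch of that idea using java.nio rather than the Hadoop FileSystem API; all names here are hypothetical, not the patch's code:

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

/**
 * Hypothetical sketch: checkpoint deletion that tolerates another
 * client having already removed the checkpoint.
 */
public class CheckpointCleaner {

  /**
   * Delete one checkpoint entry; return true if this client did the
   * delete, false if it was already gone (some other client won).
   */
  static boolean deleteCheckpointQuietly(Path checkpoint) throws IOException {
    // deleteIfExists absorbs the missing-file case instead of
    // throwing NoSuchFileException when another client raced us.
    return Files.deleteIfExists(checkpoint);
  }

  public static void main(String[] args) throws IOException {
    Path dir = Files.createTempDirectory("trash-checkpoints");
    Path cp = Files.createFile(dir.resolve("2022-09-07-1600"));

    System.out.println(deleteCheckpointQuietly(cp)); // true: we deleted it
    System.out.println(deleteCheckpointQuietly(cp)); // false: already gone
    Files.delete(dir);
  }
}
```

This doesn't solve the other problem in the thread (every moveToTrash() creating a fresh checkpoint); that needs a policy-level decision about when checkpoint rotation runs, such as leaving it to an explicit expunge.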
