[SUPPORT] Duplicates on MOR table #8703
Comments
Root cause: the table was dropped and then created again, but the table path was not deleted, so files from the old and the new table existed under the same path at the same time.
Is the table a managed table or an external table, then?
It is a Hudi table created by Flink; it is not an external table.
How did you drop the table? If it is a managed table and you use the Flink Hudi Hive catalog, the table path would be deleted.
I dropped the Hudi table using Spark SQL.
There is a param named purge on DropHoodieTableCommand:
case class DropHoodieTableCommand(
    tableIdentifier: TableIdentifier,
    ifExists: Boolean,
    isView: Boolean,
    purge: Boolean) extends HoodieLeafRunnableCommand {
If it is true, the table directory on the file system is deleted recursively.
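For reference, here is a minimal sketch of how that flag is usually reached from Spark SQL. It assumes that the `PURGE` keyword of `DROP TABLE` is what sets `purge` to true, which is worth verifying against your Hudi version. The table name `hudi_mor_tbl` and the helper `dropStatement` are hypothetical. The snippet only builds the statement string; executing it needs a SparkSession.

```scala
object DropWithPurge {
  // Builds a Spark SQL DROP TABLE statement.
  // Assumption: appending PURGE makes the drop command remove the table directory too.
  def dropStatement(table: String, purge: Boolean): String = {
    val suffix = if (purge) " PURGE" else ""
    s"DROP TABLE IF EXISTS $table$suffix"
  }

  def main(args: Array[String]): Unit = {
    // `hudi_mor_tbl` is a hypothetical table name used only for illustration.
    println(dropStatement("hudi_mor_tbl", purge = true))
    // With a SparkSession in scope as `spark`, the statement would be run as:
    //   spark.sql(dropStatement("hudi_mor_tbl", purge = true))
  }
}
```

Without `PURGE`, the same helper yields a plain `DROP TABLE IF EXISTS hudi_mor_tbl`, which is the case where the table path can be left behind.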
Thanks, it's good.
Hudi: 0.12.1
Flink: 1.15
Spark: 3.1
Duplicates span multiple files within the same partition path.
I cannot use the records deduplicate command to fix this because I do not have the required permissions.
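As an illustration of what "duplicates spanning multiple files" means, here is a small self-contained sketch that groups rows by record key and reports the keys found in more than one data file. The `Row` type, the sample keys, and the file names are hypothetical; in a real table the two fields would correspond to the `_hoodie_record_key` and `_hoodie_file_name` meta columns, read through whatever query engine you have access to.

```scala
object FindDuplicateKeys {
  // One entry per record: its record key and the data file that holds it.
  // Hypothetical stand-in for the _hoodie_record_key and _hoodie_file_name meta columns.
  final case class Row(recordKey: String, fileName: String)

  // Returns every record key that appears in more than one file, with the files involved.
  def duplicatesAcrossFiles(rows: Seq[Row]): Map[String, Set[String]] =
    rows
      .groupBy(_.recordKey)
      .map { case (key, rs) => key -> rs.map(_.fileName).toSet }
      .filter { case (_, files) => files.size > 1 }

  def main(args: Array[String]): Unit = {
    // Made-up sample data: id-1 lives in two files, id-2 in one.
    val rows = Seq(
      Row("id-1", "file-a.parquet"),
      Row("id-1", "file-b.parquet"),
      Row("id-2", "file-a.parquet")
    )
    duplicatesAcrossFiles(rows).foreach { case (key, files) =>
      println(s"$key -> ${files.toSeq.sorted.mkString(", ")}")
    }
  }
}
```

This only detects the duplicates; it does not repair them.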