
make load hive table more robust #1687

Conversation

@wg1026688210 wg1026688210 commented Oct 29, 2020

Fixes #1688

  1. When loading from the metadata_location stored in the Hive table properties fails with a File Not Found error, we will fall back to the previous_metadata_location.
  2. If the previous_metadata_location was used during a refresh operation, we will write it back as the metadata_location so that the next refresh does not fail because both metadata locations are missing (see the sketch below).
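A minimal sketch of the proposed fallback, assuming the existing single-argument refreshFromMetadataLocation overload; the boolean return value mirrors the diff below, but the exact error handling is illustrative rather than the PR's final code:

// Sketch only: fall back to previous_metadata_location when the current
// metadata file cannot be found, and tell the caller that a fallback happened.
protected boolean refreshFromMetadataLocation(String metadataLocation, String previousMetadataLocation) {
  try {
    refreshFromMetadataLocation(metadataLocation);          // normal load path
    return false;                                           // no fallback needed
  } catch (NotFoundException e) {
    refreshFromMetadataLocation(previousMetadataLocation);  // fall back to the previous file
    return true;                                            // caller may repair metadata_location
  }
}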

@wg1026688210 wg1026688210 changed the title from "make hive table commit more robust" to "make load hive table more robust" on Oct 29, 2020
}

protected void refreshFromMetadataLocation(String newLocation, Predicate<Exception> shouldRetry,
                                           int numRetries) {
protected boolean refreshFromMetadataLocation(
Member

Iceberg style is to put a new line only before the arguments that don't fit within the line width. So

foo(x, y,
    z)

Not

foo(x,
    y,
    z)

Contributor Author

ok

@RussellSpitzer
Member

I have some general concerns before looking more closely. One thing this PR would do is double the time to failure on misconfiguration, which is already quite long. I'm also a little confused about the use case for this PR: are the two locations both equally valid? It seems like we wouldn't normally want to fall back to a different location?

@wg1026688210
Contributor Author

Hi @RussellSpitzer, thanks for your concerns.
About the time to failure: I made some improvements so that trying the previous_location does not take quite a long time on a new commit. The metadata-parsing retry stops as soon as it throws a NotFoundException.
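For illustration, here is a sketch (not the PR's exact code) of how that fast-fail could look with the shouldRetry predicate from the signature shown earlier; metadataLocation and maxRetries are assumed to be in scope:

refreshFromMetadataLocation(metadataLocation,
    // retry transient errors, but stop immediately once the file is reported missing
    exc -> !(exc instanceof NotFoundException),
    maxRetries);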

About the use case for this PR:
As I mentioned in #1688, the metadata_location file was deleted when a commit failed, but the metadata_location property of the Hive table still pointed to the deleted file. Because of that, the Flink job failed to restart from its checkpoint every time:

if (threw) {
  // if anything went wrong, clean up the uncommitted metadata file
  io().deleteFile(newMetadataLocation);
}

I want to make an improvement that tries the previous_metadata_location only if reading the metadata_location throws a NotFoundException. If the metadata_location is not found, I will also set the previous_metadata_location value into the metadata_location property, so that a later refresh does not end up with both metadata locations missing.

2. The improvement so that falling back to the previous_location does not keep taking quite a long time: once the fallback is used, persist it back as the metadata_location.
if (refreshFromMetadataLocation(metadataLocation, previousMetadataLocation) && table != null) {
  Map<String, String> parameters = table.getParameters();
  parameters.put(METADATA_LOCATION_PROP, previousMetadataLocation);
  persistTable(table, true);
}
Contributor

This introduces a write operation in the Iceberg read path of a table. I feel that it is not a good idea for readers to modify table metadata in the metastore. Would it be okay to not try to fix the table during reads? The readers can still probably use the previousMetadataLocation for loads. To solve the issue you see in #1688, would it be better if the writer itself fixes the issue?

Contributor Author

@shardulm94 thanks for your suggestions. I'll try some improvements on the write path.

@rdblue
Contributor

rdblue commented Nov 1, 2020

Committing this change introduces a correctness problem. I don't think this should be committed. It is not safe to automatically roll back if there is a problem loading the current metadata file because the changes in the current metadata file are lost.

There are cases where the current metadata file may not be found. The S3A file system may throw NotFoundException because of negative caching in S3, even though the metadata file exists. If that happens, then this would silently roll back anything committed in the current metadata. There are other cases where this may happen as well and we simply don't know why the current metadata file was deleted. Assuming a specific cause is not a viable solution.

The problem in #1688 is that the Hive table was updated, but the operation appeared to fail on the client. The client deletes the metadata file from the commit attempt so that the commit cannot be accidentally successful (the case you hit) because the client will throw an exception that will very likely result in a retry. For example, Flink appending files will keep trying to append the same files.

I think the right solution to the problem is to improve the handling in the commit path, not the read path. The commit path can load the table again and check whether the metadata file is the one it attempted to commit. If it is, then it can return success instead of deleting the metadata file.

@wg1026688210
Contributor Author

Thanks @rdblue, I will add a retry on commit and check whether the metadata file is the one the commit attempted to write before deleting it.
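A hedged sketch of that check, built on the cleanup block quoted earlier; loadHmsTable() is a hypothetical helper name here, and a real fix would also need to handle the reload itself failing:

if (threw) {
  // Before cleaning up, re-read the table from the metastore and see whether
  // our new metadata file actually became the current metadata_location.
  String currentLocation = loadHmsTable().getParameters().get(METADATA_LOCATION_PROP);
  if (newMetadataLocation.equals(currentLocation)) {
    // the Hive update went through even though the client saw an error; treat the commit as successful
    return;
  }
  // otherwise the commit really failed, so clean up the uncommitted metadata file
  io().deleteFile(newMetadataLocation);
}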

Development

Successfully merging this pull request may close these issues:

#1688: iceberg flink sink job can't restart due to metadata location not found