
Writing to S3 sometimes results in corrupted files #10710

Closed
sshkvar opened this issue Jan 20, 2022 · 10 comments · Fixed by #10729
Labels: bug, correctness

Comments

@sshkvar (Contributor) commented Jan 20, 2022

Hi, we migrated to Trino 367 and ran into an issue with CREATE TABLE AS SELECT in Iceberg (potentially all connectors that use TrinoS3FileSystem are affected).

We are doing a CTAS in Iceberg from another table (~482,013,195 records).
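
For illustration, the CTAS had roughly this shape (a sketch; the source table name below is hypothetical):

-- hypothetical source table name, for illustration only
CREATE TABLE iceberg.schema.just_created_table AS
SELECT *
FROM iceberg.schema.source_table;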
Afterwards, when we try to query the created table

select count(uuid) from iceberg.schema.just_created_table

we got the following exception:

Query 20220119_153450_24112_cjit2 failed: Error reading from s3a://.../94a0afda-26d1-497d-9d48-362c5b36e363.parquet at position 27214944

After some investigation we found that, for some reason, we have multiple versions of the Parquet files. This is strange, because each Parquet file has a UUID in its name and we had just created this table.
You can find a screenshot in the related Slack thread.

Additional retries of the table creation showed us that the first version of the Parquet file always has the same size: 16 MB, which is equal to s3MultipartMinFileSize.
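
For context, a sketch of the S3 multipart settings involved here (property names as documented for the Hive connector's S3 support; values shown only to illustrate the 16 MB threshold mentioned above):

# hive.s3.multipart.min-file-size appears to correspond to s3MultipartMinFileSize (assumption based on the property name)
hive.s3.multipart.min-file-size=16MB
hive.s3.multipart.min-part-size=5MB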

After that we checked all changes in the io.trino.plugin.hive.s3 package and found this one: 7383734.
We reverted this change and it helped; we no longer had corrupted Parquet files in Iceberg tables.

Important note: the issue is intermittent; sometimes the table is created without any problem.

@hashhar (Member) commented Jan 20, 2022

cc: @findepi @losipiuk @linzebing

@findepi (Member) commented Jan 20, 2022

Additional retries of the table creation showed us that the first version of the Parquet file always has the same size: 16 MB, which is equal to s3MultipartMinFileSize.

this is very suspicious

After that we checked all changes in the io.trino.plugin.hive.s3 package and found this one: 7383734.

Sounds like #9715 is related then
cc @joshthoward @electrum

We reverted this change and it helped; we no longer had corrupted Parquet files in Iceberg tables.

good to hear that.

That also means you were testing with two builds, i.e. with local modifications.
Did you also try with hive.s3.streaming.enabled turned off?
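
For completeness, a sketch of what turning that off looks like in the catalog configuration (the file name is just an example):

# e.g. in etc/catalog/iceberg.properties (or the relevant Hive catalog file)
hive.s3.streaming.enabled=false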

findepi changed the title from "TrinoS3FileSystem works incorrect in some cases" to "Writing to S3 sometimes results in corrupted files" on Jan 20, 2022
findepi added the bug, correctness, and RELEASE-BLOCKER labels on Jan 20, 2022
@losipiuk (Member) commented:

If the revert helps, I vote for reverting as the initial fix and then reworking the original improvement.

@findepi (Member) commented Jan 21, 2022

No longer considered a release blocker, as #10716 has been merged.
Thank you @linzebing @losipiuk.

@findepi (Member) commented Jan 21, 2022

@sshkvar can you please confirm the current master no longer exhibits the problem?
@linzebing any chance this is testable? Or is there nothing to be done in this issue?

@sbernauer (Member) commented:

We had the same problem with corrupt Parquet and ORC files. In our case the CTAS in Iceberg failed with: org.apache.iceberg.parquet.ParquetIO$ParquetInputFile@2700c43b is not a Parquet file. Expected magic number at tail, but found [49, -21, -31, 15]
The current master with 7383734 reverted fixed the issue for us 👍
Thanks @sshkvar for bringing this up!

@sshkvar (Contributor, Author) commented Jan 21, 2022

@sshkvar can you please confirm the current master no longer exhibits the problem?

@findepi the current master works without issues for us.

@losipiuk (Member) commented:

@sshkvar I reworked the commit which caused the problem and it should be fine now. The new PR is #10729. Would that be ok for you to test it out?

@sshkvar (Contributor, Author) commented Jan 24, 2022

@sshkvar I reworked the commit which caused the problem and it should be fine now. The new PR is #10729. Would that be ok for you to test it out?

@losipiuk looks like the reworked fix is also working for us. I will continue testing and let you know in case of any issues.

@losipiuk (Member) commented:

@sshkvar I reworked the commit which caused the problem and it should be fine now. The new PR is #10729. Would that be ok for you to test it out?

@losipiuk looks like the reworked fix is also working for us. I will continue testing and let you know in case of any issues.

Thanks @sshkvar. Very much appreciate your help here.

linzebing added a commit to linzebing/trino that referenced this issue Mar 24, 2022
Add a test testInsertIntoPartitionedTableLargeFiles to exercise multiple code paths of S3 streaming upload, with upload part size 5MB:
1. file size <= 5MB (shortcut to direct upload)
2. file size > 5MB but <= 10MB (which triggered trinodb#10710)
3. file size > 10MB
losipiuk pushed a commit that referenced this issue Mar 24, 2022
Add a test testInsertIntoPartitionedTableLargeFiles to exercise multiple code paths of S3 streaming upload, with upload part size 5MB:
1. file size <= 5MB (shortcut to direct upload)
2. file size > 5MB but <= 10MB (which triggered #10710)
3. file size > 10MB
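
A sketch of the configuration such a test exercises (property names per the Hive connector's S3 docs, to the best of my recollection; 5MB is the part size from the commit message above, not a default):

hive.s3.streaming.enabled=true
# small part size so the three file-size ranges above hit different upload code paths
hive.s3.streaming.part-size=5MB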