Add method to check and skip duplicate content uploads to S3 #1032
Rebase of #1015 onto 24.1-release so that HBCD can benefit from these changes in the next bugfix release of LORIS-MRI.
Description (from PR #1015)
These changes check whether the content of a file that would be uploaded to S3 has already been uploaded. Before uploading, the hash of the file's content is compared against the object already stored at the target S3 key; if the content is identical, the upload is skipped.
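As a rough illustration of the idea (not the PR's actual code), here is a minimal sketch using boto3. The function name `upload_if_changed` is hypothetical, and the comparison assumes the existing object was uploaded in a single part so that its ETag is the MD5 hex digest of its content (this does not hold for multipart or some server-side-encrypted uploads):

```python
import hashlib

import boto3
from botocore.exceptions import ClientError

s3 = boto3.client("s3")


def upload_if_changed(local_path: str, bucket: str, key: str) -> bool:
    """Upload local_path to s3://bucket/key unless identical content is
    already stored there. Returns True if an upload was performed."""
    # Hash the local file's content.
    md5 = hashlib.md5()
    with open(local_path, "rb") as f:
        for chunk in iter(lambda: f.read(8192), b""):
            md5.update(chunk)
    local_hash = md5.hexdigest()

    # Check whether an object already exists at the target key and whether
    # its ETag matches the local content hash (valid only for single-part,
    # non-KMS uploads -- an assumption of this sketch).
    try:
        head = s3.head_object(Bucket=bucket, Key=key)
        remote_etag = head["ETag"].strip('"')
        if remote_etag == local_hash:
            return False  # identical content already at this key; skip
    except ClientError as err:
        # A 404 means nothing exists at this key yet; any other error
        # is real and should propagate.
        if err.response["Error"]["Code"] not in ("404", "NoSuchKey"):
            raise

    s3.upload_file(local_path, bucket, key)
    return True
```

In a versioning-enabled bucket, skipping the `upload_file` call when the content matches is what prevents a redundant new object version from being created.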
This resolves an issue where the same content could be uploaded to an S3 bucket repeatedly, even when an identical file already existed at the target key. Normally this is harmless, but in versioning-enabled buckets it creates duplicate versions of the files when no changes are needed.
This does not appear to cause any breaking changes.