
An attacker can flood the storage by direct uploading files #13

Closed
choonkeat opened this issue Nov 3, 2015 · 6 comments

@choonkeat
Owner

Though attache's presigned uploads offer the same protection as S3 direct upload:

The pre-signed URLs are valid only for the specified duration.

Within that duration, an attacker can still upload as many files as they want.

To mitigate that, we can adopt the refile and shrine procedure of always uploading to cache, then promoting to store only when the client app sends a confirmation ping.

Current proposal is for /promote to mimic the /delete endpoint (see the sketch below):

  • require a pre-signed HTTP POST (but the valid duration must be pretty short, e.g. < 30s)
  • params include a list of filenames to confirm (batch operation)
  • image server performs the promotion async and responds to the client app immediately
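
A minimal sketch of what such a pre-signed /promote call could look like from the client app. The parameter names (paths, expiration, hmac), the string-to-sign, and ATTACHE_SECRET_KEY are illustrative assumptions, not a settled API:

```ruby
require "net/http"
require "openssl"
require "uri"

# Shared secret between the client app and the attache server (assumed config).
SECRET_KEY = ENV.fetch("ATTACHE_SECRET_KEY")

def promote_files(paths, attache_url: "http://localhost:9292")
  expiration = Time.now.to_i + 30                    # short-lived, well under a minute
  payload    = paths.join("\n") + expiration.to_s    # assumed string-to-sign
  hmac       = OpenSSL::HMAC.hexdigest(OpenSSL::Digest.new("SHA1"), SECRET_KEY, payload)

  uri = URI.join(attache_url, "/promote")
  res = Net::HTTP.post_form(uri,
    "paths"      => paths.join("\n"),                # batch of filenames to confirm
    "expiration" => expiration.to_s,
    "hmac"       => hmac)
  res.is_a?(Net::HTTPSuccess)                        # server queues the promotion, replies immediately
end

# promote_files(["2015/11/03/abcd/photo.jpg", "2015/11/03/efgh/doc.pdf"])
```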

@janko-m if async promotion fails in the background, what does a shrine user do?

@janko

janko commented Nov 3, 2015

Fails in what way? If the attachment is changed on the record before the file is promoted, the stored file is deleted. However, if the record gets deleted, the promotion will error and the stored file won't get deleted; I need to fix that.
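
This is not Shrine's actual code, but a rough sketch of the guard described above, using ActiveRecord-style placeholders (record.reload, record.attachment, store.upload). The rescue branch illustrates the deleted-record cleanup that still needs fixing:

```ruby
def promote(record, cached_file)
  stored_file = store.upload(cached_file)        # copy from cache storage to store storage
  record.reload                                  # raises if the record was deleted meanwhile
  if record.attachment == cached_file
    record.update(attachment: stored_file)       # still current: swap in the stored copy
  else
    stored_file.delete                           # attachment changed meanwhile: discard the orphan
  end
rescue ActiveRecord::RecordNotFound
  stored_file.delete                             # record deleted: the cleanup that still needs fixing
end
```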

@choonkeat
Owner Author

[async promotion fails] in what way?

When we are using cloud storage (e.g. S3), presumably promote is an API call that moves the file from one location to another?

So if this S3 API call fails, the file isn't promoted, the cache eventually gets cleared, and then we'll lose the file?

@janko

janko commented Nov 3, 2015

Yeah, promote is actually when a cached file is uploaded to "store". Hopefully the background job is set to retry a couple of times, so ultimately it should succeed.

@choonkeat
Owner Author

Back when attaché was uploading asynchronously, I experienced a failure scenario where the retries didn't persist as long as the S3 outage. That incident resulted in some data loss.

For a while we investigated several methods to make the retry more robust, but eventually settled on synchronous upload instead: either the end user experiences an upload error or the data is safe in S3; no silent data loss.

So for promotion, I'm back to worrying about the failure state again, hence picking your brain on this.

If there's no satisfactory solution, I wonder if there's another algorithm to address the original attack vector.

@janko

janko commented Nov 3, 2015

I think this can be fixed by just setting a long-enough timespan. For example, you make the Sidekiq job retry for 1 day, in increasing periods (e.g. 1st retry after 5 seconds, 2nd after another 15 seconds, 3rd after a minute, etc.), and you also make the cache storage clear out only files that are more than 1 day old. It's extremely unlikely that S3 would be down for a full 24 hours.
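
A minimal sketch of that retry schedule. sidekiq_options and sidekiq_retry_in are real Sidekiq hooks; PromoteJob and promote_to_store are placeholders, not attache or shrine code:

```ruby
require "sidekiq"

class PromoteJob
  include Sidekiq::Worker
  sidekiq_options retry: 25                     # enough attempts to cover roughly a day

  # 1st retry after 5s, 2nd after 15s, 3rd after a minute, then hourly
  sidekiq_retry_in do |count|
    [5, 15, 60][count] || 3600
  end

  def perform(cached_path)
    promote_to_store(cached_path)               # upload the cached file to "store"
  end
end
```

The cache-clearing job would then only remove files older than the retry window (e.g. last modified more than 24 hours ago), so a file is never purged while its promotion is still being retried.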

@choonkeat
Owner Author

Guess it'll have to be an option for promote to be sync (slower, zero maintenance) or async (faster, caveat emptor) 🙇

choonkeat added a commit that referenced this issue Dec 14, 2015
- if configured, calling "backup_file" will copy the file from default bucket to a "backup bucket"
- addresses #13 (aka "cache" vs "store" concept in refile)
- reference simplified model of shrinerb/shrine#25 (comment)
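
On S3 the referenced backup_file behaviour amounts to a server-side copy from the default bucket to the configured backup bucket. The bucket names and this aws-sdk usage are illustrative, not attache's actual implementation:

```ruby
require "aws-sdk"   # aws-sdk v2

def backup_file(path, from: "attache-default", to: "attache-backup")
  s3 = Aws::S3::Client.new
  s3.copy_object(
    bucket:      to,                  # destination (backup) bucket
    key:         path,
    copy_source: "#{from}/#{path}"    # "source-bucket/object-key"
  )
end
```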