Etag header set by Journalist API is not sha256sum of file #4032

emkll · 2019-01-14T20:07:13Z

Description

The Etag header on file downloads from the Journalist API (https://github.com/freedomofpress/securedrop/blob/develop/securedrop/journalist_app/utils.py#L337) always returns sha256sum:e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855. This is the sha256sum of an empty string. This is due to response.get_data() returning an empty object.

Steps to Reproduce

Upload a file or send a message to the source interface
Set up admin account and use the Journalist API to retrieve files:

a. Ensure you aren't using a staging environment, or remove the Header unset etag directive from /etc/apache2/sites-available/journalist.conf and restart Apache2
b. curl -I <download_url_of_file> and retain the value of header Etag: sha256sum:<SHA256sum goes here>
c. curl -O <download_url_of_file> and run sha256sum on the downloaded file. Observe that the hash is different from the Etag value in the previous step
d. echo -ne "" | sha256sum and observe that the hash is identical to the Etag value above
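
The comparison in steps b through d can also be scripted. The following is a minimal sketch, assuming the header value has the form sha256sum:<hex digest> as described above. The helper name etag_for is hypothetical and is not part of the SecureDrop codebase.

```python
import hashlib

# Etag value reported in this issue for every download.
OBSERVED_ETAG = "sha256sum:e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855"


def etag_for(data: bytes) -> str:
    """Build an Etag value in the 'sha256sum:<hex digest>' form (hypothetical helper)."""
    return "sha256sum:" + hashlib.sha256(data).hexdigest()


# Hashing zero bytes reproduces the observed header value (step d).
print(etag_for(b"") == OBSERVED_ETAG)

# Non-empty content yields a different value (step c).
print(etag_for(b"example file content") == OBSERVED_ETAG)
```

In a real check, the bytes passed to etag_for would be the contents of the file fetched in step c.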

Expected Behavior

The Etag value should be the sha256sum of the file

Actual Behavior

The Etag value is the sha256sum of an empty string

Comments

It makes sense that the response is empty because the file is sent as an attachment: https://github.com/freedomofpress/securedrop/blob/develop/securedrop/journalist_app/utils.py#L333

Since the hash is computed every time a file is downloaded, it might use a significant amount of server-side resources if large files are downloaded at the same time. We should consider hashing the files at creation time and storing the hash values in the database. This will also allow us to verify file integrity (e.g. when restoring backups).
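
One way to keep hashing cheap for large files is to read each file once, in fixed-size chunks, when it is created. A minimal sketch under that assumption follows. The function name sha256_of_file and the chunk size are illustrative and are not taken from the SecureDrop codebase, and the step that stores the result in the database is left out.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 64 * 1024) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks.

    Hypothetical helper: reading in chunks keeps memory use bounded
    regardless of how large the file is.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

The returned digest could then be saved alongside the file's record and reused both for the Etag header and for later integrity checks.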


heartsucker · 2019-03-27T18:15:08Z

Partial work is done here: https://github.com/freedomofpress/securedrop/tree/sha256-etag

emkll mentioned this issue Jan 15, 2019

Allow DELETE HTTP method for journalist interface #4023

Merged

7 tasks

eloquence added bug api labels Jan 15, 2019

eloquence added this to the Long Term Product Backlog milestone Jan 16, 2019

eloquence removed this from the Long Term Product Backlog milestone Mar 21, 2019

redshiftzero assigned heartsucker Mar 28, 2019

heartsucker mentioned this issue Apr 2, 2019

calculated etag as sha256 of file on disk #4314

Merged

2 tasks

redshiftzero closed this as completed in #4314 Apr 22, 2019
