Add support for HTTP meta-equiv headers. #264

april · 2017-09-05T14:23:55Z

No description provided.

chuckharmston

Some suggestions, but generally looks good.

Without knowledge of the project, I suspect that the test coverage here is inadequate, and that coverage would fall substantially as a result of this PR. Might be worth looking into further.

chuckharmston · 2017-09-05T14:33:55Z

httpobs/scanner/analyzer/headers.py

+    """
+
+    # Clean out all the junk
+    csp_string = csp_string.replace('\r', '').replace('\n', '').strip()


This is probably fine, but personally I'd treat it like an array and use csp_string.splitlines(); it mirrors the nature of the data better.

It actually shouldn't be split across lines. Carriage returns should be ignored, as it's ideally all a single line. I've seen a lot of junk from various webservers, so this is my attempt to work around it.
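
For comparison, a minimal sketch of the two approaches discussed in this thread. The function names are hypothetical and not part of httpobs. One difference worth knowing: str.splitlines() also splits on boundaries other than \r and \n (form feed, for example), so the two are not strictly equivalent on unusual input.

```python
def clean_by_replace(csp_string: str) -> str:
    # Mirrors the line in the diff: drop CR and LF, then trim both ends.
    return csp_string.replace('\r', '').replace('\n', '').strip()


def clean_by_splitlines(csp_string: str) -> str:
    # The reviewer's suggestion, rejoined into one string because the
    # policy is meant to be a single line.
    return ''.join(csp_string.splitlines()).strip()


raw = "default-src 'self';\r\n img-src *\n"
print(clean_by_replace(raw))
print(clean_by_splitlines(raw))
```

On ordinary CR/LF junk both helpers give the same result; they diverge only when the value contains other line-boundary characters.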

chuckharmston · 2017-09-05T14:37:17Z

httpobs/scanner/analyzer/headers.py

+    # So technically the shortest directive is img-src, so lets just assume that
+    # anything super short is invalid
+    if len(csp_string) < 6 or csp_string.isspace():
+        raise ValueError


I don't like the use of a magic number here. I'd suggest defining it as a constant and comparing against it:

SHORTEST_DIRECTIVE = 'img-src'
SHORTEST_DIRECTIVE_LENGTH = len(SHORTEST_DIRECTIVE)

Then in addition to having more obvious and legible code, you can import these and use them as comparators for tests.

Fixed, thank you.
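
A minimal sketch of what the suggested constant could look like in use. The helper name check_minimum_length and the error message are assumptions for illustration; the merged change is not shown in this thread. Note that len('img-src') is 7, whereas the diff above compares against the literal 6, so deriving the threshold from the constant changes the cutoff slightly.

```python
SHORTEST_DIRECTIVE = 'img-src'
SHORTEST_DIRECTIVE_LENGTH = len(SHORTEST_DIRECTIVE)


def check_minimum_length(csp_string: str) -> None:
    # Hypothetical helper: reject anything shorter than the shortest
    # directive, or anything that is only whitespace.
    if len(csp_string) < SHORTEST_DIRECTIVE_LENGTH or csp_string.isspace():
        raise ValueError('CSP is shorter than the shortest directive')
```

A test can then import SHORTEST_DIRECTIVE_LENGTH and build boundary inputs from it instead of repeating the number.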

chuckharmston · 2017-09-05T14:40:35Z

httpobs/scanner/analyzer/headers.py

+
+        # Technically the path part of any source is case-sensitive, but since we don't test
+        # any paths, we can cheat a little bit here
+        values = set([_.lower() for _ in entry[-1].split()]) if len(entry) > 1 else {'\'none\''}


A nit, but maybe give _ a meaningful name? It makes the line awkwardly long, but for production Python code I try to avoid single-char variable names, even in comprehensions, as it has the potential to conflict with pdb.set_trace() commands.

Changed to source, as defined in the CSP specification. Thanks!
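
A sketch of the same expression with the renamed variable, pulled into a function so the line length is less of a problem. The function name normalize_sources and the assumed shape of entry (a list holding the directive name, optionally followed by one string of space-separated sources) are illustrative guesses, not taken from the merged code.

```python
def normalize_sources(entry: list) -> set:
    # `entry` is assumed to be [directive] or [directive, "src1 src2 ..."].
    if len(entry) > 1:
        # Lowercasing is the "cheat" described in the diff comment: paths
        # are case-sensitive, but no paths are tested.
        return {source.lower() for source in entry[-1].split()}
    # A directive with no sources is treated as 'none', as in the diff.
    return {"'none'"}
```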

chuckharmston · 2017-09-05T14:48:27Z

httpobs/tests/unittests/files/test_parse_http_equiv_headers_case_insensitivity.html

+<body>
+
+</body>
+</html>


Missing the trailing line endings on all these files.

chuckharmston · 2017-09-05T14:49:56Z

httpobs/scanner/analyzer/headers.py

+           for source in script_src) and '\'unsafe-inline\'' in script_src:
+        script_src.remove('\'unsafe-inline\'')
+
+    # Now to make the piggies squeal


chuckharmston · 2017-09-05T14:51:38Z

httpobs/scanner/analyzer/headers.py

+        # While technically valid in that you just use the first entry, we are saying that repeated
+        # directives are invalid so that people notice it
+        if directive in csp:
+            raise ValueError


Since you're raising ValueErrors for multiple reasons, I'd suggest doing one of two things:

Raising instances of ValueError with a message, e.g. raise ValueError('Repeated directives are invalid')

More complicated but probably more correct: create a series of custom error classes for each possible exception, so in the future you can explicitly except specific exception types.

Class HTTPObservatoryException(Exception):
    Pass

Class CSPParsingException(HTTPObservatoryException):
    Pass

Class DirectiveLengthException(CSPParsingException):
    Pass

Class RepeatedDirectiveException(CSPParsingException):
    Pass

(I'm on an iPad so first lines are capitalizing, but you get the idea.)

Fixed, with the first option. I may eventually do the latter, but the exceptions are all handled, so they don't actually percolate upward in a useful manner; only the failure does.
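
A runnable sketch of the second option, for reference. The class names follow the reviewer's suggestion; parse_directives is a toy parser written for this illustration and is not the project's __parse_csp.

```python
class HTTPObservatoryException(Exception):
    pass


class CSPParsingException(HTTPObservatoryException):
    pass


class DirectiveLengthException(CSPParsingException):
    pass


class RepeatedDirectiveException(CSPParsingException):
    pass


def parse_directives(csp_string: str) -> dict:
    # Toy parser for illustration only: split on ';', then split each
    # part into a directive name and its sources.
    csp = {}
    for part in csp_string.split(';'):
        entry = part.strip().split(maxsplit=1)
        if not entry:
            continue  # skip empty segments such as a trailing ';'
        directive = entry[0].lower()
        if directive in csp:
            raise RepeatedDirectiveException(
                'Repeated directives are invalid: ' + directive)
        csp[directive] = entry[1].split() if len(entry) > 1 else []
    return csp
```

Because every class derives from CSPParsingException, a caller can catch the base class to mark a policy invalid, or a specific subclass when the reason matters.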

chuckharmston · 2017-09-05T14:54:30Z

httpobs/scanner/analyzer/headers.py

+            'equiv': __parse_csp(response.http_equiv.get('Content-Security-Policy'))
+            if 'Content-Security-Policy' in response.http_equiv else None,
+        }
+    except:


Naked excepts are really bad practice. I know it's catching some expected exceptions here, but it could potentially mask other errors that are unexpected. You should whitelist a set of expected exception types, e.g. except (ValueError) as e:

My problem is I'm not entirely sure what it might catch here, since the data coming from the client can be so weird. It should be just ValueError looking at my code, but I'm not sure if there are some weird subsets of Unicode or null characters or other weirdness that might trigger an exception. I really just want to catch any weird/broken header and mark it as invalid.
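
One possible middle ground, sketched under assumptions: handle the expected ValueError quietly, and catch Exception rather than using a bare except for everything else, logging it so unexpected failures stay visible. Unlike a bare except, except Exception does not swallow KeyboardInterrupt or SystemExit. The helper name parse_or_invalid and its (result, is_valid) return shape are hypothetical.

```python
import logging

logger = logging.getLogger(__name__)


def parse_or_invalid(parser, raw):
    """Return (parsed, True) on success, or (None, False) if parsing fails."""
    try:
        return parser(raw), True
    except ValueError:
        # Expected: the parser rejected a malformed header.
        return None, False
    except Exception:
        # Unexpected: still mark the header invalid, but record the
        # traceback so the underlying bug can be found later.
        logger.exception('Unexpected error while parsing header')
        return None, False
```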

chuckharmston · 2017-09-05T14:57:08Z

httpobs/scanner/analyzer/headers.py

-    else:
-        output['result'] = 'csp-not-implemented'
+    # Code defensively on the size of the data
+    output['data'] = csp if len(str(csp)) < 32768 else {}


Would it make more sense to trim the string rather than remove it if it's long? Either csp[:32768] or textwrap.shorten (in 3.4+).

I specifically just drop it because I don't want a truncated string to mess up future database queries, when I'm looking for feature usage.
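
A small sketch contrasting the two behaviours. The names MAX_DATA_LENGTH, bounded, and truncated are hypothetical; the limit is the 32768 from the diff. Since the diff measures len(str(csp)), csp is presumably a parsed structure rather than a string at that point, which is an assumption here.

```python
MAX_DATA_LENGTH = 32768


def bounded(data, limit=MAX_DATA_LENGTH):
    # Behaviour from the diff: keep the data only if its string form is
    # under the limit, otherwise store an empty dict.
    return data if len(str(data)) < limit else {}


def truncated(text: str, limit=MAX_DATA_LENGTH) -> str:
    # The alternative raised in review: keep a prefix. The cut can land
    # in the middle of a source, leaving a value that was never sent.
    return text[:limit]


print(truncated('img-src https://example.com', 12))
```

With a limit of 12, the truncated value ends in a partial source, which is the kind of stored data that could skew later feature-usage queries.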

chuckharmston · 2017-09-05T14:57:27Z

httpobs/scanner/analyzer/headers.py

@@ -152,6 +206,8 @@ def cookies(reqs: dict, expectation='cookies-secure-with-httponly-sessions') ->
                'cookies-session-without-httponly-flag',
                'cookies-session-without-secure-flag']

+    # TODO: Support cookies set over http-equiv (ugh)


File a bug for this and add the URL to it here?

Done, see #265.

Add support for HTTP meta-equiv headers.

f0037be

chuckharmston approved these changes Sep 5, 2017

april added 4 commits September 6, 2017 11:03

Add trailing carriage returns on HTML test files

351ac8b

Fix issues identified by @chuckharmston during review

df0a824

Store whether a header or meta (or both) was used

332335f

Change header to http, for consistency

36a6c9c

april merged commit 5d227eb into mozilla:master Sep 6, 2017

This was referenced Sep 6, 2017

CSP and Referrer-Policy meta tag not recognized #105

Closed

Parse meta for CSP #209

Closed


