fix(security): no warning when sanitizing escaped html (#9392) #9413

wkwiatek · 2016-06-21T20:35:15Z

Please check if the PR fulfills these requirements

The commit message follows our guidelines: https://github.com/angular/angular/blob/master/CONTRIBUTING.md#commit-message-format
Tests for the changes have been added (for bug fixes / features)
Docs have been added / updated (for bug fixes / features)

What kind of change does this PR introduce? (check one with "x")

[x] Bugfix
[ ] Feature
[ ] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Build related changes
[ ] CI related changes
[ ] Other... Please describe:

What is the current behavior? (You can also link to an open issue here)
#9392

What is the new behavior?
No warning when properly escaped html is passed to sanitize

Does this PR introduce a breaking change? (check one with "x")

[ ] Yes
[x] No

If this PR contains a breaking change, please describe the impact and migration path for existing applications: ...

Other information:

mprobst · 2016-06-21T21:03:50Z

Thanks for the PR!

This looks conceptually fine (we want to compare to the input, not to the intermediate parsed state), and the test looks good, but I don't understand why it works.

Shouldn't safeHtml, once we parsed and re-serialized it, also no longer contain the escaped entity, but rather the unicode code point directly? And if so, how or why is it different from the treatment of unsafeHtml (before your change), which was also just parsed and re-serialized?

Any idea what's going on there?

wkwiatek · 2016-06-21T22:05:55Z

The problem is produced by these lines:
https://github.com/angular/angular/pull/9413/files#diff-475057312e0ba8afc2f24a4df80d6eaaL249 https://github.com/angular/angular/pull/9413/files#diff-475057312e0ba8afc2f24a4df80d6eaaL243

First modifies the input so that it's no longer the initial value in some cases (as you can see unsafeHtml becomes parsedHtml). DOM.setInnerHTML(containerEl, unsafeHtml); leaves the treatment to the browser so in the end the value may be a little bit different than the one we passed in to the function.

mprobst · 2016-06-21T22:30:58Z

Assume this code, from a DevTools console session:

let d = document.createElement('div');
// <div></div>
let unsafeHtml = 'hello &#x1f680;'; // the input to sanitize
d.innerHTML = unsafeHtml; // parse it
let safeHtml = d.innerHTML; // serialize it back
// "hello 🚀" -- safeHtml now contains the actual rocket character, unescaped.
safeHtml === unsafeHtml;
// false

Turns out our code explicitly encodes entities in safeHtml, in particular surrogate pairs. That means with this change, it'll work if entities in unsafeHtml are encoded originally, but then will fail if they were not because it encodes all entities (that's what I forgot, which was confusing me here).

E.g. if you add a test case for sanitizeHtml('hellö') (with the actual Unicode character in there), this will break as the result will be 'hellö', won't it? Fundamentally, this code is not aware of what is encoded and what is not in the input at the point of the comparison, so it seems to me that there is no real winning here, is there?

wkwiatek · 2016-06-21T22:47:17Z

The code you've just pasted is actually fine and reflects the situation except of naming. Look what safeHtml is in the source: https://github.com/angular/angular/pull/9413/files#diff-475057312e0ba8afc2f24a4df80d6eaaL253.

safeHtml at the end in condition is simply not related in any way with this whole stuff we're talking about. I refer to do {} while () block which I guess is only for mXSS protection. And was accidentally overwriting input of the function.

mprobst · 2016-06-21T22:52:15Z

Could you add this test case and see what happens?

    t.it('supports sanitizing escaped entities', () => {
      t.expect(sanitizeHtml('hellö')).toEqual('hellö');
      t.expect(logMsgs).toEqual([]);
    });

wkwiatek · 2016-06-21T23:10:24Z

No problem.

Here's the output:

Expected 'hell&#246;' to equal 'hellö'.

Expected [ 'WARNING: sanitizing HTML stripped some content.' ] to equal [  ].

Both I've expected before starting tests.

mprobst · 2016-06-21T23:26:25Z

So, do you think this change is worth it, given that we fundamentally cannot fix the problem?

wkwiatek · 2016-06-22T06:29:47Z

I think that fundamentally it works pretty well. Look, sanitizeHtml('hellö') gives you the output: hellöwhich is fine (because really sanitizes the output) and also gives you warning that something was stripped. Then try to add a test like this:

t.it('supports sanitizing escaped entities', () => {
  t.expect(sanitizeHtml('hell&#246;')).toEqual('hell&#246;');
  t.expect(logMsgs).toEqual([]);
});

Now input and output are exactly the same. Both tests in this PR will succeed. However in the master second will fail because of warning message that should not be logged in this case.

I think it's still worth to add because in current version the information is just misleading.

Make sense?

mprobst · 2016-06-22T21:46:00Z

modules/@angular/platform-browser/src/security/html_sanitizer.ts

@@ -223,11 +223,11 @@ function stripCustomNsAttrs(el: any) {
 * Sanitizes the given unsafe, untrusted HTML fragment, and returns HTML text that is safe to add to
 * the DOM in a browser environment.
 */
-export function sanitizeHtml(unsafeHtml: string): string {
+export function sanitizeHtml(entryHtml: string): string {


nit: could you rename this to unsafeHtmlInput? It's kind of important here to keep track of what's safe and what isn't.

mprobst · 2016-06-22T21:47:08Z

We'll still warn people about effectively harmless changes (input changes that are not actually changing anything), but I can see your reasoning. Also, the change is harmless enough.

Could you fix the parameter name? Otherwise good to go.

mprobst · 2016-06-23T20:06:32Z

Merged. Thanks for the contribution @wkwiatek!

wkwiatek · 2016-06-23T20:36:54Z

Thanks. You're welcome!

angular-automatic-lock-bot · 2019-09-08T20:21:30Z

This issue has been automatically locked due to inactivity.
Please file a new issue if you are encountering a similar or related problem.

Read more about our automatic conversation locking policy.

_{This action has been performed automatically by a bot.}

googlebot added the cla: yes label Jun 21, 2016

vicb added flag: can be closed? area: security Issues related to built-in security features, such as HTML sanitation labels Jun 22, 2016

vicb assigned mprobst Jun 22, 2016

mprobst reviewed Jun 22, 2016
View reviewed changes

fix(security): no warning when sanitizing escaped html (angular#9392)

2d30aba

wkwiatek force-pushed the issue9392 branch from a48f9d8 to 2d30aba Compare June 23, 2016 07:31

mprobst merged commit 98cef76 into angular:master Jun 23, 2016

wkwiatek deleted the issue9392 branch June 23, 2016 20:36

angular-automatic-lock-bot bot locked and limited conversation to collaborators Sep 8, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(security): no warning when sanitizing escaped html (#9392) #9413

fix(security): no warning when sanitizing escaped html (#9392) #9413

wkwiatek commented Jun 21, 2016 •

edited

mprobst commented Jun 21, 2016

wkwiatek commented Jun 21, 2016 •

edited

mprobst commented Jun 21, 2016 •

edited

wkwiatek commented Jun 21, 2016

mprobst commented Jun 21, 2016

wkwiatek commented Jun 21, 2016

mprobst commented Jun 21, 2016

wkwiatek commented Jun 22, 2016 •

edited

mprobst Jun 22, 2016

wkwiatek Jun 23, 2016

mprobst commented Jun 22, 2016

mprobst commented Jun 23, 2016

wkwiatek commented Jun 23, 2016

angular-automatic-lock-bot bot commented Sep 8, 2019

fix(security): no warning when sanitizing escaped html (#9392) #9413

fix(security): no warning when sanitizing escaped html (#9392) #9413

Conversation

wkwiatek commented Jun 21, 2016 • edited

mprobst commented Jun 21, 2016

wkwiatek commented Jun 21, 2016 • edited

mprobst commented Jun 21, 2016 • edited

wkwiatek commented Jun 21, 2016

mprobst commented Jun 21, 2016

wkwiatek commented Jun 21, 2016

mprobst commented Jun 21, 2016

wkwiatek commented Jun 22, 2016 • edited

mprobst Jun 22, 2016

Choose a reason for hiding this comment

wkwiatek Jun 23, 2016

Choose a reason for hiding this comment

mprobst commented Jun 22, 2016

mprobst commented Jun 23, 2016

wkwiatek commented Jun 23, 2016

angular-automatic-lock-bot bot commented Sep 8, 2019

wkwiatek commented Jun 21, 2016 •

edited

wkwiatek commented Jun 21, 2016 •

edited

mprobst commented Jun 21, 2016 •

edited

wkwiatek commented Jun 22, 2016 •

edited