HTML, CSS, and JSON modules shouldn't solely rely on MIME type to change parsing behavior #839

rniwa · 2019-09-18T04:51:35Z

As we discussed in the TPAC 2019 Web Components session, the current proposal / spec of HTML, CSS, and JSON modules does not specify the type of content in the import statement.

This is problematic because an import statement that is intended to load CSS or JSON and not execute arbitrary scripts could end up executing scripts if the destination server's MIME type gets changed or the destination server gets compromised.

In general, we've made it so that the importer of any content can specify how the imported content should be parsed & processed. This is one of the motivations for adding CORS fetch for JSON as opposed to JSONP, for example.

rniwa · 2019-09-18T04:51:56Z

@annevk @dandclark @domenic

Jamesernator · 2019-09-18T05:41:42Z

I think there are two separate parts to this:

1. Treating some response as another/specific format
2. Preventing evaluation of unexpected formats

I personally think the second one should be part of Content-Security-Policy rather than changing anything about the current module loading, e.g. Content-Security-Policy: modules self *, https://some.config.tld json, https://fonts.somewhere.tld css.

For the first one, maybe we could extend the import: protocol to have import+css:, import+html:, etc. to force it into a specific format.
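
As a rough illustration of the CSP idea above, the sketch below models a per-origin allowlist of module types. No `modules` directive exists in CSP; the policy shape, the `isModuleAllowed` name, the `https://app.example` page origin, and the reading of `self *` as "same origin may load any type" are all assumptions made up for this example.

```javascript
// Hypothetical model of the "modules" CSP directive floated above.
// Nothing here is a real CSP feature; the policy format is an assumption.
const policy = [
  { source: "self", types: ["*"] },
  { source: "https://some.config.tld", types: ["json"] },
  { source: "https://fonts.somewhere.tld", types: ["css"] },
];

// Returns true when some policy entry covers both the module's origin
// and the type the module would be evaluated as.
function isModuleAllowed(policy, pageOrigin, moduleUrl, moduleType) {
  const origin = new URL(moduleUrl).origin;
  return policy.some(({ source, types }) => {
    const sourceMatches =
      source === "self" ? origin === pageOrigin : origin === source;
    return sourceMatches && (types.includes("*") || types.includes(moduleType));
  });
}

const page = "https://app.example"; // hypothetical page origin
const allowedJson = isModuleAllowed(
  policy, page, "https://some.config.tld/settings", "json");
const blockedScript = isModuleAllowed(
  policy, page, "https://some.config.tld/settings", "javascript");
```

Under these assumptions, the config origin may supply JSON, but the same URL is refused if it would be evaluated as script.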

trotyl · 2019-09-18T11:05:46Z

> This is problematic because an import statement that is intended to load CSS or JSON and not execute arbitrary scripts could end up executing scripts if the destination server's MIME type gets changed or the destination server gets compromised.

I believe this functionality is required for polyfilling: to support browsers that do not yet support HTML/JSON/CSS modules, a server can just respond with a corresponding JavaScript file that wraps the original content in a default export, which provides a consistent code style on the consumer side.
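
A minimal sketch of the server-side wrapping described above, for the JSON case only. The `wrapJsonAsModule` helper and the sample payload are hypothetical, not part of any existing polyfill.

```javascript
// Hypothetical server-side helper: wrap a JSON payload in a JavaScript module
// so that browsers without JSON module support can still `import` it.
function wrapJsonAsModule(jsonText) {
  // Parse first so malformed input is rejected on the server, then
  // re-serialize so the module body is generated from parsed data
  // rather than copied verbatim from the input.
  const data = JSON.parse(jsonText);
  return `export default ${JSON.stringify(data)};`;
}

const moduleSource = wrapJsonAsModule('{ "theme": "dark", "columns": 3 }');
// moduleSource is: export default {"theme":"dark","columns":3};
```

The response would then need to be served with a JavaScript MIME type, which is exactly the MIME-dependent switch this issue is concerned about.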

rniwa · 2019-09-18T13:47:56Z

> > This is problematic because an import statement that is intended to load CSS or JSON and not execute arbitrary scripts could end up executing scripts if the destination server's MIME type gets changed or the destination server gets compromised.

> I believe this functionality is required for polyfilling: to support browsers that do not yet support HTML/JSON/CSS modules, a server can just respond with a corresponding JavaScript file that wraps the original content in a default export, which provides a consistent code style on the consumer side.

Weakening the security model for the sake of polyfilling is an unacceptable trade off in our view.

justinfagnani · 2019-09-18T16:04:18Z

Imports are inherently dangerous, and this is why CSP allows for restricting the origin for scripts. That should apply to all imports, regardless of type.

Importing JSON should only be done for trusted sources, and shouldn't in general be done for calling third party APIs (and arguably shouldn't be done for 1st party APIs if you don't want your module graph to fail to load if the API call fails). fetch() is much more appropriate for that.

Regarding server-side polyfills, this should still work if the client sends the appropriate Accept header.

rniwa · 2019-09-18T22:33:14Z

> Imports are inherently dangerous, and this is why CSP allows for restricting the origin for scripts. That should apply to all imports, regardless of type.

Given that many websites don't use CSP correctly, relying on websites to correctly deploy CSP to get the right security behavior is not a great plan.

Furthermore, today, if you were to fetch JSON via XHR or fetch API and parse it via JSON.parse, there is no chance of the fetched JSON suddenly executing as scripts. This new module loading mechanism, therefore, is a functional regression from existing loading mechanisms.

We believe this security issue is a show stopper issue for HTML, CSS, and JSON modules.
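
To make the comparison with fetch + JSON.parse concrete, here is a small sketch. The `parseAsData` helper and both response bodies are made up for illustration; the point is only that JSON.parse either returns data or throws, and never evaluates its input.

```javascript
// Treat a response body strictly as data: JSON.parse either returns a value
// or throws a SyntaxError; it never executes the text as script.
function parseAsData(body) {
  try {
    return { ok: true, value: JSON.parse(body) };
  } catch (err) {
    return { ok: false, error: err.name };
  }
}

// Hypothetical response bodies: one honest, one swapped for script by a
// compromised or misconfigured server.
const honest = parseAsData('{"theme": "dark"}');
const swapped = parseAsData('globalThis.pwned = true; export default {};');
// honest.ok is true; swapped.ok is false and globalThis.pwned stays undefined.
```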

annevk · 2019-09-19T00:22:42Z

Yeah, Mozilla does as well. Importing non-scripts should be safe by default. (If HTML modules end up executing script they might not necessarily be problematic.)

matthewp · 2019-09-19T01:29:39Z

What is the counter proposal?

dandclark · 2019-09-19T01:31:36Z

I think @justinfagnani's point here is interesting and I want to second it. Even aside from security concerns, JSON imports seem like the wrong tool for consuming non-first-party JSON/CSS. If the request 404's or if, say, the JSON has a parse error, the entire module graph will fail to instantiate/execute. fetch() is the right tool in this scenario, with import being reserved for content that is directly controlled by the importer. Otherwise all your module scripts could fail to run because of a missing { in third-party JSON...

annevk · 2019-09-19T04:22:47Z

That doesn't apply to import() afaik.

Jamesernator · 2019-09-20T01:23:06Z

> If the request 404's or if, say, the JSON has a parse error, the entire module graph will fail to instantiate/execute.

This might actually be what you want in some cases, e.g. if a critical resource does fail to load then you don't want to waste resources evaluating a bunch of modules you don't need. You can always use import() to catch errors in that particular case and run some recovery code or retry.
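
A sketch of that recovery pattern with dynamic import(). The `loadOrFallback` name is hypothetical, and the loader is passed in as a function so that the rejection is handled by the caller instead of failing a static module graph.

```javascript
// `loader` is expected to be something like () => import("./config.json").
// It is a parameter so the caller decides what to import and when.
async function loadOrFallback(loader, fallback) {
  try {
    const mod = await loader();
    return mod.default;
  } catch (err) {
    // A network failure, an unexpected MIME type, or a parse error rejects
    // the promise returned by import(); the surrounding modules keep running.
    return fallback;
  }
}
```

A caller might write `loadOrFallback(() => import("./config.json"), { theme: "light" })`; both the specifier and the fallback value are placeholders.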

rniwa · 2019-09-20T01:38:08Z

> I think @justinfagnani's point here is interesting and I want to second it. Even aside from security concerns, JSON imports seem like the wrong tool for consuming non-first-party JSON/CSS.

Regardless of whether it's a good idea or not, some web developers are inevitably going to do it. We shouldn't be adding a new foot gun to the Web so that authors can avoid using it in certain situations to avoid creating a new security surface.

caridy · 2019-09-20T01:51:27Z

I can only second @rniwa here, @justinfagnani's point included a lot of "should/shouldn't", and we all know how that goes. Importing non-scripts must be safe by default.

littledan · 2019-09-20T17:03:54Z

Is it OK for WebAssembly modules to have parsing behavior based on their MIME type? That's what the current proposal does.

joeldenning · 2019-09-20T22:04:34Z

> HTML, CSS, and JSON modules do not specify the type of content in the import statement.

Are there any proposals / ideas that could solve these issues?

At the risk of suggesting something naive, could the file extension be the way that the import statement specifies its expected module type?

// both the file extension and mime type must be present for the non-script module to load
import 'file.json';
import 'file.css';
import 'file.html';

The obvious limitation of this approach is that you can't load json, css, and html modules from urls without the proper file extension. The advantage is that it doesn't require a rework of import statements and import maps to make html, css, and json modules work.
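
The rule suggested above could be modeled roughly as follows. This is only a sketch: the `moduleTypeFor` helper, the extension-to-MIME table, and the example URLs are assumptions, and no browser implements this check.

```javascript
// Hypothetical rule: a non-script module loads only when the URL's file
// extension and the response MIME type agree.
const EXPECTED_MIME = {
  ".json": "application/json",
  ".css": "text/css",
  ".html": "text/html",
};

function moduleTypeFor(url, mimeType) {
  const path = new URL(url).pathname;
  const dot = path.lastIndexOf(".");
  const extension = dot === -1 ? "" : path.slice(dot).toLowerCase();
  const expected = EXPECTED_MIME[extension];
  if (expected === undefined) {
    // No recognized extension: left to the ordinary JavaScript module path.
    return "javascript";
  }
  if (mimeType !== expected) {
    throw new TypeError(`"${extension}" requires ${expected}, got ${mimeType}`);
  }
  return extension.slice(1);
}

const asJson = moduleTypeFor("https://example.com/file.json", "application/json");
// An extensionless URL never becomes a JSON module, whatever the MIME type:
const extensionless = moduleTypeFor("https://example.com/data", "application/json");
```

The last call shows the limitation mentioned above: JSON served from a URL without a `.json` suffix cannot be loaded as a JSON module under this rule.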

annevk · 2019-09-21T00:06:42Z

Apart from plugins there's no precedent for putting meaning in file extensions within the web, I don't think we should start here.

rniwa · 2019-09-21T06:10:54Z

> Is it OK for WebAssembly modules to have parsing behavior based on their MIME type? That's what the current proposal does.

That’s not ideal but less problematic because the expectations when loading WASM & JS are similar: they execute arbitrary scripts. It’s not so with CSS & JSON.

littledan · 2019-09-28T10:39:14Z

If we had syntax in JavaScript for asserting the MIME type (mandatory for JSON modules, and optional for JavaScript), would that address this concern? If so, we can look into this issue in TC39; I don't think we have considered it before. As a strawperson, it could look like this:

import document from "./foo.json" with mime: "application/json";

This could be a basis for sending metadata to the host environment (HTML, Node.js, etc). Thoughts? Did people have other specific ideas for what this should look like?

annevk · 2019-09-28T15:41:52Z

Nothing concrete was discussed, but yeah, something like that would address the concern. And equivalent for import(). Instead of the MIME type it might be nicer to assert it being "json" or "css" or some such.

ljharb · 2019-09-28T15:44:32Z

The author is the only true arbiter of the parse goal of content; would this be an assertion, or would it change the parsing like script type does, for example?

littledan · 2019-09-28T15:50:52Z

@ljharb I imagine we would continue to use MIME type in conjunction with the declaration within JS. That's why I used the word "assertion".

devsnek · 2019-09-29T05:40:49Z

how about the CSP idea but the default is disallow instead of allow? I think the layering of extending import syntax is a bit iffy, since it adds host-specific semantics to an otherwise generic bit of code, and then random hosts have to be like "wait what do i do with this"

annevk · 2019-09-29T08:43:01Z

It doesn't really seem host-specific to be able to import JSON without that resulting in script execution later on.

devsnek · 2019-09-29T16:41:10Z

as an example, node requires you to write .json in the import specifier. we would have no use for such a syntactic extension. jumping a bit further, if someone writes import x as 'application/json', but a host doesn't use MIME types, what do they do with that?

littledan · 2019-09-29T17:06:45Z

Right, somehow, this is syntactically redundant in environments where the interpretation is implied by the module specifier's suffix already. The web doesn't have a tradition of making such judgements.

I suppose the analogous (and unprecedented?) thing here would be something like requiring that, if the MIME type is application/json, then the module specifier must end in .json, and prohibiting JS modules with this suffix. But such a scheme faces web compatibility issues with growing over time.

If we require this syntax for JSON modules on the web, I think there is some chance that a common authoring format will omit the assertions, and tools will insert it when generating web output as part of a build process. But there is also a chance that we can convince most people to write this directly.

devsnek · 2019-09-29T17:13:03Z

sorry, to be more direct with what i'm trying to say: given security is host specific, shouldn't this assertion be out-of-band from the import? at worst you would have hosts ignoring a check people expected would be enforced.

annevk · 2019-09-30T06:56:07Z

Inferring meaning from a file extension is incompatible with the web's architecture. Requiring out-of-band annotations seems like it would create such bad ergonomics the feature would effectively not be used on the web.

tilgovi · 2019-11-19T20:10:44Z

Is there a solution discussed anywhere already that can take inspiration from import maps to provide loaders declared out of band?

justinfagnani · 2019-11-19T21:50:58Z

@tilgovi I think OOB has significant DX and usability downsides. See my comment here: tc39/proposal-import-attributes#13 (comment)

justinfagnani · 2019-12-06T18:46:46Z

> > Is it OK for WebAssembly modules to have parsing behavior based on their MIME type? That's what the current proposal does.

> That’s not ideal but less problematic because the expectations when loading WASM & JS are similar: they execute arbitrary scripts. It’s not so with CSS & JSON.

I've been thinking about this a bit today... WASM doesn't have access to the DOM, correct? So an author could assume a WASM module has restricted access if it's not explicitly passed functions that allow it DOM access. If a file previously served as application/wasm was later served as application/javascript, would this present a similar security concern?

littledan · 2019-12-07T00:50:25Z

The not-yet-implemented-in-any-browser Wasm/ESM integration proposal gives Wasm the same level of privilege as JavaScript by design. This proposal allows importing arbitrary JS modules (including cross-origin), which could export functions that manipulate the DOM but have signatures which are just based on numerics, so it would be importable and usable from Wasm. The goal is to allow transparent interaction.

Ciantic · 2020-06-14T22:34:34Z

If this is about validating the input given by import, then could the import be augmented with function?

import someJson from "./some.json" with JSON.parse;
import someTextFile from "./some.txt" with String;

// Going further, one could define own parser:
function MyParser(content, url, mimetype) {
  let sheet = new CSSStyleSheet();
  sheet.replaceSync(content);
  return sheet;
}
import someCssStylesheet from "./someother.css" with MyParser;

Having the parser as a plain function that one can define would make it easy to introduce other logic in the future. It's not uncommon to have different ways to parse e.g. *.css imports.

It would even allow returning a module-like result, because you could return an object from your parser function, e.g.:

function AnotherParser(content, url, mimetype) {
  // ...
  return { justThis: "Hello world", orThis: 5 };
}
import { justThis, orThis } from "./interesting-url" with AnotherParser;

It's of course debatable what arguments the parser should get: a string, an object with more details including the MIME type, and possibly the URL?
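
The `with Parser` syntax above is not valid JavaScript, but the idea can be approximated in userland with fetch(). The sketch below is one possible shape, not a proposal: `loadWith` is a hypothetical name, and `fetchImpl` is an injectable parameter added here so the function can be exercised without a network.

```javascript
// Userland approximation of the `with Parser` idea: fetch the text, then hand
// it to a caller-chosen parser along with the URL and the Content-Type header.
async function loadWith(url, parser, fetchImpl = fetch) {
  const response = await fetchImpl(url);
  if (!response.ok) {
    throw new Error(`Failed to load ${url}: ${response.status}`);
  }
  const mimeType = response.headers.get("content-type");
  const content = await response.text();
  // The parser only ever receives a string, so nothing is executed unless
  // the parser itself chooses to evaluate it.
  return parser(content, url, mimeType);
}
```

For example, `loadWith("./some.json", (content) => JSON.parse(content))` would resolve to the parsed data, and a parser like `MyParser` above could be passed in the same way.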

dandclark · 2020-09-29T00:05:13Z

With the advancement of import assertions to Stage 3 in TC39, can we consider this issue resolved?

jerrygreen · 2020-09-29T01:24:26Z

@dandclark I personally still don't see any info on how that would be resolved when there's no assertion. Will this:

import json from "./foo.json"

still use MIME-type?

I opened this question in the proposal, calling it default assertions: tc39/proposal-import-attributes#101

justinfagnani · 2020-09-29T01:31:45Z

@jerrygreen import json from "./foo.json" will indeed use the mime-type of the response and fail if it's not a JavaScript type. Edit: in browsers.
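
The behavior described above can be modeled roughly as follows. This is a simplified sketch, not the HTML Standard's algorithm: `moduleKind` is a hypothetical name and the MIME lists are shortened for illustration. With the Stage 3 syntax of the time, the asserted type would come from `import json from "./foo.json" assert { type: "json" }`.

```javascript
// Simplified model: the asserted type says what the importer expects, and the
// response MIME type must agree with it. The MIME sets below are deliberately
// incomplete.
const JS_MIME = new Set(["text/javascript", "application/javascript"]);
const JSON_MIME = new Set(["application/json"]);

function moduleKind(mimeType, assertedType) {
  if (assertedType === undefined) {
    // No assertion: only a JavaScript MIME type is acceptable.
    if (JS_MIME.has(mimeType)) return "javascript";
    throw new TypeError(`Expected a JavaScript MIME type, got "${mimeType}"`);
  }
  if (assertedType === "json") {
    // Asserted JSON: a script response is refused instead of executed.
    if (JSON_MIME.has(mimeType)) return "json";
    throw new TypeError(`Expected a JSON MIME type, got "${mimeType}"`);
  }
  throw new TypeError(`Unsupported module type "${assertedType}"`);
}
```

In this model a JSON response without an assertion fails to load, and a response whose MIME type changes to JavaScript fails an import that asserted `"json"`, which is the property the issue asked for.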

annevk · 2020-09-29T09:27:31Z

Yeah, let's close this. In response to this issue:

We removed JSON modules from HTML for the time being: Revert JSON modules whatwg/html#4943.
TC39 worked out import assertions, currently stage 3.
Integration with import-attributes whatwg/html#5640 discusses integration of that with HTML and links PRs for that as well as JSON modules.

tclzcja · 2020-09-29T10:16:27Z

Sorry for commenting on a closed issue, but does that mean HTML/JSON/CSS Modules can only land after import assertion gets to Stage 4?

In other words, is Import Assertion the blocker on HTML/JSON/CSS Modules?

annevk · 2020-09-29T10:17:48Z

Stage 3 is fine for integration with the HTML Standard.

dandclark mentioned this issue Sep 26, 2019

Introduce CSS Module Script whatwg/html#4898

Merged

3 tasks

annevk added a commit to whatwg/html that referenced this issue Sep 28, 2019

Revert JSON modules

d218a6e

As explained at WICG/webcomponents#839 the current setup is insecure. This reverts db03474.

annevk mentioned this issue Sep 28, 2019

Revert JSON modules whatwg/html#4943

Merged

2 tasks

littledan mentioned this issue Nov 19, 2019

Inline vs out of line module attributes tc39/proposal-import-attributes#13

Closed

Janpot mentioned this issue Nov 25, 2019

Permissionless static relative imports are dangerous denoland/deno#3401

Closed

littledan mentioned this issue Dec 4, 2019

WebAssembly integration with ECMAScript modules w3ctag/design-reviews#377

Closed

2 tasks

dandclark mentioned this issue Dec 4, 2019

CSS Modules w3ctag/design-reviews#405

Closed

4 tasks

This was referenced Jan 11, 2020

Import maps mozilla/standards-positions#146

Closed

createScript / fetch / shouldFetch hooks systemjs/systemjs#2058

Merged

tantek mentioned this issue Feb 24, 2020

Review different cross-domain import mechanisms and their security models w3ctag/design-principles#157

Closed

GeoffreyBooth mentioned this issue Apr 1, 2020

type: module should also enable experimental-json-modules nodejs/node#32314

Closed

dandclark mentioned this issue Aug 5, 2020

import assertions RFC nodejs/modules#427

Closed

JeremiePat mentioned this issue Sep 18, 2020

Implicite assert type tc39/proposal-json-modules#4

Closed

jerrygreen mentioned this issue Sep 29, 2020

Default assertion based on extension tc39/proposal-import-attributes#101

Closed

annevk closed this as completed Sep 29, 2020

littledan mentioned this issue Jan 19, 2021

Note interaction with import assertions WebAssembly/esm-integration#45

Merged

jugglinmike mentioned this issue May 21, 2021

Rationale for optional exception tc39/proposal-json-modules#16

Open

dandclark mentioned this issue Jul 27, 2021

Reland JSON module scripts whatwg/html#5658

Merged

3 tasks

adajuly mentioned this issue Dec 19, 2021

New ES features: import assertions adajuly/blog#3

Open

azu mentioned this issue Apr 6, 2022

JS for 2022-04-05: React 18, Vite 2.9.0, Firefox's new performance tool jser/jser.github.io#974

Merged

nayeemrmn mentioned this issue Apr 23, 2023

Inferring module type from MIME type vs type attribute needs clarification tc39/proposal-import-attributes#140

Closed

colinrotherham mentioned this issue Jun 29, 2023

Default to ES modules with single Rollup config alphagov/govuk-frontend#3726

Merged
