Define identity of a web app. #272

marcoscaceres · 2014-11-10T19:15:35Z

What identifies an app? An origin?
How does one update the app?
What happens if the scope changes?

marcoscaceres · 2014-11-10T20:45:48Z

//cc @sicking

@mounirlamouri do you have any thoughts on the above?

benfrancis · 2014-12-17T20:20:26Z

The obvious answer is the manifest URL. Are there any other suggestions?

sicking · 2014-12-17T20:33:30Z

Given the lack of other ideas, I think we should simply go with the manifest URL, yes.

marcoscaceres · 2014-12-18T01:03:59Z

I wouldn't say there is a "lack of ideas" - we just haven't gotten around to this yet.

sicking · 2014-12-18T07:39:34Z

I think this needs to be a very high priority as it's likely to affect a lot of other features. For example the ServiceWorker registration API might affect what we do here.

marcoscaceres · 2014-12-20T06:11:14Z

Prioritized. Will work on this next.

opoto · 2014-12-30T20:13:21Z

Shouldn't the spec state that the identifier is the manifest's canonicalized URL, so that:
HTTP://my.domain.com:80/app/manifest.json
and
http://my.domain.com/app/images/../manifest.json
are considered the same app identifier?

sicking · 2014-12-30T23:19:05Z

Yeah. That sounds right. Is there a difference between canonical URL and resolved URL? I.e. do you need to do anything extra to get the canonical URL after you resolve a URL against its base-URL?

marcoscaceres · 2014-12-31T01:21:55Z

To clarify, URLs are not their serialized string representations - they are objects. Once a string is parsed to a URL, paths are normalized. Hence, there is no such thing as canonical URLs or resolved URLs. There are just URLs.

The spec always treats URLs as being in their object form (being parsed from string input) - so adding clarifications like the above wouldn't really help too much.
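
A minimal sketch of the point above, using the WHATWG `URL` class available in browsers and Node.js. The helper name `sameManifestIdentity` is hypothetical and only for illustration; the spec does not define such a function. Both strings from the earlier question parse to the same URL, so no separate canonicalization step is needed.

```typescript
// Hypothetical helper: two manifest URL strings identify the same app
// if they parse to the same URL (compared here via the serialized href).
function sameManifestIdentity(a: string, b: string): boolean {
  return new URL(a).href === new URL(b).href;
}

// Parsing lowercases the scheme and host, drops the default port,
// and resolves ".." path segments.
const first = new URL("HTTP://my.domain.com:80/app/manifest.json");
const second = new URL("http://my.domain.com/app/images/../manifest.json");

console.log(first.href);  // http://my.domain.com/app/manifest.json
console.log(second.href); // http://my.domain.com/app/manifest.json
console.log(sameManifestIdentity(first.href, second.href)); // true
```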

opoto · 2014-12-31T13:47:32Z

This makes sense.
Thanks for the clarification. Maybe it should be in the spec? Or maybe this is obvious to anyone else but me... BTW, I checked the "URL" dfn link, but it leads to https://url.spec.whatwg.org/#url-parsing, whereas I guess it should be https://url.spec.whatwg.org/#concept-url.

marcoscaceres · 2015-01-01T04:24:27Z

@opoto thanks! Fixed the busted link. There is some ongoing work to fix cross-document references in the tool I'm using to generate the spec (ReSpec). That should allow for more seamless jumping between concepts and their definitions. I'm a bit hesitant to add (non-normative) clarifications for concepts defined in other specs, as every time I've done that it's not ended well: either the other document changes and there is a slight mismatch in definition (leading to confusion), or the Editor of the other spec gets upset because I'm redefining their stuff.

marcoscaceres · 2015-01-13T06:39:58Z

Elsewhere, I proposed that identity be handled by the OS.

jmajnert · 2015-01-13T07:55:12Z

@marcosc - In a2e8c31#commitcomment-9254539 you wrote:

With this current proposal, the identity of the app will change if the start URL changes (e.g. from index.php to index.html). If an already installed app had its start URL changed in the manifest, it would stop getting updates and be orphaned on the device.

Its identity would simply be updated to reflect the new start URL, and the manifest URL would remain the same. The application is updated from the manifest and hence, so long as the manifest can be accessed, an update can take place.

I'm afraid that I don't fully understand your notion of identity. Why do we even need to have application identity in the manifest spec? At one point the manifest was regarded as just another resource with additional info about the app.
Application identity is hard to do right even in closed ecosystems like Android or iOS (author certs, setting up dev accounts, hosting apps in app-stores etc). Aren't we overreaching a bit when trying to define app identity for the whole web?

jmajnert · 2015-04-13T08:42:53Z

I agree with @marcosc - "authoring requirements" and "best practices" have no meaning for implementations and thus will be ignored in the real world.

benfrancis · 2015-04-13T11:20:14Z

OK. How about if the "name" field was compulsory, and the start_url was resolved against the manifest URL (an absolute URL can be used if it needs to be cross-origin)? These are things I'd quite like to see anyway, and they would make sharing a manifest between apps quite impractical.

jmajnert · 2015-04-13T11:56:02Z

If we're starting to think of manifests as standalone resources, we need to make start_url obligatory (and maybe not "purely advisory" as well).

benfrancis · 2015-04-13T13:26:22Z

Yes, or make start_url have a default of "/" or the directory of the manifest.
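
A rough sketch of the two defaults suggested here, assuming start_url is resolved against the manifest URL as proposed earlier in the thread. The function names and the example URL are hypothetical, not taken from the spec.

```typescript
// Default of "/": the root of the manifest's origin.
function defaultStartUrlRoot(manifestUrl: string): string {
  return new URL("/", manifestUrl).href;
}

// Default of "the directory of the manifest": "." resolved against the manifest URL.
function defaultStartUrlDirectory(manifestUrl: string): string {
  return new URL(".", manifestUrl).href;
}

const exampleManifest = "http://foo.com/app/manifest.json"; // illustrative URL
console.log(defaultStartUrlRoot(exampleManifest));      // http://foo.com/
console.log(defaultStartUrlDirectory(exampleManifest)); // http://foo.com/app/
```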

jmajnert · 2015-04-13T13:34:30Z

Yes. Exactly. It has to point somewhere.

marcoscaceres · 2015-04-29T18:59:16Z

For the record....

Different understandings of the role of the web manifest

This document discusses pros and cons that can arise with the "additive" approach, defined in detail below, being taken in the current standardization of the web manifest. This document proposes an alternative approach that treats the manifest as "authoritative metadata" about a web application. What we mean by authoritative is also described in detail below.

Our alternative "authoritative" approach is not without its own set of pros and cons, but Mozilla would like to present it to other implementers for consideration - particularly as we believe it allows for a different life-cycle management than the current additive approach.

Additionally, as the folks standardizing the web manifest have not yet finalized the design of the specification (and no one fully implements it), the cost of switching models might not be too high if other implementers agree.

As such, we would appreciate your thoughts on which model would be best to pursue (i.e., continue down current "additive" path or take the "authoritative" approach... or maybe some kind of hybrid approach).

Manifest as additive, and its implications

To date, the W3C specification has been written with an assumption that the manifest provides additive metadata about a web application (i.e., a collection of web pages). It is additive in the sense that it overrides, amends, or works in concert with metadata found in a web page.

For instance, it is valid per spec to have a manifest that contains only the following information:

{
   "orientation":  "landscape",
   "display": "standalone",
   "scope":  "/clockapp/",
   "short_name": "Clock"
}

And have that associated with a page, "/clockapp/index.html", in the following manner:

<!doctype html>
<title>The World Clock &mdash; Worldwide</title>
<link rel=manifest href="//cdn.bar.com/manifest.json">
<meta name='application-name' content='World Clock'>
<link rel='icon' href='clock.ico' sizes='16x16 32x32 48x48 64x64'>

As per the current processing rules of the manifest spec, this allows the UA to merge what is declared in the manifest and whatever metadata can be gathered from the DOM of the page from which the web application is being "bookmarked" or "added to home screen".

Combining the raw JSON manifest and the metadata from the web page would yield a "processed manifest" that would look like:

{
   "orientation":  "landscape",
   "display": "standalone",
   "scope":  "/clockapp/",
   "short_name": "Clock",
   "name": "World Clock",
   "icons": [{
       "src": "clock.ico",
       "sizes": "16x16 32x32 48x48 64x64"
    }]
}

At install time, the above processed manifest is used to compose a UI dialog that allows the user to install the application.
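
The merge described above can be sketched roughly as follows. This is an illustration under assumptions, not the spec's processing algorithm: the `PageMetadata` shape and the `processAdditive` function are hypothetical, and the rule used here (manifest members win, page metadata only fills gaps) is inferred from the example.

```typescript
interface Icon {
  src: string;
  sizes?: string;
}

interface Manifest {
  name?: string;
  short_name?: string;
  scope?: string;
  display?: string;
  orientation?: string;
  icons?: Icon[];
}

// Assumed shape for metadata a user agent gathered from the page's DOM.
interface PageMetadata {
  applicationName?: string; // from <meta name="application-name">
  icons?: Icon[];           // from <link rel="icon">
}

// Additive model (sketch): keep every manifest member, and fall back to
// page metadata only for members the manifest does not declare.
function processAdditive(raw: Manifest, page: PageMetadata): Manifest {
  const processed: Manifest = { ...raw };
  if (processed.name === undefined && page.applicationName !== undefined) {
    processed.name = page.applicationName;
  }
  const manifestHasIcons = processed.icons !== undefined && processed.icons.length > 0;
  if (!manifestHasIcons && page.icons !== undefined && page.icons.length > 0) {
    processed.icons = [...page.icons];
  }
  return processed;
}

const processedClock = processAdditive(
  { orientation: "landscape", display: "standalone", scope: "/clockapp/", short_name: "Clock" },
  { applicationName: "World Clock", icons: [{ src: "clock.ico", sizes: "16x16 32x32 48x48 64x64" }] }
);
console.log(JSON.stringify(processedClock, null, 2));
```

Under the authoritative model discussed later, the second argument would simply be ignored.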

Rationale of additive model

The rationale for the current additive design and processing model is to leverage legacy metadata declarations found in existing web content. For instance, research conducted by Mozilla in October 2013 showed that application-name was used in 1,571 sites out of Alexa's top 78,000 sites (2%). Also, link@rel=icon (and favicon.ico) has been quite successful on the Web over the last decade, so the idea was to leverage those resources where possible.

In addition, as is the nature of all Web standards, it was assumed that cross-vendor implementation would be gradual - hence this additive model would allow developers to incrementally transition web page metadata from web pages to manifests over approximately 2-6 years (average time for cross-browser parity is +5-7 years). We are currently on ~year 2.

Pros

This approach fits a "traditional" web development model. A manifest works similarly to, for instance, CSS (in a very loose sense): values from the manifest are "applied" to a page when the application is opened from a user's home screen.
Works with CDNs.
No need for a MIME type.
Allows manifest to work in concert with existing metadata on a page (or group of pages).

Cons

Updating installation details is difficult: if some of the data is derived from the manifest, and the rest was derived from the web page, it can be complicated to update the icons/name/etc. of an installed web application.
Metadata is not authoritative: one app can use another application's metadata (possibly even across domains, CORS allowing).
There is no 1:1 mapping between manifest metadata and HTML5's metadata, so new link/meta types/relationships might need to be specified for fallback to work properly in HTML with new features (e.g., "scope"). This makes the manifest an alternative way of providing HTML meta tags about a page, which raises the question of whether it's worth the trouble to standardize a whole new format just for this metadata when the data could simply be included in a web page.

Manifest as authoritative

The manifest as authoritative means that the manifest serves as the absolute "source of truth" about a web application - making it distinct from metadata found in individual documents of a web application. As such, when processing the manifest, no fallback metadata is gleaned from the Document from which the manifest was derived.

Rationale of authoritative approach

The rationale for the authoritative approach is to make the manifest a useful/standalone resource in its own right: with metadata describing a web app as a whole (all URLs within a defined "scope"), which is separate from metadata describing any single web page from which the manifest might be linked.

This allows a manifest to be used independently of any document that makes up the web application itself (e.g., from a marketplace). This is achieved by restricting the manifest to a particular origin:

having the manifest be same origin provides a light-weight trust mechanism to assert information about an application it hosts.
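
A minimal sketch of such a same-origin check, assuming a consumer (for example a marketplace) has only the manifest URL and the start URL declared in it. The function name and the example hosts are hypothetical.

```typescript
// Sketch: a manifest is treated as trustworthy for an app only if the
// start URL it declares resolves to the manifest's own origin.
function manifestIsSameOriginAsApp(manifestUrl: string, startUrl: string): boolean {
  const manifest = new URL(manifestUrl);
  const start = new URL(startUrl, manifest); // relative start URLs resolve against the manifest
  return start.origin === manifest.origin;
}

console.log(manifestIsSameOriginAsApp("https://clock.example/manifest.json", "/clockapp/"));
// true: the manifest describes content on its own origin

console.log(manifestIsSameOriginAsApp("https://cdn.example/manifest.json", "https://clock.example/clockapp/"));
// false: a manifest hosted on another origin (e.g. a CDN) fails the check
```

The second case is the CDN trade-off listed under the cons of this approach.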

Pros

Manifest URL serves as a "stable" identifier for a web application.
Single "source of truth", making it easier to reason about updates/changes to the manifest: the metadata about the web application itself won't depend on any web page of the web application. This makes it simpler to perform updates, as the complete set of metadata can be gleaned from the manifest instead of the manifest plus an HTML document (as is the case in the additive model).
Marketplace-friendly: a developer can simply submit the link to the manifest to an online store (or even a regular website), and metadata about an application can be derived just from the manifest.

Cons

May require a MIME type. Historically, this has been problematic for developers who don't control their own server setups. For example, HTML had to drop its requirement of a MIME type on appcache manifests because of the number of developers that encountered issues trying to enable a particular type on a server (independently of the other problems inherent with appcache).
Breaks the ability to use manifests on a CDN. This could be a problem for many sites that rely on CDNs for static content that are held at other origins.
Might restrict customization and localization of the manifest - for instance, serving the right manifest to a user after he or she logs into a site.

marcoscaceres · 2015-04-30T15:26:59Z

Ok, so, I think the only sensible compromise position is:

Make manifest metadata authoritative (a user agent ignores a page's meta tags): this gives us the ability to perform updates, etc. reliably without relying on the document from which the page was installed.
Make only CORS-enabled fetches of the manifest the default, as per "Obtaining a Manifest should follow usual CORS rules with credentials" (#353). This allows cross-origin fetches, but gives content authors the ability to prevent other sites from using their manifests without permission.

Also, protection against XSS attacks is provided by manifest-src, so evil.com won't be able to inject itself into good.com.

marcoscaceres · 2015-04-30T15:33:31Z

@benfrancis, @sicking, @jmajnert, @kenchris, @PaulKinlan, @anssiko, @mounirlamouri, @alxlu, @slightlyoff, agree with #272 (comment)?

benfrancis · 2015-04-30T15:47:20Z

Make manifest metadata authoritative (a user agent ignores a page's meta tags): this gives us the ability to perform updates, etc. reliably without relying on the document from which the page was installed.

Yes.

Make only CORS-enabled fetches of the manifest the default, as per #353. This allows cross origin fetches, but provides content authors the ability to prevent other sites using their manifests without permission.

I think you keep misunderstanding the problem people are talking about here. The problem is not other people using your manifest for their own content (what use would that be to them?). It's other people re-packaging your content as an app by creating their own manifest for your content and showing ads in splashscreens, changing the start_url for phishing purposes or selling it in an app store etc.

This is why I think the solution needs to be on the app content end, not the manifest end, and is why I suggested the idea of using the CSP header to determine whether to render a page.

benfrancis · 2015-04-30T15:49:25Z

(it's similar to the phishing problem that X-Frame-Options solves, which is why I was exploring a similar solution)

marcoscaceres · 2015-04-30T15:53:19Z

It's other people re-packaging your content as an app by creating their own manifest for your content and showing ads in splashscreens, changing the start_url for phishing purposes or selling it in an app store etc.

I don't understand how that is even possible with the current spec. Can you show how you would do that, concretely, with, say, IRCCloud?

marcoscaceres · 2015-04-30T15:55:44Z

(you lose 10 points if you say "marketplace")

benfrancis · 2015-04-30T16:04:00Z

I don't understand how that is even possible with the current spec. Can you show how you would do that, concretely, with, say, IRCCloud?

The truth is that it mostly isn't a problem if you assume that web apps are only ever installed from a page of the app, which is the assumption the spec makes. A side effect of this is that the manifest is not a trustable resource in its own right, it can only be used in conjunction with a page of the app. This is why I'm pushing for an answer on whether installing from an app store is considered a valid use case of a web manifest. For example:

An evil developer creates a manifest at http://evil.com/manifest.json which has a start_url of http://irccloud.com/index.html
They submit the URL http://evil.com/manifest.json to the Firefox Marketplace or Windows Store to be featured as an app, costing $1.
A user installs the app from the app store, without reference to any page of the app
The evil developer changes the start_url of the manifest to http://evil.com/login.html
The user updates the app, launches it and logs into what they think is IRCCloud
The evil developer puts an ad in the splash screen of the app suggesting the user try out the new and improved product at evil2.com
The evil developer has $1, the user's username and password, and has them using their new evil2 product

As I understand it this was basically the rationale for the same-origin restriction on Firefox Apps. Whether or not this is important for web manifest depends largely on whether installing web apps from an app store, or using the manifest as a useful resource independently of a web page it might be referenced from, are considered valid use cases.

marcoscaceres · 2015-04-30T16:36:10Z

The truth is that it mostly isn't a problem if you assume that web apps are only ever installed from a page of the app, which is the assumption the spec makes.

Yes, which is exactly why I've never understood what the hell you people were talking about :)

A side effect of this is that the manifest is not a trustable resource in its own right, it can only be used in conjunction with a page of the app. This is why I'm pushing for an answer on whether installing from an app store is considered a valid use case of a web manifest.

Not for this spec. No.

For example:

An evil developer creates a manifest at http://evil.com/manifest.json which has a start_url of http://irccloud.com/index.html

It can't do that. This is already banned.

They submit the URL http://evil.com/manifest.json to the Firefox Marketplace or Windows Store to be featured as an app, costing $1.

-10 points (you were warned! :)).

As I understand it this was basically the rationale for the same-origin restriction on Firefox Apps. Whether or not this is important for web manifest depends largely on whether installing web apps from an app store,

It's not. The assumption is that you install at the application site, not from an app store.

or using the manifest as a useful resource independently of a web page it might be referenced from, are considered valid use cases.

This one is, but only in relation to performing updates of icons, etc.

benfrancis · 2015-04-30T16:50:55Z

Not for this spec. No.

OK, I'm fine with that. But are the Firefox Marketplace team, Microsoft and Crosswalk OK with that?

marcoscaceres · 2015-04-30T17:29:12Z

OK, I'm fine with that. But are the Firefox Marketplace team, Microsoft and Crosswalk OK with that?

Hence the ping to everyone. Note that we ripped the manifest out of the Sysapps Working Group to make it work with "The Web" (:tm:) - and not with marketplaces on purpose. Marketplaces have their own set of requirements which are incompatible with this specification.

If that's now changing again, this should bounce back to SysApps (at which point I would hand over the editorial reins to people who better understand the requirements around marketplaces, etc.).

benfrancis · 2015-04-30T17:41:46Z

OK, let's wait for feedback from others on whether the app store use case is essential to them.

In the meantime...

How does the current spec deal with this scenario?

A web app at foo.com has a manifest at http://foo.com/manifest.json which references a start URL of http://foo.com/index.html
A user installs the app from http://foo.com/page2.html which is allowed because it's the same origin as http://foo.com/index.html
The owner of foo.com changes the start_url in the manifest to http://bar.com/index.html
The user agent updates the app by "periodically checking if the contents of the manifest have been modified"
The user launches the app and it starts at http://bar.com/index.html

Doesn't this bypass the mechanism which is supposed to ensure that the start URL is same-origin with the page the app was installed from?

marcoscaceres · 2015-04-30T19:35:53Z

Doesn't this bypass the mechanism which is supposed to ensure that the start URL is same-origin with the page the app was installed from?

No. The start URL is resolved and forced to be same origin with the page the app was installed from. If that fails, you get the Document URL. So, to update, you need to keep a record of the page where you installed from.
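
A sketch of that rule applied to the foo.com/bar.com scenario, not the spec's actual algorithm. The function name `resolveStartUrl` is hypothetical, and resolving start_url against the manifest URL is an assumption made for this example.

```typescript
// Sketch: resolve start_url, require it to be same origin with the document
// the app was installed from, and otherwise fall back to that document's URL.
function resolveStartUrl(
  startUrl: string | undefined,
  manifestUrl: string,
  installDocumentUrl: string
): string {
  if (startUrl === undefined) {
    return installDocumentUrl;
  }
  let resolved: URL;
  try {
    resolved = new URL(startUrl, manifestUrl);
  } catch (_err) {
    return installDocumentUrl; // unparseable start_url: fall back
  }
  if (resolved.origin !== new URL(installDocumentUrl).origin) {
    return installDocumentUrl; // cross-origin start_url: fall back
  }
  return resolved.href;
}

const manifestUrl = "http://foo.com/manifest.json";
const installedFrom = "http://foo.com/page2.html"; // must be remembered for updates

// Before the owner's change: same origin, so the declared start URL is used.
console.log(resolveStartUrl("http://foo.com/index.html", manifestUrl, installedFrom));
// http://foo.com/index.html

// After the change to bar.com: the check fails and the Document URL is used.
console.log(resolveStartUrl("http://bar.com/index.html", manifestUrl, installedFrom));
// http://foo.com/page2.html
```

This is why the install document's URL has to be stored alongside the manifest URL: without it, an update could not repeat the same-origin check.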

jmajnert · 2015-05-01T15:10:39Z

Make manifest metadata authoritative (a user agent ignores a page's meta tags): this gives us the ability to perform updates, etc. reliably without relying on the document from which the page was installed.

+1. This is IMO the most sensible approach

Make only CORS-enabled fetches of the manifest the default, as per #353. This allows cross origin fetches, but provides content authors the ability to prevent other sites using their manifests without permission.

+1. As @benfrancis noted, this doesn't solve the rogue-app-store scenario in which the manifest is the only source of information about the app. IMHO, a sensible app store would validate such an app submission by visiting the app's site and checking, for example, whether the app links to the same manifest.

jmajnert · 2015-05-01T15:20:21Z

There was once a discussion on the workflow of installing an app from the app store. From what I remember:

The app store digests the manifest (submitted or found by crawling the web). Nothing stops the app store from validating that the manifest is not malicious (e.g., by visiting the start URL and checking the original manifest, if it exists).
When the user chooses to install an app from such a store (they click the "Install" button), they are taken to the start_url and a normal installation flow from the UA is performed.

For "special" app stores, like FxOS marketplace or Xwalk store, it's up to the store to validate the manifests and provide a special installation API if they wish to have their own installation UX

alxlu · 2015-05-01T21:40:50Z

Make manifest metadata authoritative (a user agent ignores a page's meta tags): this gives us the ability to perform updates, etc. reliably without relying on the document from which the page was installed.

I agree with this too.

An evil developer creates a manifest at http://evil.com/manifest.json which has a start_url of http://irccloud.com/index.html
They submit the URL http://evil.com/manifest.json to the Firefox Marketplace or Windows Store to be featured as an app, costing $1.
A user installs the app from the app store, without reference to any page of the app
The evil developer changes the start_url of the manifest to http://evil.com/login.html
The user updates the app, launches it and logs into what they think is IRCCloud
The evil developer puts an ad in the splash screen of the app suggesting the user try out the new and improved product at evil2.com
The evil developer has $1, the user's username and password, and has them using their new evil2 product

Can't a developer already do something worse than this?

A malicious developer submits an app with a WebView pointing to foo.com
foo.com automatically redirects the user to http://irccloud.com/index.html
A user installs the app from the Store.
The malicious developer then changes foo.com to become malicious.
The user launches the app (and doesn't even have to update it), and logs into what they think is IRCCloud.

marcoscaceres · 2015-05-04T23:07:21Z

Ok, so I'm going to make manifest metadata authoritative and enable CORS by default. I think it's a fair compromise and will allow us to move forward.

jmajnert · 2015-05-05T07:06:06Z

Ok, so I'm going to make manifest metadata authoritative and enable CORS by default. I think it's a fair compromise and will allow us to move forward.

+1

marcoscaceres added the question label Nov 11, 2014

marcoscaceres mentioned this issue Dec 20, 2014

manifest should be queried when app is launched from home screen #292

Closed

marcoscaceres added P1 enhancement and removed question labels Dec 20, 2014

marcoscaceres changed the title ~~What is the identity of an app?~~ Define identity of a web app. Dec 20, 2014

marcoscaceres added a commit that referenced this issue Dec 30, 2014

Define identity of a web app (closes #272)

1ff6767

marcoscaceres added a commit that referenced this issue Dec 30, 2014

Define identity of a web app (closes #272)

dc34c95

marcoscaceres added a commit that referenced this issue Jan 1, 2015

fixed link to URL concept of #272

c816d8b

marcoscaceres added a commit that referenced this issue Jan 9, 2015

Define identity of a web app (closes #272)

bb388e6

marcoscaceres added a commit that referenced this issue Jan 9, 2015

Merge branch 'identity' of github.com:w3c/manifest into identity

465f950

* 'identity' of github.com:w3c/manifest: Define identity of a web app (closes #272) Conflicts: index.html

marcoscaceres added a commit that referenced this issue Jan 9, 2015

# This is a combination of 3 commits.

86433b3

# The first commit's message is: Define identity of a web app (closes #272) # The 2nd commit message will be skipped: # Fixup # The 3rd commit message will be skipped: # Fixup

marcoscaceres closed this as completed in a2e8c31 Jan 11, 2015

marcoscaceres pushed a commit that referenced this issue Jan 11, 2015

Merge pull request #299 from w3c/id

165cf75

Define identity of a web app (closes #272)

marcoscaceres reopened this Jan 13, 2015

marcoscaceres added a commit that referenced this issue May 5, 2015

Make manifest authoritative + allow CORS (closes #376, #375, #360, #330…

f50f2ce

…, #351, #272)

marcoscaceres added a commit that referenced this issue May 5, 2015

Make manifest authoritative + allow CORS (closes #376, #375, #360, #330…

e57aa0e

…, #351, #272)

marcoscaceres added a commit that referenced this issue May 5, 2015

Make manifest authoritative + allow CORS (closes #376, #375, #360, #330…

4145171

…, #351, #272)

marcoscaceres pushed a commit that referenced this issue May 5, 2015

Merge pull request #377 from w3c/compromise

2d99189

Make manifest authoritative + allow CORS (closes #376, #375, #360, #330,#351, #272)

marcoscaceres mentioned this issue May 5, 2015

Make manifest authoritative + allow CORS (closes #376, #375, #360, #330,#351, #272) #377

Merged

marcoscaceres closed this as completed May 5, 2015

benfrancis mentioned this issue Jul 7, 2016

Ability to claim web app #476

Closed
