Permissioning layer for match keys #39

csharrison · 2023-02-28T17:26:48Z

Currently in IPA, a match key set by a site can be used by any other site. There is no mechanism in the system where a site could choose to keep a match key to itself. I have serious concerns about this.

In practical terms, to use IPA to achieve cross-device attribution, match keys would likely be derived from PII. This means there is a tension between using IPA to its fullest, and pervasively sharing your device graph / PII-derived user data with the rest of the web ecosystem.

Like most other APIs that store user data, IPA should abide by the Same Origin Policy by default, and not expose read access to match keys, even in an encrypted form. If sites want to share their match keys with others, we can support selectively exposing access in an opt-in way with a permissioning system. This could look something like a set policy declared at setMatchKey time:

setMatchKey(<key>, {
  exposeToOrigins: ['https://foo.com', 'https://bar.*.com']
})

Where we could support pattern matching using the URLPattern API infrastructure, which will let a provider allow a specific set of report collector origins (or everyone with "*") to consume their match keys.
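As a rough sketch of how such a check could behave, assuming a same-origin default plus an opt-in `exposeToOrigins` list: the function names below are hypothetical and not part of the IPA proposal, and a simplified wildcard matcher stands in for the URLPattern infrastructure.

```javascript
// Hypothetical sketch: these names are illustrative, not part of the IPA proposal.
// A simplified wildcard matcher stands in for URLPattern.

// Convert an origin pattern like 'https://bar.*.com' into an anchored RegExp.
// Each '*' matches any run of characters other than '/'.
function originPatternToRegExp(pattern) {
  const escaped = pattern
    .split('*')
    .map((part) => part.replace(/[.+?^${}()|[\]\\]/g, '\\$&'))
    .join('[^/]*');
  return new RegExp(`^${escaped}$`);
}

// Decide whether a report collector may consume a provider's match key.
function canConsumeMatchKey(providerOrigin, policy, collectorOrigin) {
  const collector = new URL(collectorOrigin).origin;
  // Same-origin access is always allowed.
  if (collector === new URL(providerOrigin).origin) return true;
  // With no policy, nothing is exposed cross-origin.
  const patterns = (policy && policy.exposeToOrigins) || [];
  return patterns.some(
    (p) => p === '*' || originPatternToRegExp(p).test(collector)
  );
}

const policy = { exposeToOrigins: ['https://foo.com', 'https://bar.*.com'] };
console.log(canConsumeMatchKey('https://provider.example', policy, 'https://foo.com'));      // true
console.log(canConsumeMatchKey('https://provider.example', policy, 'https://bar.eu.com'));   // true
console.log(canConsumeMatchKey('https://provider.example', policy, 'https://evil.example')); // false
```

The point of the sketch is the default: with no policy at all, only the provider's own origin can read the match key.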

Additionally, we could support dynamic, just-in-time permissions, e.g.

getMatchKey could automatically grant permission to a report collector if the API was called within a document whose origin matches the provider.
If we support an HTTP API for getting match keys (see "Match keys without JavaScript (for browser implementations)", #25), we could allow a provider to redirect to a report collector and append its match key.
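The first of these dynamic options could be modeled roughly as follows. This is only a sketch under assumptions: the class and method names are hypothetical, and it assumes that a grant, once made, persists for later calls from other documents.

```javascript
// Hypothetical sketch: names and shapes are illustrative, not part of the IPA proposal.

// Tracks which (provider, report collector) pairs have been granted access.
class MatchKeyGrants {
  constructor() {
    this.grants = new Set();
  }

  static pair(providerOrigin, collectorOrigin) {
    return `${providerOrigin} -> ${collectorOrigin}`;
  }

  // Models a getMatchKey call made in a document with origin `documentOrigin`.
  // A call from the provider's own document grants the collector access just in time.
  // Returns whether the collector may consume the provider's match key.
  onGetMatchKey(documentOrigin, providerOrigin, collectorOrigin) {
    const pair = MatchKeyGrants.pair(providerOrigin, collectorOrigin);
    if (documentOrigin === providerOrigin) {
      this.grants.add(pair);
    }
    // Assumption: a grant persists once made.
    return this.grants.has(pair);
  }
}

const grants = new MatchKeyGrants();
const provider = 'https://provider.example';
const collector = 'https://collector.example';

// Called on an unrelated site before any grant exists: denied.
console.log(grants.onGetMatchKey('https://shop.example', provider, collector)); // false
// Called within the provider's own document: grants and allows.
console.log(grants.onGetMatchKey(provider, provider, collector)); // true
// Later call on the unrelated site: allowed because the grant persists.
console.log(grants.onGetMatchKey('https://shop.example', provider, collector)); // true
```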

These changes would allow sites to use IPA without fear of leaking their user’s data to parties outside their control.

Note: We might need something more sophisticated if we want match key setting to persist beyond a single browser / app (e.g. storage mediated by an operating system), but I think we can discuss that later on.

benjaminsavage · 2023-03-15T06:08:58Z

Before responding to the suggestion about permissioning match keys itself, I'd like to more fully understand your concerns with the current proposal.

You said:

In practical terms, to use IPA to achieve cross-device attribution, match keys would likely be derived from PII. This means there is a tension between using IPA to its fullest, and pervasively sharing your device graph / PII-derived user data with the rest of the web ecosystem

And later said:

These changes would allow sites to use IPA without fear of leaking their user’s data to parties outside their control.

Could you please flesh out in more concrete terms what exactly the threat / risk is?

You've alluded to "pervasively sharing your device graph / PII-derived user data with the rest of the web ecosystem", but I do not understand what you're referring to. In the current proposal, I do not see any way for another site to learn any of the following:

Any value of any match key
Any PII
Any way to link one record to a record from another site
The number of devices / browsers utilizing the same match key
Any metadata of any use or interest about the user/device graph of the match key provider

Maybe I'm missing something and there is a specific attack vector I haven't considered which allows an attacker to exfiltrate one or more of these pieces of information? If so, could you outline the attack?

As it is, the phrase "leaking their user’s data to parties outside their control" sounds very scary, but I just don't see that reflected in fact.

csharrison · 2023-03-15T14:30:59Z

There are a few pieces here:

Sites may not want to expose their user's data to others in the event of a security compromise. Obviously in the event of a security compromise of the helper system (e.g. the keys are leaked), all privacy for the users is gone. It is worse for this to be a major security event for match key providers who have exposed user data to a possibly unbounded number of third parties.
Relatedly, some company policies consider encrypted data "user data" and treat it similarly to cleartext data in terms of limiting the scope of its sharing.
It may be possible to leak aggregate proprietary information about a competitor using the IPA system. While I don't have an exhaustive list of techniques for how to do this, IPA also has no formal protections against it (nothing comparable to, e.g., DP, which offers formal protection for users' privacy against any possible attack). Rough attacks along these lines could involve comparing a competitor's match key performance against a same-device match key, or using an auxiliary system that uses PII joins to generate output. I also want to note that we expect IPA to evolve its capabilities over time to support new use cases, and if this is something we want to prevent, it would possibly constrain this future innovation (e.g. using IPA for reach reporting).

cc @michaelkleber

csharrison mentioned this issue Mar 27, 2023

Standards Position of the Chrome Privacy Sandbox Measurement Team #59

Open

bmcase mentioned this issue Apr 4, 2023

IPA Biweekly Meeting Agenda Request - Chrome Standards Position and open concerns #62

Closed

csharrison mentioned this issue Apr 11, 2023

Agenda Request - Cross device attribution options patcg/meetings#115

Closed
