Address real-world object detection #5

johnpallett · 2018-09-01T01:15:06Z

An explainer should outline user privacy and security concerns (particularly threat vectors) when sites have the ability to detect real-world objects, including planar image detection. An explainer should additionally explore approaches to mitigating those concerns.

johnpallett · 2018-09-01T01:20:12Z

A few thoughts and considerations from our team:

Generally, this type of detection might include 3D object detection (e.g. a dining room table) or 2D planar image detection (e.g. a poster of Rick Astley)
Users may not know what objects the site is looking for.
Site may gain information about what objects or images are present in the scene
- E.g. Detecting that there is a $100 bill
- E.g. Detecting that the user has valuable furniture
- E.g. Detecting a storefront sign
Examples of threat vectors
- Look for compromising objects in the scene
  - e.g. embarrasing imagery using planar image tracking
- Look for valuable objects (particularly in conjunction with location data)
  - A bad actor may look for expensive computers, furniture, decor, and profile users to find candidates for real-world burglary
- Identify the location of the user
  - Storefront signs may allow sites to identify where user is in public places
Limiting the number of objects or sites that the site can query could help mitigate threats
Even in a declarative model (such as “place this 3D model on a $100 bill”) if the site is aware that the object is placed, the site will learn that a $100 bill is present in the user’s environment

NellWaliczek · 2018-09-04T19:10:58Z

This is an interesting concern. It's worth noting, again, that this could be built in a polyfill with a camera. In fact, there are already libraries today which can identify the presence of objects (not necessarily their exact position) using the camera.

blairmacintyre · 2018-09-04T19:33:59Z

We haven't really talked at all about platform level object detection / tracking (aside from talking about how we should talk about it).

This is a bit reason why I'm such a "squeaky wheel" about camera permissions. It is clear that if we hand video frames to Javascript, "all bets are off". Video frames (with relative device pose information) can be sent off to the cloud to be analyzed at leisure. We should assume that will happen. So the converse (we should be able to do WebXR without giving camera frames) seems super-important to me: in fact, it feels to me like "a thing that the web could do that native platforms aren't going to do any time soon".

It may be the case that platforms provide "thing" detection and tracking (objects, images, etc) without giving access to the camera; the real advantage of such capabilities is likely performance (both CPU/GPU, and battery) than privacy or security (e.g., if I can do image tracking, I can look for signs, or even faces, as you suggest). Perhaps some things (like a fixed image/marker set, or something like QR codes) might be reasonably "safe".

Overall, I find it hard to imagine that these sorts of features wouldn't come with a "dire warning" (akin to camera access).

peterclemenko · 2018-09-12T08:37:19Z

QR codes can be a malware vector. Be very careful with accepting arbitrary QR codes.

For facial recognition, as I said in another issue, I feel the best option would be to do a match for things that look like faces in the browser and opt in to allowing it to be parsed with JS or sent. This could allow for censoring faces from being sent to the application/server at a browser level without consent.

blairmacintyre · 2018-09-12T14:49:06Z

QR codes can be a malware vector. Be very careful with accepting arbitrary QR codes.

In this context, which I probably should have been clearer on, they are data, not "urls" to be loaded: they might be URLs, of course (my "app" might pull the appropriate bits off the end of an appropriate URL, and ignore the data in others).

The point is that they are a recognizable 2D images, that can contain some data, and have a simple structure that can be robustly tracked in 3D.

blairmacintyre · 2018-09-12T14:53:58Z

For facial recognition, as I said in another issue, I feel the best option would be to do a match for things that look like faces in the browser and opt in to allowing it to be parsed with JS or sent. This could allow for censoring faces from being sent to the application/server at a browser level without consent.

I think we need to be careful to distinguish between "facial detection and tracking" (i.e., what ARKit on the iPhoneX does), and "recognition". It is easy to imagine safe ways of doing facial detection and tracking to provide useful capabilities (essentially, almost everything you see Apple promote with the X), without enabling recognition (if the app doesn't have access to the camera bits.).

But, your point is interesting too; I've seen security researchers suggest similar things in the past, where we detect things in video frames that we don't want code to have access to, and hide it. A program can tell a face was there, but that it was removed (because, for example, there's a big hole in the data, or it's been replaced by a fixed facial image ... everyone ends up looking like Deadpool on phone!)

avadacatavra · 2018-09-13T11:48:52Z

We also need to prevent obscuring important real world data (e.g. placing an XR object in front of a stop sign/danger: cliff/etc)

peterclemenko · 2018-09-17T13:01:14Z

Agreed with both of you. We need to prevent obscuring important data, but we also need to prevent the possibility of data gathering based on the data provided that can deanonymize users and those around them. With the sensor data being gathered, one of my worst nightmares is that XR is used to create an Orwellian nightmare, We need to implement methods to prevent deanonymization. I feel this requires further R&D to tell if this can be used to gather that kind of data and effective attack/prevention methods.

johnpallett · 2018-10-08T23:13:30Z

I've added a section on this topic to the privacy & security explainer to address this issue (#14). Can you please take a look and provide feedback? I'll plan on closing this issue (and merging the PR) at the end of the week.

blairmacintyre · 2018-10-09T19:47:04Z

One comment I had in the document is that much of the mitigations will assume that the app is running at a permission level that does not give full video/sensor access to the apps. This may be in contrast to the native APIs, which assume full access.

This may be obvious, but it also should be kept in mind, because there may be mitigations that limit some use cases, where those use cases may work when full access has been given.

johnpallett · 2018-10-12T21:02:00Z

I've just committed the PR #14 to address this issue. Thanks all!

blairmacintyre mentioned this issue Sep 12, 2018

Address site access to real-world geometry data #4

Closed

johnpallett mentioned this issue Oct 8, 2018

Explainer.md += [Image,Object] Detection #14

Merged

johnpallett closed this as completed Oct 12, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Address real-world object detection #5

Address real-world object detection #5

johnpallett commented Sep 1, 2018

johnpallett commented Sep 1, 2018

NellWaliczek commented Sep 4, 2018

blairmacintyre commented Sep 4, 2018

peterclemenko commented Sep 12, 2018

blairmacintyre commented Sep 12, 2018

blairmacintyre commented Sep 12, 2018

avadacatavra commented Sep 13, 2018

peterclemenko commented Sep 17, 2018 •

edited

johnpallett commented Oct 8, 2018

blairmacintyre commented Oct 9, 2018

johnpallett commented Oct 12, 2018

Address real-world object detection #5

Address real-world object detection #5

Comments

johnpallett commented Sep 1, 2018

johnpallett commented Sep 1, 2018

NellWaliczek commented Sep 4, 2018

blairmacintyre commented Sep 4, 2018

peterclemenko commented Sep 12, 2018

blairmacintyre commented Sep 12, 2018

blairmacintyre commented Sep 12, 2018

avadacatavra commented Sep 13, 2018

peterclemenko commented Sep 17, 2018 • edited

johnpallett commented Oct 8, 2018

blairmacintyre commented Oct 9, 2018

johnpallett commented Oct 12, 2018

peterclemenko commented Sep 17, 2018 •

edited