Existing intervention: user gesture required for sensitive operations #12

RByers · 2016-03-11T21:22:17Z

Chromium has a notion of a "user gesture" which indicates that we believe the user is explicitly interacting with the page (eg. mouse click, but not mouse move or wheel). Then certain sensitive operations are restricted to apply only if it can "consume" a user gesture (eg. one successful window.open call per mousedown/up/click sequence). Some of this maps to the pop-up blocking algorithm in the HTML5 spec. But I'm not sure how well spec'd the details are, what tests exist, and how much interoperability there is between browsers on this. Perhaps we should try to expand this issue with references / details?

Here's a (mostly complete) list of the things that require a user gesture in chromium:

Allowing pop-ups
Going full screen (requestFullscreen)
Writing to the clipboard
Some scenarios of form submission and requestAutocomplete
PresentationRequest::start
Enabling mouse lock
Various similar operations inside of plugins (fullscreen, mouse lock, etc.)
Buffering aggressively when media is paused, and potentially auto-playing media
‘color’ and ’file’ input types responding to an activation event
Showing an IME (eg. on screen keyboard) on element focus (permitted anytime after a user gesture has occurred since page load)
WebBluetooth and WebUSB requestDevice (experimental)

toddreifsteck · 2016-04-13T19:17:54Z

Microsoft Edge has found that the "user input flag" is flowing to some types of callbacks on mobile which was causing significant interop issues due to the lack of a public spec and agreement.

(My personal theory is that the history was to make video autoplay "work" for libraries originally built/tested on desktop, but I'll defer to Chrome experts who will be more familiar with the history.)

We’ve observed it flowing through all of the following in Chrome on Android:

setTimeout
setInterval (the 1st interval, but not any future intervals)
window.postMessage

We have observed it does not flow for:

Promises
RAF

Microsoft Edge's position is that the user flag should either flow to all callbacks OR should be blocked for all callbacks.

We are actively implementing a fix in Edge 14 in internal builds to flow the user input flag to setTimeout/setInterval/setImmediate to unblock a few sites that have issues

RByers · 2016-04-13T20:38:04Z

Interesting, thanks! Can you give us some data on which sites are affected by this? If Edge has never needed this before, then perhaps it's not worth the complexity and Chrome should just change to be simpler too?

What about for pop-up blocking - do you use a similar algorithm? Does it flow across setTimeout?

jeisinger · 2016-04-14T12:11:47Z

Sadly, the "User gesture" concept is not well defined. In WebKit and Blink, we implemented forwarding of the "user gesture" state to the first level of setTimeout calls with a 1s timeout, i.e. if a setTimeout handler invokes setTimeout again, the user gesture won't be forwarded twice, and if the timeout is >1s it won't be forwarded either.

We don't always forward the user gesture via postMessage - it is not forwaded across processes.

I agree that promises could forward the gestures, but why RAF?

What about stuff like XHR events (or IDB events etc.)

In general, the user gesture thing is a bit tricky to handle, as it has this 1s timeout, so if your XHR doesn't come back in time, you'd have lost the gesture. Not exactly developer friendly :(

domenic · 2016-05-31T22:19:00Z

So the spec defines this currently: https://html.spec.whatwg.org/multipage/browsers.html#allowed-to-show-a-popup

I have filed two issues on the spec related:

Change the name to something more general: Editorial: "allowed to show a popup" → "triggered by user activation" whatwg/html#1357
The list of triggering events seems too small: Events list that trigger "allowed to show a popup" seems too small whatwg/html#1358

The latter in particular could use implementer feedback on whether the spec aligns with implementations or not.

jeisinger · 2016-06-02T08:49:54Z

Should we also spec that certain operations destroy a usr gesture (opening a window in chrome does that).

RByers · 2016-06-02T15:51:38Z

This is a good improvement, thanks @domenic!

There's definitely a variety of ways implementation doesn't match the spec here. I'd like Microsoft's (eg @toddreifsteck's) input so let's discuss those details here.

Yes the list of triggering events is too small (eg. should also contain keydown, mousedown), but it's more complex than that - there's not a 1:1 mapping from event to gesture. For example, on a mousedown mousemove* mouseup sequence we take a single UserGesture - so you can open exactly one pop-up from any of those listeners (not one pop-up per movement). What complexity is actually required here for web compat / good user experience is really hard to say - I'd look to Edge's experience (trying to be compatible with Chrome). If they've got examples where they have been successful with something simpler, I'd be open to trying to change Chrome to match.

RByers · 2016-10-06T23:27:26Z

As part of rationalizing this intervention, we should really also expose an API indicating whether a user gesture is currently in progress. Eg. @dvoytenko has a scenario in AMP that is really no different than the built-in browser scenarios - an untrusted iframe does a postMessage to the main document requesting an action they only want to do in response to a user actually interacting with the frame. I'd argue we should just expose some simple userActivationInProgress bit somewhere.

dvoytenko · 2016-10-07T01:19:30Z

Yes, our security model is that we typically allow more changes to an AMP document if we can confirm user action. For instance, we only allow iframes to resize themselves on user action. If we didn't, the page would jump and auto-risize itself without any constraints thus completely obliterating user experience. There are many other features that are only allowed on user action. Currently, we polyfill this functionality via focused state and soon we will also deploy polyfill based on clipboard. But these are not ideal.

greggman · 2018-05-23T08:18:12Z

I'm not sure where to bring this up but, speccing which gestures. How about the drag and drop events? There are pages that say "drop an mp3 here" and they'd like to load and play the sound the moment the mp3 is dropped.

domenic · 2022-04-01T21:00:30Z

It's amazing coming back to this repository and issue and recalling that at one time, our user activation concept was called "allowed to show a popup" and only applied to window.open()!

These days we have a well-defined concept of user activation. (Well, three-ish, actually: user activation consumption, transient user activation checking, and sticky user activation checking.) And it's used by pretty much everything Rick lists in the original post here, with the exception of showing the IME (not really specced anywhere) and some stuff that died (requestAutocomplete(), plugins). Big kudos to @mustaqahmed for all the work on that over the years.

So we'll close out this issue, as part of the larger project of archiving this repository (#72). As soon as I get write access to this repository.

RByers mentioned this issue Mar 16, 2016

Stricter user gestures for touch #13

Closed

ojanvafai added the NeedsSpecWork label Apr 22, 2016

RByers mentioned this issue Jun 2, 2016

Events list that trigger "allowed to show a popup" seems too small whatwg/html#1358

Closed

georgwaechter mentioned this issue Aug 18, 2016

Video transition example not working in Chrome for Android bbc/VideoContext#14

Closed

RByers mentioned this issue Oct 27, 2016

Add some mechanism to know that a message event was triggered by user activation whatwg/html#1983

Closed

RByers mentioned this issue Jan 5, 2017

Add a new 'allow-top-navigation-by-user-activation' flag to iframe sandbox to require a user activation for top-level page navigation #42

Closed

domenic mentioned this issue Jan 26, 2017

Make "triggered by user activation" match browser behavior whatwg/html#1903

Closed

machenmusik mentioned this issue Aug 29, 2017

redo #2991: extend PR #2985 (moved canvas init) to also solve issue #2967 (vr2vr traversal in Oculus Browser) aframevr/aframe#2996

Closed

rajsite mentioned this issue Oct 24, 2018

Lack of atomic.wait on the main thread seems limiting to a fault WebAssembly/threads#106

Closed

domenic closed this as completed Apr 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Existing intervention: user gesture required for sensitive operations #12

Existing intervention: user gesture required for sensitive operations #12

RByers commented Mar 11, 2016 •

edited

Loading

toddreifsteck commented Apr 13, 2016

RByers commented Apr 13, 2016

jeisinger commented Apr 14, 2016

domenic commented May 31, 2016

jeisinger commented Jun 2, 2016

RByers commented Jun 2, 2016

RByers commented Oct 6, 2016

dvoytenko commented Oct 7, 2016

greggman commented May 23, 2018

domenic commented Apr 1, 2022 •

edited

Loading

Existing intervention: user gesture required for sensitive operations #12

Existing intervention: user gesture required for sensitive operations #12

Comments

RByers commented Mar 11, 2016 • edited Loading

toddreifsteck commented Apr 13, 2016

RByers commented Apr 13, 2016

jeisinger commented Apr 14, 2016

domenic commented May 31, 2016

jeisinger commented Jun 2, 2016

RByers commented Jun 2, 2016

RByers commented Oct 6, 2016

dvoytenko commented Oct 7, 2016

greggman commented May 23, 2018

domenic commented Apr 1, 2022 • edited Loading

RByers commented Mar 11, 2016 •

edited

Loading

domenic commented Apr 1, 2022 •

edited

Loading