Feature request: run puppeteer in the browser #2119

Janpot · 2018-02-28T14:00:15Z

Just trying to feel the water here, but it seems to me that apart from downloading chrome and launching a browser, puppeteer isn't really doing anything that can't be done in a browser. I'm thinking puppeteer.connect() in a webpage. Would there be any interest in supporting this? Am I overlooking any barriers that are in the way of achieving this? I can probably make some time to look into it.

The text was updated successfully, but these errors were encountered:

aslushnikov · 2018-04-11T00:28:50Z

@Janpot this definitely sounds interesting. I don't think it makes sense to have it as a part of this repository - it deserves a separate project.

Do you have any success with this?

Janpot · 2018-04-11T07:59:48Z

I tinkered a bit with it a month ago. I think it's feasible to create a build of puppeteer that runs in the browser but I haven't picked it back up. I think a separate project would only complicate things as the goal would be to not add code at all, just refactor here and there and add a build target.

aslushnikov · 2018-04-11T17:13:00Z

I think a separate project would only complicate things as the goal would be to not add code at all, just refactor here and there and add a build target.

This would be ideal. @JoelEinbinder had a prototype as well and refactored pptr codebase to simplify things there. However, iirc he still needed to mock fs and some other node modules we use occasionally.

Janpot · 2018-04-12T22:20:27Z

@aslushnikov Ok, so I did a quick and very very dirty test again, building puppeteer with browserify. To make it work:

I used exposify to shim require('ws') with WebSocket, and './BrowserFetcher' with null
I changed ChromiumRevision in Launcher.js and set it to null
I removed the second argument in new WebSocket(url) inConnection.js

I shimmed WebSocket with

  WebSocket.prototype.on = function (eventName, handler) {
    WebSocket.prototype.addEventListener.call(this, eventName, ({data}) => handler(data));
  }

Then with running browserless locally (docker run -p 3000:3000 browserless/chrome), I was able to make

const puppeteer = require('puppeteer');
puppeteer.connect({ browserWSEndpoint: 'ws://localhost:3000' })
  .then(async browser => {
    const page = await browser.newPage();
    await page.goto('https://example.com');
    console.log(await page.content())
  });

work in the browser.

So to list the main problems:

helpers.projectRoot in ChromiumRevision breaks the browser
WebSocket seems to be used in a browser incompatible way
BrowserFetcher needs to be excluded

Will see if one of these days I can find some time to clean this up a bit, and find better solutions to these problems.

aslushnikov · 2018-04-12T22:35:21Z

Thanks @Janpot for the follow-up

BrowserFetcher needs to be excluded

Right, I'd expect both BrowserFetcher and Launcher to be excluded.

I shimmed WebSocket with

This is an interesting approach. Another option might be implementing Connection class with whatever transport is used to drive pptr in browser.

Janpot · 2018-04-12T22:45:03Z

Launcher seems to be needed to be able to connect. As for the shimming it was basically a "make it work as quickly as possible" approach. Will look into more maintainable ways later.

Janpot · 2018-04-13T21:09:04Z

@aslushnikov I added my proof of concept as a PR #2374

elisherer · 2018-04-16T14:40:05Z

The developer tools can overcome some browser security enforcements like CORS.
e.g. Accessing an iframe from a different origin and running javascript code on that frame will probably be blocked if tried from inside the browser, wouldn't it?

aslushnikov · 2018-09-06T13:15:27Z

A nice summary on what's anti-bundleable in pptr was given here: #2245 (comment)

We should:

fix these
bundle puppeteer for web as part of our testsuite to make sure we don't regress the bundle'ability in future

Janpot · 2018-09-06T13:21:22Z

correct me if I'm wrong, but now that puppeteer-core is a thing, I guess trying to bundle puppeteer doesn't make much sense anymore? I think I should rather concentrate on bundling puppeteer-core. I haven't picked this back up again, but I'd assume the browser incompatible thing that's left in that case is the websocket implementation.

aslushnikov · 2018-09-06T14:36:21Z

correct me if I'm wrong, but now that puppeteer-core is a thing, I guess trying to bundle puppeteer doesn't make much sense anymore?

@Janpot: puppeteer-core is the same codebase as puppeteer; but yes, you'd probably want to depend on puppeteer-core.

I'd assume the browser incompatible thing that's left in that case is the websocket implementation.

Also the dynamic imports - I have a promising draft to cleanup these.

@Janpot

This patch removes all dynamic requires in Puppeteer. This should make it much simpler to bundle puppeteer/puppeteer-core packages. We used dynamic requires in a few places in lib/: - BrowserFetcher was choosing between `http` and `https` based on some runtime value. This was easy to fix with explicit `require`. - BrowserFetcher and Launcher needed to know project root to store chromium revisions and to read package name and chromium revision from package.json. (projectRoot value would be different in node6). Instead of doing a backwards logic to infer these variables, we now pass them directly from `//index.js`. With this patch, I was able to bundle Puppeteer using browserify and the following config in `package.json`: ```json "browser": { "./lib/BrowserFetcher.js": false, "ws": "./lib/BrowserWebSocket", "fs": false, "child_process": false, "rimraf": false, "readline": false } ``` (where `lib/BrowserWebSocket.js` is a courtesy of @Janpot from puppeteer#2374) And command: ```sh $ browserify -r puppeteer:./index.js > ppweb.js ``` References puppeteer#2119

@Janpot

This patch removes all dynamic requires in Puppeteer. This should make it much simpler to bundle puppeteer/puppeteer-core packages. We used dynamic requires in a few places in lib/: - BrowserFetcher was choosing between `http` and `https` based on some runtime value. This was easy to fix with explicit `require`. - BrowserFetcher and Launcher needed to know project root to store chromium revisions and to read package name and chromium revision from package.json. (projectRoot value would be different in node6). Instead of doing a backwards logic to infer these variables, we now pass them directly from `//index.js`. With this patch, I was able to bundle Puppeteer using browserify and the following config in `package.json`: ```json "browser": { "./lib/BrowserFetcher.js": false, "ws": "./lib/BrowserWebSocket", "fs": false, "child_process": false, "rimraf": false, "readline": false } ``` (where `lib/BrowserWebSocket.js` is a courtesy of @Janpot from #2374) And command: ```sh $ browserify -r puppeteer:./index.js > ppweb.js ``` References #2119

Currently connection assumes that transport is a websocket and tries to handle websocket-related errors. This patch: - moves ConnectionTransport interface to use callbacks instead of events. This way it could be used in browser context as well. - introduces WebSocketTransport that implements ConnectionTransport interface for ws. This is a preparation step for 2 things: - exposing `transport` option in the `puppeteer.connect` method - better support for `browserify` References puppeteer#2119

Currently connection assumes that transport is a websocket and tries to handle websocket-related errors. This patch: - moves ConnectionTransport interface to use callbacks instead of events. This way it could be used in browser context as well. - introduces WebSocketTransport that implements ConnectionTransport interface for ws. This is a preparation step for 2 things: - exposing `transport` option in the `puppeteer.connect` method - better support for `browserify` References #2119

Bundled version of Puppeteer should rely on native WebSocket. Luckily, 'ws' module supports the same interface as the native browser websockets. This patch switches WebSocketTransport to use the browser-compliant interface of 'ws'. After this patch, I was able to bundle Puppeteer for browser using the following config in `package.json`: ```json "browser": { "./lib/BrowserFetcher.js": false, "ws": "./lib/BrowserWebSocket", "fs": false, "child_process": false, "rimraf": false, "readline": false } ``` where `./lib/BrowserWebSocket` is: ```js module.exports = WebSocket; ``` and the bundling command is: ```sh $ browserify -r ./index.js:puppeteer > ppweb.js ``` References puppeteer#2119

Bundled version of Puppeteer should rely on native WebSocket. Luckily, 'ws' module supports the same interface as the native browser websockets. This patch switches WebSocketTransport to use the browser-compliant interface of 'ws'. After this patch, I was able to bundle Puppeteer for browser using the following config in `package.json`: ```json "browser": { "./lib/BrowserFetcher.js": false, "ws": "./lib/BrowserWebSocket", "fs": false, "child_process": false, "rimraf": false, "readline": false } ``` where `./lib/BrowserWebSocket` is: ```js module.exports = WebSocket; ``` and the bundling command is: ```sh $ browserify -r ./index.js:puppeteer > ppweb.js ``` References #2119

noamalffasy · 2018-09-26T11:23:01Z

When will the updated version be out?

aslushnikov · 2018-09-26T15:11:05Z

@noamalffasy The next release is scheduled for October, 4 (you can see next release date in the very beginning of our documentation).

noamalffasy · 2018-09-26T15:19:33Z

Is there a way I can get this version without waiting until the next release?

aslushnikov · 2018-09-26T15:33:38Z

@noamalffasy you can either clone from the github directly, or install the tip-of-tree release with npm i puppeteer@next.

noamalffasy · 2018-09-26T15:44:42Z

That worked!
Thank you!

noamalffasy · 2018-09-26T17:45:24Z

Okay I have an issue now with bundling,

ERROR in ./node_modules/puppeteer/lib/WebSocketTransport.js
Module not found: Error: Can't resolve 'ws' in '/node_modules/puppeteer/lib'
 @ ./node_modules/puppeteer/lib/WebSocketTransport.js 16:18-31
 @ ./node_modules/puppeteer/lib/Launcher.js
 @ ./node_modules/puppeteer/lib/Puppeteer.js
 @ ./node_modules/puppeteer/index.js

I'm using webpack

aslushnikov · 2018-09-26T17:55:26Z

@noamalffasy I'm not sure what's the lib/WebSocketTransport.js; there's a suitable one in utils/browser/WebSocket.js.

Note though: we don't currently publish bits we use to bundle, but you can git clone puppeteer and then run npm run bundle to bundle it locally.

noamalffasy · 2018-09-26T18:01:11Z

So this feature is only available if you clone the repository? Or is it temporary?

aslushnikov · 2018-09-26T18:49:35Z

@noamalffasy we're not shipping any bundled version of puppeteer for web, but we made sure that there are no obstacles in bundling puppeteer.

noamalffasy · 2018-09-26T19:06:49Z

But there is an issue, maybe I need to change my webpack config?

const path = require("path");

module.exports = {
  entry: "./src/main.ts",
  mode: "production",
  module: {
    rules: [
      {
        test: /\.ts$/,
        loaders: "babel-loader",
        exclude: /node_modules/
      },
      {
        test: /\.js$/,
        use: ["source-map-loader"],
        enforce: "pre"
      }
    ]
  },
  resolve: {
    extensions: [".ts", ".js", ".json"]
  },
  output: {
    filename: "bundle.js",
    path: path.resolve(__dirname, "dist")
  }
};

brandonros · 2018-10-05T01:30:39Z

I wrote some code that scrapes some web pages. It doesn't do too well in a cloud hosted environment like DigitalOcean. It'd be neat if a user could load a page served by my API that would then allow their browser tabs to be controlled through the regular puppeteer API (if they permitted/allowed it, etc. etc.). This is the opposite of me having to waste the server resources to run a web browser, while still allowing me to do scriptable things like user input, clicking, evaluating scripts, etc.

Was that kind of the vision here? Is that possible and I am just misunderstood?

aslushnikov · 2018-10-05T02:01:20Z

It'd be neat if a user could load a page served by my API that would then allow their browser tabs to be controlled through the regular puppeteer API (if they permitted/allowed it, etc. etc.). This is the opposite of me having to waste the server resources to run a web browser, while still allowing me to do scriptable things like user input, clicking, evaluating scripts, etc.

I think this is possible using the bundled version of puppeteer and extension's chrome.debugger.

Tip: you can pass a custom transport to puppeteer connect using the transport experimental option; check out our test in the /utils/browser/test.js

woniesong92 · 2018-11-18T00:01:46Z

@aslushnikov Q: Can I use this to use puppeteer inside of an already opened browser? For example, if I'm already logged into Facebook, can I execute a Puppeteer script inside the same browser so I don't have to login again? I thought it wouldn't work because if I had to launch a new headless browser, the cookies would be gone but comments on this issue and the merged PR give me some hope. Looking forward to your reply!

Janpot · 2018-11-18T09:46:45Z

@woniesong92 You can connect puppeteer to any browser that talks the devtools protocol. For that you'll first need to start chrome with an extra CLI flag --remote-debugging-port=9229. Then you can open http://localhost:9229/json/version and find the webSocketDebuggerUrl. Use that in puppeteer.connect as browserWSEndpoint instead of using puppeteer.launch.

ryzam · 2018-12-03T06:43:27Z

@Janpot do u have any document guideline how to run puppeteer in browser without running nodejs?

evanrolfe · 2019-07-05T13:23:49Z

I think this is possible using the bundled version of puppeteer and extension's chrome.debugger.

Tip: you can pass a custom transport to puppeteer connect using the transport experimental option; check out our test in the /utils/browser/test.js

@aslushnikov since chrome.debugger provides a sendCommand and onEvent functions, would it be possible to pass this in without having to use the experimental Target.exposeDevToolsProtocol command? i.e. we could have a ConnectionTransport class like ChromeDebuggerTransport which could be wrap the chrome.debugger functions so that Puppeteer could use it?

The reason why I ask this is because I cannot manage to get the Target.exposeDevToolsProtocol command to work properly - I never end up with the window.cdp object defined and there is not much information about this command apart from the API docs. This is what I've been trying:

    chrome.tabs.getCurrent((tab) => {
      let currentTabTarget = {tabId: tab.id};

      chrome.debugger.attach(currentTabTarget, '1.3', () => {
        if(chrome.runtime.lastError) {
          alert(chrome.runtime.lastError.message);
        }
      });

      chrome.debugger.getTargets((targets) => {
        currentTarget = targets.find((info) => { return info.url == tab.url });
        chrome.debugger.sendCommand(currentTabTarget, 'Target.exposeDevToolsProtocol', {targetId: currentTarget.id});
        chrome.debugger.detach(currentTabTarget, () => {
          if(chrome.runtime.lastError) {
            alert(chrome.runtime.lastError.message);
          }else{
            alert(window.cdp)
          }
        });
      });
    });

aslushnikov · 2019-07-15T02:48:01Z

@aslushnikov since chrome.debugger provides a sendCommand and onEvent functions, would it be possible to pass this in without having to use the experimental Target.exposeDevToolsProtocol command?

IIRC the chrome.debugger doesn't expose the newest version of DevTools protocol - with proper target ids, flattened session management etc. So I don't think it would be possible.

The reason why I ask this is because I cannot manage to get the Target.exposeDevToolsProtocol command to work properly - I never end up with the window.cdp object defined

Yeah, I think this is because chrome.debugger simply is not up-to-date with the modern DevTools protocol. The only way to get the window.cdp is to use devtools protocol right away externally when launching chrome, e.g. with Puppeteer, and then embed puppeteer-web. This is what we do with our puppeteer-web tests.

evanrolfe · 2019-07-15T08:13:58Z

@aslushnikov many thanks for getting back to me on this. I can't seem to find any information about the DevTools version that chrome.debugger exposes, but also since its in an experimental feature I think you're right this is probably just not possible at the moment.

In the future, if chrome.debugger becomes more stable and up-to-date, being able to run puppeteer-web using the chrome.debugger interface would be very useful for me. This could allow us to write chrome-extensions which could use puppeteer without the need for users to launch the browser from the command line.

m-rousse · 2019-08-06T15:41:54Z

Hi, we are trying to use the chrome.debugger api to interact with the browser using puppeteer. We were able to send messages back and forth. We mocked a few calls that are denied by the browser (eg. all Target.* commands), which allowed us to execute a few actions, however we are not able to retrieve the sessionId associated to targets as they are normally retrieved using the Target.attachToTarget. Using chrome.debugger.attach does not return the sessionId either.

@aslushnikov if I understand well, the devtools exposed by chrome.debugger are not the same than the ones exposed by the websocket obtained with --remote-debugging-port. Is it planned to update the chrome.debugger API/accessible commands? Also, is there a chromium issue to track this feature request?

sachinwins · 2020-12-22T13:58:33Z

Wanted to check any update on "launching the browser other than the command line" from the client-side JS page?

Basically, I want to deliver HTML page with some JavaScript file to the User, that will launch the browser (Currently, we are launching it from the command line). Once the browser is launched I will get "webSocketDebuggerURL" from http://127.0.0.1:9222/json/version. to connect.

Any help on, how I can achieve it without using any server/command Line?

…entEmitter. I noticed this while trying to actually use the TypeScript client in a browser context and went digging a bit into this `isomorphic-ws` module. Spoiler: it's a lie! The module is merely a switch which selects the right import at runtime. Yet, it does not attempt to fill the gaps between the Node.js and the Browser-base WebSocket. The main issue being that, on Node.js, the WebSocket is an instance of [EventEmitter](https://nodejs.org/api/events.html#events_class_eventemitter) which comes with fairly useful methods like `once`, `removeAllListener` and so forth. On the browser however, we are doomed with the [EventTarget](https://developer.mozilla.org/en-US/docs/Web/API/EventTarget) and its crappy API :'( ... One particularly surprising thing is that, Puppeteer and the supposed cross-platform testing isn't any useful here since Puppeteer seems to be using its own emulation of the WebSocket, which isn't at all the one used by the Browser but has an API closer to the Node.js one. So while the browser tests are all passing, they do not actually pass on a real browser 🤦 puppeteer/puppeteer#2119 (comment) This PR introduces a slightly better `IsomorphicWebSocket` interface as a drop-in replacement for our internal use. It only covers the `on`, `once`, `removeListener` and `removeAllListeners` which we use internally. I had to resort to a JavaScript module for that because I couldn't get the TypeScript compiler to cooperate. As a consequence, the .js module does not get copied in the `dist` by default, I had to manually copy it as part of the build command, which _seems wrong_ but I am too unfamiliar with the TypeScript tooling :/

Janpot mentioned this issue Apr 13, 2018

feat(browser): Run puppeteer in browser (POC) #2374

Closed

aslushnikov mentioned this issue Sep 6, 2018

Can't found module ChromiumRevision building by webpack #2245

Closed

aslushnikov mentioned this issue Sep 6, 2018

refactor: avoid dynamic requires in lib/ folder #3208

Merged

aslushnikov mentioned this issue Sep 7, 2018

refactor: move Connection to use ConnectionTransport #3217

Merged

aslushnikov mentioned this issue Sep 7, 2018

refactor: use browser-compliant interface of 'ws' #3218

Merged

Janpot closed this as completed Sep 13, 2018

brandonros mentioned this issue Oct 5, 2018

chore: make sure Puppeteer bundling works #3239

Merged

KtorZ mentioned this issue Aug 2, 2021

(Slightly Better) Isomorphic WebSocket CardanoSolutions/ogmios#96

Merged

KtorZ mentioned this issue Mar 22, 2022

Skip options for browser WebSocket() CardanoSolutions/ogmios#195

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: run puppeteer in the browser #2119

Feature request: run puppeteer in the browser #2119

Janpot commented Feb 28, 2018

aslushnikov commented Apr 11, 2018

Janpot commented Apr 11, 2018

aslushnikov commented Apr 11, 2018

Janpot commented Apr 12, 2018 •

edited

aslushnikov commented Apr 12, 2018

Janpot commented Apr 12, 2018

Janpot commented Apr 13, 2018

elisherer commented Apr 16, 2018

aslushnikov commented Sep 6, 2018

Janpot commented Sep 6, 2018

aslushnikov commented Sep 6, 2018

noamalffasy commented Sep 26, 2018

aslushnikov commented Sep 26, 2018

noamalffasy commented Sep 26, 2018

aslushnikov commented Sep 26, 2018

noamalffasy commented Sep 26, 2018

noamalffasy commented Sep 26, 2018 •

edited

aslushnikov commented Sep 26, 2018

noamalffasy commented Sep 26, 2018

aslushnikov commented Sep 26, 2018

noamalffasy commented Sep 26, 2018

brandonros commented Oct 5, 2018

aslushnikov commented Oct 5, 2018

woniesong92 commented Nov 18, 2018 •

edited

Janpot commented Nov 18, 2018

ryzam commented Dec 3, 2018

evanrolfe commented Jul 5, 2019 •

edited

aslushnikov commented Jul 15, 2019

evanrolfe commented Jul 15, 2019

m-rousse commented Aug 6, 2019

sachinwins commented Dec 22, 2020

Feature request: run puppeteer in the browser #2119

Feature request: run puppeteer in the browser #2119

Comments

Janpot commented Feb 28, 2018

aslushnikov commented Apr 11, 2018

Janpot commented Apr 11, 2018

aslushnikov commented Apr 11, 2018

Janpot commented Apr 12, 2018 • edited

aslushnikov commented Apr 12, 2018

Janpot commented Apr 12, 2018

Janpot commented Apr 13, 2018

elisherer commented Apr 16, 2018

aslushnikov commented Sep 6, 2018

Janpot commented Sep 6, 2018

aslushnikov commented Sep 6, 2018

noamalffasy commented Sep 26, 2018

aslushnikov commented Sep 26, 2018

noamalffasy commented Sep 26, 2018

aslushnikov commented Sep 26, 2018

noamalffasy commented Sep 26, 2018

noamalffasy commented Sep 26, 2018 • edited

aslushnikov commented Sep 26, 2018

noamalffasy commented Sep 26, 2018

aslushnikov commented Sep 26, 2018

noamalffasy commented Sep 26, 2018

brandonros commented Oct 5, 2018

aslushnikov commented Oct 5, 2018

woniesong92 commented Nov 18, 2018 • edited

Janpot commented Nov 18, 2018

ryzam commented Dec 3, 2018

evanrolfe commented Jul 5, 2019 • edited

aslushnikov commented Jul 15, 2019

evanrolfe commented Jul 15, 2019

m-rousse commented Aug 6, 2019

sachinwins commented Dec 22, 2020

Janpot commented Apr 12, 2018 •

edited

noamalffasy commented Sep 26, 2018 •

edited

woniesong92 commented Nov 18, 2018 •

edited

evanrolfe commented Jul 5, 2019 •

edited