brisjs-web-scraping-talk

Code to accompany my talk on web scraping for the Brisbane JavaScript meeting in September 2018

Why not do your data wrangling, analysis and visualization entirely in JavaScript? To support my effort please buy or help promote my book Data Wrangling with JavaScript.

Do your prototyping and exploratory data analysis in JavaScript with Data-Forge Notebook.

Click here to support my work

Slides / Video

The slides for the talk are available here: https://www.slideshare.net/AshleyDavis33/web-scraping-112846350

A video will be posted here as soon it is made available by BrisJS.

Running the code

Clone or download this repo.

Open a command line, change directory to either the simple or advanced sub-directory, then install dependencies:

npm install

Then run the script:

npm start

or:

node index.js

Production issues

If you try to use the advanced technique in production, here's some production issues you'll want to investigate:

Performance
- Cache and reuse the Nightmare object
- Disable image download
- Batch your requests
Debugging
- Show the Electron window
- Enable devtools in the Electron window
- Handle errors from Nightmare
- Display logging from target web page running in the headless browser

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

brisjs-web-scraping-talk

Slides / Video

Contents

Running the code

Production issues

Files

README.md

Latest commit

History

README.md

File metadata and controls

brisjs-web-scraping-talk

Slides / Video

Contents

Running the code

Production issues