Skip to content

okdistribute/nutella-scrape

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 

nutella-scrape

NPM

nutella

  1. Run sudo npm install nutella-scrape -g
  2. Run nutella-scrape
  3. ???
  4. LEARN!!

In this tutorial, we will work through how to scrape websites using Node.js for the primary purpose of using it in other programs -- in servers, frontends (yes, Node works in the browser!), or just writing a table to disk for analysis elsewhere.

The DOM (Document Object Model) is an abstract concept describing how we can interact with HTML. JavaScript is GREAT for traversing HTML (i.e., the DOM) because it was made to work with HTML in the first place.

TODO

  • parallel
  • spoofing
  • cookies/login walls
  • electron-microscope

About

🍫 learn to scrape the web with Node.js -- it tastes like chocolate

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published