Skyscraper.JS - an interactive web scraping bookmarklet.
- Once installed, you have a little bookmarklet in your browser.
- Click it and it will identify tables and lists (UL, OL) in the current web page.
- Click a table to select it.
- Click a data row to identify what is a row.
- Click as many data fields as you want.
- Press 'Begin Parsing' to start the parse.
- Start a webserver in this directory on port 8000 *, e.g.
python -m SimpleHTTPServer
on a Mac. - Open the demo page on http://localhost:8000
*The host name 'localhost:8000' is used in the index.html
and skyscraper.js
files, please change to your own host name.