Simple Scraper library for Node.js


SimpleScraper defines parsers.


Via npm on Node:

npm install simplescraper


Reference in your program:

var ss = require('simplescraper');

Create a document:

var doc = ss.document(doctext);

Find and process elements:

var elems = doc.elements();

for (var elem = ...; elem; elem = ...) {
    // process element
}

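If an array is more convenient than a hand-written loop, the same pull-until-falsy pattern can be wrapped in a small helper. This is a minimal sketch, not part of SimpleScraper: `collect` and `makeNext` are hypothetical names, and the only assumption is that you can supply a function that returns the next element, or a falsy value once the collection is exhausted.

```javascript
// Hypothetical helper, not part of SimpleScraper.
// `next` is any zero-argument function that returns the next element,
// or a falsy value (null/undefined) when there are no more.
function collect(next) {
    var result = [];

    for (var item = next(); item; item = next()) {
        result.push(item);
    }

    return result;
}

// Stand-in source for illustration: yields each value in turn, then null.
function makeNext(values) {
    var index = 0;

    return function () {
        return index < values.length ? values[index++] : null;
    };
}

var collected = collect(makeNext(['a', 'b', 'c']));
console.log(collected); // [ 'a', 'b', 'c' ]
```

With a real element collection, `next` would wrap whatever call advances `elems`.
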
Find elements by tag:

var elems = doc.elements('div');

Find elements by class:

var elems = doc.elements('.news');

Find elements by id:

var elems = doc.elements('#content');

Combined filters:

var elems = doc.elements('div .news');

Find first element (or null):

var firstelem = doc.element('div');
var firstelem = doc.element('.news');
var firstelem = doc.element('#content');

Filter elements:

var elems = doc.elements(function (element) { return element.attribute('style') != null; });
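
Because the filter is an ordinary function, predicates can be built and combined before they are passed to `doc.elements`. The sketch below is illustrative only: `hasAttribute`, `attributeEquals` and `both` are hypothetical helpers, and `fakeElement` exists only so the snippet runs on its own. It relies solely on `element.attribute(name)` returning null for a missing attribute.

```javascript
// Hypothetical predicate builders, not part of SimpleScraper.
function hasAttribute(name) {
    return function (element) {
        return element.attribute(name) != null;
    };
}

function attributeEquals(name, value) {
    return function (element) {
        return element.attribute(name) === value;
    };
}

// True only when both predicates accept the element.
function both(first, second) {
    return function (element) {
        return first(element) && second(element);
    };
}

// Fake element for illustration; real elements come from doc.elements().
function fakeElement(attrs) {
    return {
        attribute: function (name) {
            return Object.prototype.hasOwnProperty.call(attrs, name) ? attrs[name] : null;
        }
    };
}

var styledNews = both(hasAttribute('style'), attributeEquals('class', 'news'));

console.log(styledNews(fakeElement({ class: 'news', style: 'color: red' }))); // true
console.log(styledNews(fakeElement({ class: 'news' })));                      // false
```

The combined predicate would then be passed as `doc.elements(styledNews)`.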

Get an attribute of an element (or null if it is absent):

var myattr = elem.attribute('myattr');

Get all attributes as a plain JavaScript object, where each attribute name is a property:

var attrs = elem.attributes();
// { class: 'news', type: 'text', ... }
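
Since the result is a plain object, the usual object tools apply. As a small sketch, the `sampleAttrs` literal below stands in for the value returned by `elem.attributes()`, and `describeAttributes` is a hypothetical helper that formats the attributes as `name="value"` pairs:

```javascript
// Stand-in for the object returned by elem.attributes().
var sampleAttrs = { class: 'news', type: 'text' };

// Hypothetical helper: formats attributes as name="value" pairs, sorted by name.
function describeAttributes(attributes) {
    return Object.keys(attributes)
        .sort()
        .map(function (name) {
            return name + '="' + attributes[name] + '"';
        })
        .join(' ');
}

console.log(Object.keys(sampleAttrs));        // [ 'class', 'type' ]
console.log(describeAttributes(sampleAttrs)); // class="news" type="text"
```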

Get element tag name as string:

var tagname = elem.tag();
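
As an illustration of using `tag()` across several elements, the sketch below tallies elements by tag name. `countByTag` is a hypothetical helper that assumes the elements are already in an array, and `fakeTagged` only stands in for real elements.

```javascript
// Hypothetical helper, not part of SimpleScraper.
// Takes an array of objects exposing tag() and returns { tagName: count }.
function countByTag(elements) {
    var counts = {};

    elements.forEach(function (element) {
        var name = element.tag();
        counts[name] = (counts[name] || 0) + 1;
    });

    return counts;
}

// Fake elements for illustration only.
function fakeTagged(name) {
    return { tag: function () { return name; } };
}

var tagCounts = countByTag([fakeTagged('div'), fakeTagged('p'), fakeTagged('div')]);
console.log(tagCounts); // { div: 2, p: 1 }
```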


git clone git://
cd SimpleScraper
npm install
npm test




  • 0.0.1: Published
  • 0.0.2: Published, new examples, internal refactor




Feel free to file issues and submit pull requests. Contributions are welcome.

If you submit a pull request, please be sure to add or update corresponding test cases, and ensure that npm test continues to pass.
