Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

Sinew-Node collects structured data from web sites (screen scraping).

branch: master

Fetching latest commit…

Cannot retrieve the latest commit at this time

README.md

Welcome to Sinew-Node

Sinew-Node collects structured data from web sites (screen scraping).

Sinew-Node is distributed as a npm package:

npm install sinew-node

Example

Here's an example for collecting Reddit's topic list:

// JavaScript
var sinew = require('sinew-node');
sinew.get('http://www.reddit.com/r/javascript/', function() {
  this.$('#siteTable div.thing a.title').each(function(index) {
    console.log(this.innerHTML);
  });
});
# CoffeeScript
sinew = require 'sinew-node'
sinew.get 'http://www.reddit.com/r/javascript/', ->
  (@$ '#siteTable div.thing a.title').each (index) ->
    console.log @innerHTML

Features

  • Sinew caches all HTTP requests on disk. That makes it possible to iterate quickly. Crawl once and then continue to work on your recipe. Run the recipe over and over while you tune your CSS selectors and regular expressions.
Something went wrong with that request. Please try again.