Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
Sinew-Node collects structured data from web sites (screen scraping).
CoffeeScript
Branch: master

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
lib
src
support
test
.gitignore
README.md
package.json

README.md

Welcome to Sinew-Node

Sinew-Node collects structured data from web sites (screen scraping).

Sinew-Node is distributed as a npm package:

npm install sinew-node

Example

Here's an example for collecting Reddit's topic list:

// JavaScript
var sinew = require('sinew-node');
sinew.get('http://www.reddit.com/r/javascript/', function() {
  this.$('#siteTable div.thing a.title').each(function(index) {
    console.log(this.innerHTML);
  });
});
# CoffeeScript
sinew = require 'sinew-node'
sinew.get 'http://www.reddit.com/r/javascript/', ->
  (@$ '#siteTable div.thing a.title').each (index) ->
    console.log @innerHTML

Features

  • Sinew caches all HTTP requests on disk. That makes it possible to iterate quickly. Crawl once and then continue to work on your recipe. Run the recipe over and over while you tune your CSS selectors and regular expressions.
Something went wrong with that request. Please try again.