Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
Server side readability with node.js
branch: master
Failed to load latest commit information.
lib Added option argument to readability.parse. Turn off debugging and pr…
misc updated to readability.js r164
test Added option argument to readability.parse. Turn off debugging and pr…
.gitignore misc
LICENSE.txt Initial commit
README.md updated performance info
notes.txt misc
package.json Merge branch 'master' of https://github.com/chetan51/node-readability
package.sh
test.txt misc

README.md

node-readability

Readability.js by Arc90 ported to node.js.

Blog post: Server side readability with node.js

Requirements

Live demo

I'm working on it...

Example

var readability = require('readability');
//...
// This is an very early example. The API is subject to change.
readability.parse(html, url, function(result) {
    console.log(result.title, result.content);
});

Performance

In my testing of 140 pages with an average size of 58KB collected from digg, delicious and hacker news, the average time taken for each page is about 1.1 seconds on a Mac Mini (2.4G Intel Core 2 Duo).

Limitation

  • no fetching next pages
  • no support for frames

Plan

  • Performance optimization
  • Better API, more options
  • Support more readability features
Something went wrong with that request. Please try again.