Server side readability with node.js
JavaScript Other
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
lib
misc updated to readability.js r164 Dec 3, 2010
test
.gitignore misc Nov 23, 2010
LICENSE.txt Initial commit Nov 19, 2010
README.md updated performance info Nov 22, 2010
notes.txt
package.json
package.sh
test.txt misc Nov 23, 2010

README.md

node-readability

Readability.js by Arc90 ported to node.js.

Blog post: Server side readability with node.js

Requirements

Live demo

I'm working on it...

Example

var readability = require('readability');
//...
// This is an very early example. The API is subject to change.
readability.parse(html, url, function(result) {
    console.log(result.title, result.content);
});

Performance

In my testing of 140 pages with an average size of 58KB collected from digg, delicious and hacker news, the average time taken for each page is about 1.1 seconds on a Mac Mini (2.4G Intel Core 2 Duo).

Limitation

  • no fetching next pages
  • no support for frames

Plan

  • Performance optimization
  • Better API, more options
  • Support more readability features