Server side readability with node.js
JavaScript Other
Switch branches/tags
Nothing to show
Pull request Compare This branch is even with arrix:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
lib
misc
test
.gitignore
LICENSE.txt
README.md
notes.txt
package.json
package.sh
test.txt

README.md

node-readability

Readability.js by Arc90 ported to node.js.

Blog post: Server side readability with node.js

Requirements

Live demo

I'm working on it...

Example

var readability = require('readability');
//...
// This is an very early example. The API is subject to change.
readability.parse(html, url, function(result) {
    console.log(result.title, result.content);
});

Performance

In my testing of 140 pages with an average size of 58KB collected from digg, delicious and hacker news, the average time taken for each page is about 1.1 seconds on a Mac Mini (2.4G Intel Core 2 Duo).

Limitation

  • no fetching next pages
  • no support for frames

Plan

  • Performance optimization
  • Better API, more options
  • Support more readability features