Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
Event-driven HTML5 Parser in Javascript
JavaScript
Pull request Compare This branch is 1 commit ahead, 879 commits behind aredridel:master.

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
deps
doc
lib
test
testdata
tools
.gitmodules
COPYING
DOCUMENTATION.md
Makefile
README.md
example.js
package.json
test.js

README.md

HTML5 Parser for node.js

Example (With jQuery!)

    /* Before you run this, run:
          git submodule update --init
          (cd deps/jquery; rake)
    */
    var  HTML5 = require('html5'),
        Script = process.binding('evals').Script,
           sys = require('sys'),
            fs = require('fs'),
         jsdom = require('jsdom'),
        window = jsdom.jsdom(null, null, {parser: HTML5}).createWindow()

    var parser = new HTML5.Parser({document: window.document});

    var inputfile = fs.readFileSync('doc/jquery-example.html');
    parser.parse(inputfile);

    jsdom.jQueryify(window, __dirname + '/deps/jquery/dist/jquery.js', function(window, jquery) {
            Script.runInNewContext('jQuery("p").append("<b>Hi!</b>")', window);
            sys.puts(window.document.innerHTML);

    });

Interesting features

  • Streaming parser: You can pass parser.parse an EventEmitter and the parser will keep adding data as it's received.

  • HTML5 parsing algorithm. If you find something this can't parse, I'll want to know about it. It should make sense out of anything a browser can.

Installation

Use npm, or to use the git checkout, read on.

You'll need to fetch dependencies or initialize git submodules if you're pulling this from my git repository.

npm install

or

git submodules init

and give it a run:

node test.js

Git repository at http://theinternetco.net/~aredridel/projects/js/html5/.git/

Something went wrong with that request. Please try again.