html5lib
Standards-compliant library for parsing and serializing HTML documents and fragments in Python
Testsuite data for html5lib, including the de-facto standard HTML parsing tests.
Automatically exported from code.google.com/p/html5lib. Purely archival.
html5lib now has its very own website!
2
Updated May 19, 2014
Ruby port of html5lib, currently unmaintained.
PHP port of html5lib, currently unmaintained.