HTML parser based on the WHAT-WG Web Applications

HTML parser designed to follow the WHATWG HTML5 specification. The parser is designed to handle all
flavours of HTML and parses invalid documents using well-defined error handling rules compatible
with the behaviour of major desktop web browsers.
Output is to a tree structure; the current release supports output to ElementTree (including
cElementTree and lxml.etree), minidom, and a custom simpletree format.
html5lib also includes a HTML sanitizer, "treewalkers" for converting various tree formats into
streams and filters and serializers to operate on those streams.

James Graham

Source Files (show merged sources derived from linked package)
Filename Size Changed Actions
html5lib-0.90.tar.bz2 0000068043 66.4 KB almost 10 years
python-html5lib.changes 0000000310 310 Bytes almost 10 years
python-html5lib.spec 0000001401 1.37 KB almost 10 years
Comments for python-html5lib 0