We have some news to share for the request index beta feature. We’ve added more options to sort your requests, counters to the individual filters and documentation for the search functionality. Checkout the blog post for more details.

HTML parser based on the WHAT-WG Web Applications

Edit Package python-html5lib
http://code.google.com/p/html5lib

HTML parser designed to follow the WHATWG HTML5 specification. The parser is designed to handle all
flavours of HTML and parses invalid documents using well-defined error handling rules compatible
with the behaviour of major desktop web browsers.
Output is to a tree structure; the current release supports output to ElementTree (including
cElementTree and lxml.etree), minidom, and a custom simpletree format.
html5lib also includes a HTML sanitizer, "treewalkers" for converting various tree formats into
streams and filters and serializers to operate on those streams.

Author:
--------
James Graham

Source Files (show merged sources derived from linked package)
Filename Size Changed
Comments 0
No comments available
openSUSE Build Service is sponsored by