Parsing and extracting information from (possibly malformed) HTML/XML documents
http://hackage.haskell.org/package/tagsoup
TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making
it ideal for screen-scraping.
Users should start from the Text.HTML.TagSoup module.
- Sources inherited from project openSUSE:Leap:42.3
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:Leap:42.3:Ports/ghc-tagsoup && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
ghc-tagsoup.changes | 0000003039 2.97 KB | |
ghc-tagsoup.spec | 0000002478 2.42 KB | |
tagsoup-0.14.1.tar.gz | 0000044031 43 KB |
Comments 0