Parsing and extracting information from (possibly malformed) HTML/XML documents
http://hackage.haskell.org/package/tagsoup
TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making
it ideal for screen-scraping.
Users should start from the Text.HTML.TagSoup module.
- Sources inherited from project openSUSE:Leap:42.3
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:Leap:42.3:Update/ghc-tagsoup && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
ghc-tagsoup.changes | 0000002535 2.48 KB | |
ghc-tagsoup.spec | 0000002502 2.44 KB | |
tagsoup-0.13.10.tar.gz | 0000044587 43.5 KB |
Revision 4 (latest revision is 7)
Dominique Leuenberger (dimstar_suse)
accepted
request 402540
from
Ondřej Súkup (mimi_vx)
(revision 4)
- update to 0.13.10 * improve the Haddock documentation * fix some > 16bit HTML entities - update to 0.13.9 * fix a space leak * fix the demo examples * make IsString a superclass of StringLike * make flattenTree O(n) instead of O(n^2) - update to 0.13.8 * add parse/render functions directly to the Tree module - update to 0.13.7 * make sure upper case &#X works in lookupEntity - update to 0.13.6 * some named entities require a trailing semicolon (e.g. mid) - update to 0.13.5 * add isTagComment function
Comments 0