Please login to access the resource

Parsing and extracting information from (possibly malformed) HTML/XML documents

Edit Package ghc-tagsoup
http://hackage.haskell.org/package/tagsoup

TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making
it ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.

Refresh
Refresh
Source Files
Filename Size Changed
ghc-tagsoup.changes 0000002535 2.48 KB
ghc-tagsoup.spec 0000002502 2.44 KB
tagsoup-0.13.10.tar.gz 0000044587 43.5 KB
Revision 4 (latest revision is 7)
Dominique Leuenberger's avatar Dominique Leuenberger (dimstar_suse) accepted request 402540 from Ondřej Súkup's avatar Ondřej Súkup (mimi_vx) (revision 4)
- update to 0.13.10
* improve the Haddock documentation
* fix some > 16bit HTML entities

- update to 0.13.9
* fix a space leak
* fix the demo examples
* make IsString a superclass of StringLike
* make flattenTree O(n) instead of O(n^2)

- update to 0.13.8
* add parse/render functions directly to the Tree module

- update to 0.13.7
* make sure upper case &#X works in lookupEntity

- update to 0.13.6
* some named entities require a trailing semicolon (e.g. mid)

- update to 0.13.5
* add isTagComment function
Comments 0
openSUSE Build Service is sponsored by