Parsing and extracting information from (possibly malformed) HTML/XML documents

Package ghc-tagsoup

TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making
it ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.
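
As a starting point, the following is a minimal sketch of screen-scraping with the Text.HTML.TagSoup module. It assumes the tagsoup library is available to GHC. The helper names extractLinks and extractText and the sample HTML string are hypothetical, chosen only for illustration; parseTags, isTagOpenName, fromAttrib and innerText are functions exported by the module.

```haskell
module Main where

import Text.HTML.TagSoup (fromAttrib, innerText, isTagOpenName, parseTags)

-- | Hypothetical helper: collect the href attribute of every <a> tag.
-- fromAttrib returns an empty string when the attribute is absent,
-- so anchors without an href are filtered out.
extractLinks :: String -> [String]
extractLinks html =
  [ href
  | tag <- parseTags html
  , isTagOpenName "a" tag
  , let href = fromAttrib "href" tag
  , not (null href)
  ]

-- | Hypothetical helper: concatenate all text nodes, dropping the markup.
extractText :: String -> String
extractText = innerText . parseTags

main :: IO ()
main = do
  -- Deliberately malformed input: the <p> and first <a> tags are never closed.
  let html = "<p>See <a href=\"https://example.org\">this page<p>and <a>nothing</a>"
  mapM_ putStrLn (extractLinks html)
  putStrLn (extractText html)
```

Because parseTags produces a flat list of tags rather than a tree, unclosed or mismatched tags do not cause a parse failure; the sketch simply filters that list. Run against the sample input, it should print the one link it finds, followed by the text with the markup removed.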

  • Devel package for openSUSE:Factory
  • 2 derived packages
  • Links to openSUSE:Factory / ghc-tagsoup
  • Checkout: osc -A https://api.opensuse.org checkout devel:languages:haskell/ghc-tagsoup && cd $_
Source Files

Filename                 Size (bytes)   Size
ghc-tagsoup.changes      5237           5.11 KB
ghc-tagsoup.spec         3590           3.51 KB
tagsoup-0.14.8.tar.gz    43894          42.9 KB