Parsing and extracting information from (possibly malformed) HTML/XML documents
TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification and can parse either well-formed XML or unstructured,
malformed HTML from the web. The library also provides functions for
extracting information from an HTML document, making it well suited
to screen-scraping.
Users should start from the Text.HTML.TagSoup module.
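As a rough illustration of the screen-scraping use described above, the following minimal sketch pulls link targets and plain text out of deliberately malformed HTML. It assumes the tagsoup package is installed and uses `parseTags`, `isTagOpenName`, `fromAttrib` and `innerText` from Text.HTML.TagSoup; the helper names `extractLinks` and `extractText` and the sample markup are hypothetical, chosen only for this example.

```haskell
module Main where

import Text.HTML.TagSoup (parseTags, isTagOpenName, fromAttrib, innerText)

-- | Hypothetical helper: collect the href attribute of every <a> open tag,
-- in document order. A missing href yields an empty string.
extractLinks :: String -> [String]
extractLinks html =
  [ fromAttrib "href" tag
  | tag <- parseTags html
  , isTagOpenName "a" tag
  ]

-- | Hypothetical helper: concatenate all text nodes, ignoring markup.
extractText :: String -> String
extractText = innerText . parseTags

main :: IO ()
main = do
  -- Sample input with unclosed <p> and <a> tags; parseTags does not
  -- require well-formed markup.
  let sample = "<p>See <a href=\"https://example.org/\">this page<p>and <a href='/about'>about"
  mapM_ putStrLn (extractLinks sample)
  putStrLn (extractText sample)
```

The sketch works on the flat list of tags returned by `parseTags` rather than building a tree, which is why unclosed elements in the input do not need special handling.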
- 1 derived package
- Links to openSUSE:Le...2.2:Update / ghc-tagsoup.6476
- Checkout package:
  osc -A https://api.opensuse.org checkout openSUSE:Leap:42.2:Update/ghc-tagsoup && cd $_
Source Files
| Filename | Size | Changed |
|---|---|---|
| _link | 51 Bytes | |