Java library for working with HTML
https://jsoup.org/
jsoup is a Java library for working with HTML.
It provides an API for extracting and manipulating data,
using DOM, CSS, and jQuery-like methods.
jsoup implements the WHATWG HTML5 specification.
- scrapes and parses HTML from a URL, file, or string
- finds and extracts data, using DOM traversal or CSS selectors
- manipulates the HTML elements, attributes, and text
- cleans user-submitted content against a safe whitelist to prevent XSS attacks
- outputs tidied HTML
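The features listed above can be illustrated with a minimal sketch. It assumes jsoup 1.14.2 or later is on the classpath (older releases name the `Safelist` class `Whitelist`); the class name `JsoupFeatureSketch`, its helper methods, and the sample HTML are hypothetical and chosen only for illustration.

```java
import java.util.List;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.safety.Safelist;

// Hypothetical demo class; not part of jsoup itself.
class JsoupFeatureSketch {

    // Find and extract: collect the href of every link via a CSS selector.
    static List<String> linkTargets(String html) {
        Document doc = Jsoup.parse(html);
        return doc.select("a[href]").eachAttr("href");
    }

    // Manipulate: set rel="nofollow" on every link and return the body HTML.
    static String addNoFollow(String html) {
        Document doc = Jsoup.parse(html);
        doc.select("a[href]").attr("rel", "nofollow");
        return doc.body().html();
    }

    // Clean: keep only the tags and attributes permitted by Safelist.basic().
    // Assumes jsoup >= 1.14.2; earlier versions use Whitelist.basic() instead.
    static String sanitize(String untrustedHtml) {
        return Jsoup.clean(untrustedHtml, Safelist.basic());
    }

    public static void main(String[] args) {
        String html = "<p>See <a href=\"https://jsoup.org/\">jsoup</a></p>";
        System.out.println(linkTargets(html));
        System.out.println(addNoFollow(html));
        System.out.println(sanitize("<p>Hi<script>alert(1)</script></p>"));
    }
}
```

Parsing from a string keeps the sketch free of network and file access; the same selector and cleaning calls would apply to a document loaded from a URL or file.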
jsoup can parse invalid HTML, including tag soup.
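As a rough illustration of tag-soup handling, the sketch below feeds jsoup two paragraphs whose `<p>` tags are never closed and asks for the paragraphs it recovers. It assumes jsoup is on the classpath; the class name `TagSoupSketch` and the input string are hypothetical.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.select.Elements;

// Hypothetical demo class; not part of jsoup itself.
class TagSoupSketch {

    // Parses markup with unclosed <p> tags and returns the <p> elements
    // found in the resulting parse tree.
    static Elements paragraphs(String html) {
        Document doc = Jsoup.parse(html);
        return doc.select("p");
    }

    public static void main(String[] args) {
        Elements ps = paragraphs("<p>One<p>Two");
        System.out.println(ps.size());     // number of paragraphs recovered
        System.out.println(ps.eachText()); // text content of each paragraph
    }
}
```

Under HTML5 parsing rules a new `<p>` start tag implicitly closes an open paragraph, so the two fragments should come back as sibling elements rather than nested ones.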
- Links to home:fstrba:maven / jsoup
- Checkout package:
  osc -A https://api.opensuse.org checkout home:fstrba:maven:test/jsoup && cd $_
Source Files
Filename | Size | Changed |
---|---|---|
_link | 697 Bytes | |