Build and scan parse-trees of HTML
HTML-Tree is a suite of Perl modules for making parse trees out of HTML
source. It consists of mainly two modules, whose documentation you should
refer to: HTML::TreeBuilder and HTML::Element.
HTML::TreeBuilder is the module that builds the parse trees. (It uses
HTML::Parser to do the work of breaking the HTML up into tokens.)
The tree that TreeBuilder builds for you is made up of objects of the class
HTML::Element.
If you find that you do not properly understand the documentation for
HTML::TreeBuilder and HTML::Element, it may be because you are unfamiliar
with tree-shaped data structures, or with object-oriented modules in
general. Sean Burke has written some articles for _The Perl Journal_
('www.tpj.com') that seek to provide that background. The full text of
those articles is contained in this distribution, as:
* HTML::Tree::AboutObjects
"User's View of Object-Oriented Modules" from TPJ17.
* HTML::Tree::AboutTrees
"Trees" from TPJ18
* HTML::Tree::Scanning
"Scanning HTML" from TPJ19
Readers already familiar with object-oriented modules and tree-shaped data
structures should read just the last article. Readers without that
background should read the first, then the second, and then the third.
new
Redirects to HTML::TreeBuilder::new
new_from_file
Redirects to HTML::TreeBuilder::new_from_file
new_from_content
Redirects to HTML::TreeBuilder::new_from_content
- Developed at devel:languages:perl
-
4
derived packages
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:Factory/perl-HTML-Tree && cd $_
- Create Badge
Source Files
Filename | Size | Changed |
---|---|---|
HTML-Tree-5.07.tar.gz | 0000150477 147 KB | |
cpanspec.yml | 0000000669 669 Bytes | |
perl-HTML-Tree.changes | 0000005540 5.41 KB | |
perl-HTML-Tree.spec | 0000003270 3.19 KB |
Comments 0