HTML parser class

Edit Package perl-HTML-Parser
http://search.cpan.org/dist/HTML-Parser/

Objects of the HTML::Parser class will recognize markup and separate it
from plain text (alias data content) in HTML documents. As different
kinds of markup and text are recognized, the corresponding event handlers
are invoked.

HTML::Parser is not a generic SGML parser. We have tried to make it able to
deal with the HTML that is actually "out there", and it normally parses as
closely as possible to the way the popular web browsers do it instead of
strictly following one of the many HTML specifications from W3C. Where
there is disagreement, there is often an option that you can enable to
get the official behaviour.

The document to be parsed may be supplied in arbitrary chunks. This makes
on-the-fly parsing as documents are received from the network possible.

If event driven parsing does not feel right for your application, you might
want to use HTML::PullParser. This is an HTML::Parser subclass that allows
a more conventional program structure.

Refresh
Refresh
Source Files
Filename Size Changed
HTML-Parser-3.61.tar.bz2 0000074524 72.8 KB
HTML-Parser-3.64.tar.bz2 0000075156 73.4 KB
perl-HTML-Parser.changes 0000006546 6.39 KB
perl-HTML-Parser.spec 0000002980 2.91 KB
ready 0000000000 0 Bytes
Revision 11 (latest revision is 64)
Anna Maresova's avatar Anna Maresova (anicka) committed (revision 11)
- update to 3.64 
 * Convert files to UTF-8
 * Don't allow decode_entities() to generate illegal Unicode chars
 * Copyright 2009
 * Remove rendundant (repeated) test
 * Make parse_file() method use 3-arg open [RT#49434]
 * Take more care to prepare the char range for encode_entities [RT#50170]
 * decode_entities confused by trailing incomplete entity
 * HTTP::Header doc typo fix.
 * Do not bother tracking style or script, they're ignored.
 * Bring HTML 5 head elements up to date with WD-html5-20090423.
 * Improve HeadParser performance.
 * Doc patch: Make it clearer what the return value from ->parse is
- remove last patch (fixed in upstream)
Comments 0
openSUSE Build Service is sponsored by