Package Info

perl-HTML-SimpleParse


a bare-bones HTML parser


Development/Libraries/Perl

This is the HTML::SimpleParse module. It is a bare-bones HTML parser, similar to HTML::Parser, but with a couple of important distinctions:

First, HTML::Parser knows which tags can contain other tags, which start tags have corresponding end tags, which tags can exist only in the <HEAD> portion of the document, and so forth. HTML::SimpleParse does not know any of these things. It just finds tags and text in the HTML you give it; it does not care about the specific content of these tags (though it does distinguish between different types of tags, such as comments, starting tags like <b>, ending tags like </b>, and so on).

Second, HTML::SimpleParse does not create a hierarchical tree of HTML content, but rather a simple linear list. It does not pay any attention to balancing start tags with corresponding end tags, or which pairs of tags are inside other pairs of tags.
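The flat-list idea described above can be illustrated with a small, language-neutral sketch. The Python below is not the module's implementation or its API: `simple_parse`, `classify`, and the type names are hypothetical, chosen only to show markup and text being collected in document order with no attempt to balance or nest tags.

```python
import re

# Anything that looks like markup; whatever lies between matches is text.
# The comment alternative comes first so "<!-- a > b -->" stays in one piece.
TAG_RE = re.compile(r"<!--.*?-->|<[^>]*>", re.DOTALL)


def classify(chunk):
    """Label a piece of markup by its leading characters only (hypothetical names)."""
    if chunk.startswith("<!--"):
        return "comment"
    if chunk.startswith("</"):
        return "endtag"
    if chunk.startswith("<!"):
        return "markup"
    return "starttag"


def simple_parse(html):
    """Return a flat list of {"type", "content"} dicts in document order."""
    pieces = []
    pos = 0
    for match in TAG_RE.finditer(html):
        if match.start() > pos:
            pieces.append({"type": "text", "content": html[pos:match.start()]})
        pieces.append({"type": classify(match.group()), "content": match.group()})
        pos = match.end()
    if pos < len(html):
        pieces.append({"type": "text", "content": html[pos:]})
    return pieces


if __name__ == "__main__":
    # The tags are deliberately unbalanced; the sketch neither notices nor cares.
    for piece in simple_parse("<p>Hi <b>there</p></b><!-- note -->"):
        print(piece["type"], repr(piece["content"]))
```

Because nothing is nested, joining the `content` fields back together reproduces the input exactly, which is what makes this kind of list convenient for pass-through filtering.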

Because of these characteristics, you can make a very effective HTML filter by sub-classing HTML::SimpleParse.
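To make the filtering idea concrete, here is a minimal sketch of the subclassing pattern, again in Python rather than Perl. `FlatParser`, `on_text`, `on_tag`, and `AllowListFilter` are hypothetical names invented for this illustration and are not HTML::SimpleParse methods; the assumption is only that a flat parser hands each piece to an overridable hook.

```python
import re


class FlatParser:
    """Hypothetical stand-in for a flat, non-hierarchical HTML parser."""

    TAG_RE = re.compile(r"<!--.*?-->|<[^>]*>", re.DOTALL)

    def render(self, html):
        """Walk the input once, passing every piece through a hook."""
        out = []
        pos = 0
        for match in self.TAG_RE.finditer(html):
            out.append(self.on_text(html[pos:match.start()]))
            out.append(self.on_tag(match.group()))
            pos = match.end()
        out.append(self.on_text(html[pos:]))
        return "".join(out)

    def on_text(self, text):
        return text  # default: pass text through unchanged

    def on_tag(self, tag):
        return tag  # default: pass markup through unchanged


class AllowListFilter(FlatParser):
    """Keep only a few inline tags; drop all other markup, including comments."""

    ALLOWED = {"b", "i", "em", "strong"}

    def on_tag(self, tag):
        match = re.match(r"</?\s*([A-Za-z][A-Za-z0-9]*)", tag)
        name = match.group(1).lower() if match else ""
        return tag if name in self.ALLOWED else ""


if __name__ == "__main__":
    print(AllowListFilter().render('<p class="x">Hi <b>there</b><!-- c --></p>'))
```

The base class returns its input unchanged, so a subclass only has to override the hook for the pieces it wants to alter. This sketch does not escape text or validate attributes, so it should not be read as a security sanitizer.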

Authors:

Ken Williams <ken@forum.swarthmore.edu>

License: Artistic-1.0
URL: http://search.cpan.org/perldoc?HTML::SimpleParse

Categories

Releases

Package Version | Update ID | Released | Package Hub Version | Platforms | Subpackages
0.12-bp155.268.9 | GA Release | 2023-05-17 | 15 SP5 | AArch64, ppc64le, s390x, x86-64 | perl-HTML-SimpleParse
0.12-bp154.267.18 | GA Release | 2022-05-09 | 15 SP4 | AArch64, ppc64le, s390x, x86-64 | perl-HTML-SimpleParse
0.12-bp153.266.14 | GA Release | 2021-03-06 | 15 SP3 | AArch64, ppc64le, s390x, x86-64 | perl-HTML-SimpleParse
0.12-bp152.268.13 | GA Release | 2020-04-16 | 15 SP2 | AArch64, ppc64le, s390x, x86-64 | perl-HTML-SimpleParse
0.12-bp151.268.1 | GA Release | 2019-07-17 | 15 SP1 | AArch64, s390x, x86-64 | perl-HTML-SimpleParse
0.12-bp151.267.12 | GA Release | 2019-05-18 | 15 SP1 | ppc64le | perl-HTML-SimpleParse
0.12-bp150.266.3 | GA Release | 2018-07-30 | 15 | AArch64, ppc64le, s390x, x86-64 | perl-HTML-SimpleParse