HTML::TreeBuilder - Parser that builds an HTML syntax tree
use HTML::TreeBuilder;
$h = HTML::TreeBuilder->new;
$h->parse($document);
#...
print $h->as_HTML; # or any other HTML::Element method
This is a parser that builds (and in fact itself is) an
HTML syntax tree.
Objects of this class inherit the methods of both HTML::Parser and
HTML::Element. After parsing has taken place, the object can be regarded
as the syntax tree itself.
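As a rough illustration of the dual nature described above, the following sketch parses a small fragment and then treats the same object as an element. The sample markup is made up, and the call to eof() assumes a release of the module that provides it for signalling end of input.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

my $h = HTML::TreeBuilder->new;
$h->parse('<title>Demo</title><p>Hello, <b>world</b>');
$h->eof;    # assumption: this release provides eof() to finish parsing

# The same object answers to both parent classes.
my $is_parser  = $h->isa('HTML::Parser')  ? 1 : 0;
my $is_element = $h->isa('HTML::Element') ? 1 : 0;

# HTML::Element methods can be called on the tree directly.
my $root_tag = $h->tag;    # tag name of the root element

print "parser=$is_parser element=$is_element root=$root_tag\n";
```

No separate tree object has to be fetched from the parser: the builder is the root element.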
The following methods all control how parsing takes place. You can set each
attribute by passing a TRUE or FALSE value as argument.
- $p->implicit_tags
-
Setting this attribute to true instructs the parser to try to deduce
implicit elements and implicit end tags. If it is false, you get a parse
tree that just reflects the text as it stands, which might be useful for
quick and dirty parsing. Default is true.
Implicit elements have the implicit() attribute set.
- $p->ignore_unknown
-
This attribute controls whether unknown tags are ignored or represented as
elements in the parse tree. When true, unknown tags are ignored. Default
is true.
- $p->ignore_text
-
When true, the text content of elements is not represented in the tree.
This saves space if all you want is to examine the structure of the
document. Default is false.
- $p->warn
-
Call warn() with an appropriate message for syntax errors. Default is false.
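The attributes above might be combined as in the following minimal sketch, which sets all four before parsing and keeps only the document structure. The markup and the tag name blink2 are invented for the example, and eof() is assumed to be available in the installed release.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

my $p = HTML::TreeBuilder->new;
$p->implicit_tags(1);     # deduce implicit elements and end tags (the default)
$p->ignore_unknown(1);    # leave unknown tags out of the tree (the default)
$p->ignore_text(1);       # keep structure only, drop text content
$p->warn(0);              # stay quiet about syntax errors (the default)

# <blink2> is a made-up tag, used here only to exercise ignore_unknown.
$p->parse('<title>Demo</title><p>Hello <blink2>there</blink2>');
$p->eof;                  # assumption: this release provides eof()

my $html = $p->as_HTML;
print $html, "\n";
```

Setting the attributes before the first call to parse() keeps the whole tree consistent, since they affect how each tag and text segment is handled as it arrives.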
See also: HTML::Parser, HTML::Element
Copyright 1995-1996 Gisle Aas. All rights reserved.
This library is free software; you can redistribute it and/or modify it
under the same terms as Perl itself.
Gisle Aas