Point users at easier HTML / XML modules.
FAQ readers are generally going to be new to Perl (and possibly
programming) so it behooves us to give them the easiest to use resources
we can. Only if they have a demonstrated need for more power should we
point them to lower level tools.
Loading branch information
2 changed files
and 6 deletions.
@@ -160,11 +160,11 @@ X<sucking out, will to live>
(contributed by brian d foy)
If you just want to get work done, use a module and forget about the
regular expressions. The L<XML:: Parser> and L<HTML:: Parser> modules
are good starts, although each namespace has other parsing modules
specialized for certain tasks and different ways of doing it. Start at
CPAN Search ( L<http:// search.cpan.org/> ) and wonder at all the work people
have done for you already! :)
regular expressions. The L<XML:: LibXML>, L<HTML:: TokeParser> and
L<HTML::TreeBuilder> modules are good starts, although each namespace
has other parsing modules specialized for certain tasks and different
ways of doing it. Start at CPAN Search ( L<http:// metacpan.org/> )
and wonder at all the work people have done for you already! :)
The problem with things such as XML is that they have balanced text
containing multiple levels of balanced text, but sometimes it isn't
@@ -63,7 +63,7 @@ but for anything other than a very basic web site, using a web framework
=head2 How do I remove HTML from a string?
Use L<HTML:: Parser>, or L<HTML::FormatText> which not only removes HTML
Use L<HTML:: Strip>, or L<HTML::FormatText> which not only removes HTML
but also attempts to do a little simple formatting of the resulting