NAME

DOMHandler - Implements a call-back interface to DOM.

SYNOPSIS

  use DOMHandler;
  use XML::LibXML;
  my $p   = XML::LibXML->new;
  my $doc = $p->parse_file( 'data.xml' );
  my $dh  = DOMHandler->new( handler_package => testhandler->new );
  $dh->traverse( $doc );

  package testhandler;
  sub new {
      my $class = shift;
      return bless {}, $class;
  }
  sub A {
      my( $self, $agent, $node ) = @_;
      my $par = $node->parentNode->nodeName;
      print "I'm in an A element and my parent is $par.\n";
  }
  sub generic_element {
      my( $self, $agent, $node ) = @_;
      my $name = $node->nodeName;
      print "I'm in an element named '$name'.\n";
  }
  sub generic_text {
      print "Here's some text.\n";
  }
  sub generic_PI {
      print "Here's a processing instruction.\n";
  }
  sub generic_CDATA {
      print "Here's a CDATA Section.\n";
  }

DESCRIPTION

This module creates a layer on top of DOM that allows you to program in a "push" style rather than "pull". Once the document has been parsed and you have a DOM object, you can call the DOMHandler's traverse() method to apply a set of call-back routines to every node in the tree. You supply the routines in a handler package when initializing the DOMHandler.
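For contrast, here is a minimal sketch of the "pull" style using XML::LibXML alone: the caller queries the tree for the nodes it wants and loops over them itself. The inline document and the @parents array are illustrative assumptions, not part of this module.

```perl
use strict;
use warnings;
use XML::LibXML;

# Illustrative document; any parsed XML::LibXML document would do.
my $doc = XML::LibXML->load_xml( string => '<root><A/><B><A/></B></root>' );

# "Pull" style: ask the tree for every A element, then process each one.
my @parents;
for my $node ( $doc->findnodes('//A') ) {
    push @parents, $node->parentNode->nodeName;
}

print "I'm in an A element and my parent is $_.\n" for @parents;
```

In the "push" style, the same logic would live in an A() routine of the handler package, as in the SYNOPSIS, and the traversal would decide when to call it.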

In your handler package, the names of the routines determine which ones are called for a given node. Routines for node types are named "generic_" followed by the node type. For elements, you can also name a routine after the element itself, and it will be called only for elements with that name. A list of supported handlers follows:

else_generic_node()

Applied only to nodes that have not been handled by another routine.

generic_CDATA()

Applied to CDATA sections.

generic_comment()

Applied to XML comments.

generic_doctype()

Applied to DOCTYPE declarations.

generic_element()

Applied to all elements.

generic_node()

Applied to all nodes.

generic_PI()

Applied to processing instructions.

generic_text()

Applied to text nodes.

A handler routine takes three arguments: the $self reference, a reference to the DOMHandler object, and a reference to a node in the document being traversed. You can use DOM routines on that node to do any processing you want. At the moment, this module only supports XML::LibXML documents.
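The name-based dispatch described above can be illustrated with a small, dependency-free sketch. It is not the module's actual source: MockElement, MyHandlers and dispatch_element() are hypothetical names, and the order in which the element-specific and generic routines run is an assumption made for the example.

```perl
use strict;
use warnings;

# Hypothetical stand-in for a DOM element; real nodes come from XML::LibXML.
package MockElement;
sub new      { my ( $class, $name ) = @_; return bless { name => $name }, $class; }
sub nodeName { return $_[0]{name}; }

# A handler package that records which routines were called.
package MyHandlers;
sub new { my $class = shift; return bless { log => [] }, $class; }
sub A {
    my ( $self, $agent, $node ) = @_;
    push @{ $self->{log} }, 'A';
}
sub generic_element {
    my ( $self, $agent, $node ) = @_;
    push @{ $self->{log} }, 'generic_element:' . $node->nodeName;
}

package main;

# Assumed rule for elements: call the routine named after the element if the
# handler defines one, then generic_element if the handler defines that.
sub dispatch_element {
    my ( $agent, $handler, $node ) = @_;
    for my $name ( $node->nodeName, 'generic_element' ) {
        my $method = $handler->can($name) or next;
        $handler->$method( $agent, $node );
    }
}

my $handler = MyHandlers->new;
dispatch_element( undef, $handler, MockElement->new($_) ) for qw( A B );
print "$_\n" for @{ $handler->{log} };
```

Because routines are looked up with can(), a handler package only needs to define the routines it cares about; anything it omits is simply skipped in this sketch.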

IMPORTANT NOTE: Some DOM operations may cause unwanted results. For example, if you delete the current node's parent, the program will likely crash.
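One way to avoid modifying the tree while it is being traversed is to have the handler only record the nodes of interest, and to change the document after traverse() returns. The sketch below assumes the constructor and handler interface shown in the SYNOPSIS; the CommentCollector package and its "doomed" list are hypothetical names chosen for illustration.

```perl
use strict;
use warnings;
use XML::LibXML;
use DOMHandler;

# Hypothetical handler package: remembers comment nodes instead of deleting them.
package CommentCollector;
sub new { my $class = shift; return bless { doomed => [] }, $class; }
sub generic_comment {
    my ( $self, $agent, $node ) = @_;
    push @{ $self->{doomed} }, $node;    # defer removal until traversal is done
}

package main;

my $doc = XML::LibXML->load_xml( string => '<r><!-- remove me --><a>text</a></r>' );
my $collector = CommentCollector->new;
DOMHandler->new( handler_package => $collector )->traverse( $doc );

# The traversal has finished, so detaching nodes can no longer disturb it.
$_->unbindNode for @{ $collector->{doomed} };
print $doc->toString;
```

The same pattern applies to any structural change (moving, replacing or inserting nodes): collect during the traversal, apply afterwards.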

METHODS

traverse( $doc )

Visits each node in a document, in order, applying the appropriate handler routines.

AUTHOR

Erik Ray (eray@oreilly.com), Production Tools Dept., O'Reilly and Associates Inc.

COPYRIGHT

Copyright (c) 2002 Erik Ray and O'Reilly & Associates.
