Matt Sergeant > XML-QL-0.07 > XML::QL

Download:
XML-QL-0.07.tar.gz

Dependencies

Annotate this POD

Related Modules

XML::XQL
Data::Dumper
XML::PYX
DBIx::XML_RDB
more...
By perlmonks.org

CPAN RT

New  1
Open  0
View/Report Bugs
Module Version: 0.07   Source  

NAME ^

XML::QL - An XML query language

VERSION ^

0.07

SYNOPSIS ^

$ql = 'WHERE <head>$head</head> ORDER-BY $head IN "file:REC-xml-19980210.xml" CONSTRUCT $head';

print XML::QL->query($sql);

DESCRIPTION ^

This module is an early implementation of a note published by the W3C called "XML-QL: A Query Language for XML". XML-QL allows the user to query an XML document much like a database, and describe a construct for output. Currently this module only offers partial functionality as described in the specification, and even some of that has been changed for ease of use. This documentation will describe the fuctionality of this module as well as differences from the XML-QL specification.

METHODS ^

query( "query" )

This is the only method required to use this module. This one method allows the user to pass a valid XML-QL query to the module, and the return value is the output.

XML-QL: The Query Language ^

The basic syntax consists of two parts, a WHERE clause to describe the data to search for, and a CONSTRUCT clause to describe how to return the data that is found.

WHERE

WHERE XML-searchstring [ORDER-BY variable [DESCENDING] [, variable [DESCENDING]] ] IN 'filename'

The WHERE clause can be separated into several parts. The first is the search string, the second is an optional ORDER-BY clause much like ORDER BY in SQL, and last is the required XML document file name. Each of these parts is described below.

XML-searchstring

The search string MUST be a valid XML snippet. This is one are where this module differs from the specification. It has been implemented in this way so that the search string may be parsed by the XML::Parser module.

The first step in building a query is to list the tags to search for in the document. For example, consider the following search string:

        <BOOK>
                <AUTHOR></AUTHOR>
        </BOOK>

This search string will search for the AUTHOR tag nested within a BOOK tag. Note however that no information has been selected for retrieval. In the following example, we actually grab some information:

        <BOOK>
                <AUTHOR>$author</AUTHOR>
        </BOOK>

The variable name $author will grab the information that it finds withing this tag, and makes this information avalable to us for use in the CONSTRUCT section of the query. You will notice that variable names start with a dollar sign ($), as this is called for by the specification. In Perl, this means that if the query is enclosed in double quotes, this dollar sign must be escaped.

In the following example we take it a step further by searching for books of that are non-fiction:

        <BOOK TYPE='non-fiction'>
                <AUTHOR>$author</AUTHOR>
        </BOOK>

We can also express this as a regular expression:

        <BOOK TYPE='non-.*'>
                <AUTHOR>$author</AUTHOR>
        </BOOK>

This is another area where this module differs from the specification. The regular expesssion ability as defined in the specification only allows for a subset of the ability available in a Perl regular expression. With this module, the full range of regular expression syntax has been made available. This also means that you must also escape things such as periods(.), parenthesis (), and brackets ([]). All non tag matched are case insensitive.

Now lets say that besides matching the TYPE, we also wanted to grab the value. Consider this example:

        <BOOK TYPE='non-.* AS_ELEMENT $type'>
                <AUTHOR>$author</AUTHOR>
        </BOOK>

The AS_ELEMENT keyword allows you to save the matched value for later use in the CONSTRUCT portion of the query.

ORDER-BY

The ORDER-BY clause allows to sort the data retrieved in the variables. You may specify multiple variables, and specify DESCENDING for a reverse sort. This clause is not required. For example:

ORDER-BY $type, $author DESCENDING

IN

The IN clause is a required clause that specifies the file name of the XML file. This can be any URI that is supported by LWP, or it can be a single file name enclosed in quotes. In later versions of this module there will be support for multiple files, directories. The following will work:

IN 'REC-xml-19980210.xml'

IN 'file://othermachine/share/filename.xml'

IN 'http://www.example.com/file.xml'

CONSTRUCT

The CONSTRUCT construct allows you to specify a template for output. The template will match character for character from the first space after the word CONSTRUCT to the end of the XML-QL query. For example:

$ql = '(where clause...) CONSTRUCT Type: $type Author: $author';

The ouput of this will then be a carriage return, a tab, "Type: ", the contents of $type, a carriage return, a tab, "Author: ", and the contents of $author. This construct will be repeated for every match found and returned as a single string.

AUTHOR ^

Robert Hanson - Initial Version rhanson@blast.net

Matt Sergeant - Only minor fixes so far msergeant@ndirect.co.uk, sergeant@geocities.com

COPYRIGHT ^

Robert's Original licence was: I hereby reserve NO rights to this module, except for maybe a little recognition if you decide to rewrite it and redistribute as your own. Beyond that, you can do whatever you want with this. I would just appreciate a copy of any improvements to this module.

However that only stands for version 0.01 of the module. All versions above that are released under the same terms as perl itself.

syntax highlighting: