Regexp::Bind - Bind variables to captured buffers
use Regexp::Bind qw( bind global_bind bind_array global_bind_array );
$record = bind($string, $regexp, @fields);
@record = global_bind($string, $regexp, @fields);
$record = bind(\$string, $regexp, @fields);
@record = global_bind(\$string, $regexp, @fields);
$record = bind_array($string, $regexp);
@record = global_bind_array($string, $regexp);
$record = bind_array(\$string, $regexp);
@record = global_bind_array(\$string, $regexp);
$record = bind(\$string, $embedded_regexp);
@record = global_bind(\$string, $embedded_regexp);
This module extends Perl's native regexp functionality. It binds anonymous hashes or named variables to matched buffers. Both normal regexp syntax and embedded regexp syntax are supported. You can view it as a tiny data extraction system.
Two types of functions are exported: bind() and global_bind() bind the given fields to captured contents and return anonymous hashes of the fields, while bind_array() and global_bind_array() return anonymous arrays of the captured contents.
In the following examples, you can pass in either a string or a reference to a string.
use Data::Dumper;
$record = bind($string, $regexp, qw(field_1 field_2 field_3)); print Dumper $record;
$record = bind_array($string, $regexp); print $record->[0];
@record = global_bind($string, $regexp, qw(field_1 field_2 field_3)); print Dumper $_ foreach @record;
@record = global_bind_array($string, $regexp); print $record[0]->[0];
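To make the return shapes concrete, here is a minimal sketch in plain Perl of what binding fields to capture buffers amounts to. It is not the module's implementation: sketch_bind and sketch_global_bind are hypothetical names, the sample string and pattern are made up, and string references are not handled.

```perl
use strict;
use warnings;

# Hypothetical stand-in for bind(): pair each field name with the capture
# buffer in the same position and return the result as a hash reference.
sub sketch_bind {
    my ($string, $regexp, @fields) = @_;
    my @captures = $string =~ $regexp or return;
    my %record;
    @record{@fields} = @captures;
    return \%record;
}

# Hypothetical stand-in for global_bind(): repeat the match with /g and
# collect one hash reference per match.
sub sketch_global_bind {
    my ($string, $regexp, @fields) = @_;
    my @records;
    while ($string =~ /$regexp/g) {
        # Rebuild the capture list from the match offsets in @- and @+
        # (assumes every group took part in the match).
        my @captures = map { substr($string, $-[$_], $+[$_] - $-[$_]) } 1 .. $#+;
        my %record;
        @record{@fields} = @captures;
        push @records, \%record;
    }
    return @records;
}

my $first = sketch_bind('a=1 b=2', qr/(\w)=(\d)/, qw(key value));
print "$first->{key} => $first->{value}\n";

print "$_->{key} => $_->{value}\n"
    for sketch_global_bind('a=1 b=2', qr/(\w)=(\d)/, qw(key value));
```

The array variants can be pictured the same way, returning the capture list itself instead of a hash keyed by field name.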
To use named variable binding, set $Regexp::Bind::USE_NAMED_VAR to a non-undef value; matched parts are then bound to named variables when you call bind(). Named variable binding is not supported by global_bind(), bind_array(), or global_bind_array().
$Regexp::Bind::USE_NAMED_VAR = 1; bind($string, $regexp, qw(field_1 field_2 field_3)); print "$field_1 $field_2 $field_3\n";
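One way to picture named variable binding is as assignment to package variables in the caller's namespace. The sketch below assumes that mechanism; the module may well do it differently. sketch_bind_named, $name, and $age are hypothetical names used only for illustration.

```perl
use strict;
use warnings;

# Hypothetical helper: copy each capture into a package variable of the
# caller, named after the corresponding field.
sub sketch_bind_named {
    my ($string, $regexp, @fields) = @_;
    my @captures = $string =~ $regexp or return;
    my $package = caller;
    no strict 'refs';    # symbolic references are needed to reach the variables by name
    ${"${package}::$_"} = shift @captures for @fields;
    return 1;
}

# Under "use strict" the target variables must be declared as package variables.
our ($name, $age);
sketch_bind_named('alice 30', qr/(\w+) (\d+)/, qw(name age));
print "$name $age\n";
```

Because this writes into the caller's namespace, a field name that collides with an existing package variable would silently overwrite it.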
Embedded regexp syntax lets you embed field names directly in the regexp itself. The syntax exploits Perl's in-line regexp comments, (?#...).
The module first checks whether embedded syntax is used. If it is, the comments are stripped and the regexp is turned back into a plain one.
For the sake of simplicity and legibility, field names in embedded syntax are restricted to alphanumerics only. bind_array() and global_bind_array() do not support embedded syntax.
Example:
bind($string, qr'# (?#<field_1>\w+) (?#<field_2>\d+)\n'm);
is converted into
bind($string, qr'# (\w+) (\d+)\n'm);
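The conversion above can be approximated with a single substitution. This is a sketch under assumptions, not the module's code: strip_embedded_fields is a hypothetical helper, and it works on a pattern string rather than a compiled qr// object.

```perl
use strict;
use warnings;

# Hypothetical helper: turn each "(?#<name>" into "(" and collect the
# field names in the order their groups appear.
sub strip_embedded_fields {
    my ($pattern) = @_;
    my @fields;
    $pattern =~ s/\(\?#<(\w+)>/push @fields, $1; '('/ge;
    return ($pattern, @fields);
}

my ($plain, @fields) =
    strip_embedded_fields('# (?#<field_1>\w+) (?#<field_2>\d+)\n');

print "$plain\n";     # the pattern with the field markers removed
print "@fields\n";    # the field names, in capture order
```

The returned names line up one-to-one with the capture groups, so they can be used as the field list for an ordinary match.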
If embedded syntax is detected, any further input arguments are ignored. This means that
bind($string, qr'# (?#<field_1>\w+) (?#<field_2>\d+)\n'm, qw(field_1 field_2));
is the same as
bind($string, qr'# (?#<field_1>\w+) (?#<field_2>\d+)\n'm);
and conceptually equal to
bind($string, qr'# (\w+) (\d+)\n'm, qw(field_1 field_2));
Note that the module simply replaces (?#<field name> with ( and binds the field's name to the buffer. It does not check the syntax for correctness, so any fancier usage may crash.
Inline filtering works with embedded syntax. Each matched part is saved in $_, and you can apply a simple transformation inside the braces before the value is exported.
bind($string, qr'# (?#<field_1>{ s/\s+//, $_ }\w+) (?#<field_2>{ $_*= 10, $_ }\d+)\n'm);
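The filter step can be pictured as evaluating the code inside the braces with the captured text in $_ and exporting the resulting value. The sketch below assumes that behavior; apply_filter is a hypothetical helper, and the input values are made up.

```perl
use strict;
use warnings;

# Hypothetical helper: evaluate a filter expression with the captured
# text available in $_ and return the expression's value.
sub apply_filter {
    my ($captured, $filter_code) = @_;
    local $_ = $captured;
    my $result = eval $filter_code;
    die "filter failed: $@" if $@;
    return $result;
}

# The two filters from the example above, applied to sample captures.
print apply_filter('a b', 's/\s+//, $_'), "\n";    # prints "ab"
print apply_filter('7', '$_ *= 10, $_'), "\n";     # prints "70"
```

Each filter ends with ", $_" so that the expression evaluates to the transformed text rather than to the return value of the substitution or assignment. String eval runs arbitrary code, so filters like these should only come from patterns you wrote yourself.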
For similar functionality, see Regexp::Fields.
See also Template::Extract and WWW::Extractor. They are similar projects that use prettier templates instead of low-level regexps.
You may want to check test.pl for an example, too.
Perhaps, I'll add a 'FOREACH' directive like that in Template::Extract.
Copyright (C) 2004 by Yung-chung Lin (a.k.a. xern) <xern@cpan.org>
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.