
Unicode::Properties - find out what properties a character has

Version 0.03

use utf8; # needed so that the literal '☺' below is read as one character
use Unicode::Properties 'uniprops';
my @prop_list = uniprops ('☺'); # Unicode smiley face, U+263A
print "@prop_list\n";
This prints:
InMiscellaneousSymbols Any Assigned Common
You can then use, for example, \p{InMiscellaneousSymbols} to match this character in a regular expression.
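
As a minimal sketch of that use, the following counts and removes characters from the Miscellaneous Symbols block using only core Perl. The variable names and the sample string are illustrative assumptions, not part of this module.

```perl
use strict;
use warnings;

# Build the sample text from a code point, so no "use utf8" is needed here.
my $smiley = "\x{263A}";            # WHITE SMILING FACE
my $text   = "Hello $smiley world";

# Count the characters in the Miscellaneous Symbols block.
my $count = () = $text =~ /\p{InMiscellaneousSymbols}/g;

# Remove those characters from a copy of the text.
(my $stripped = $text) =~ s/\p{InMiscellaneousSymbols}//g;

print "matched $count character(s)\n";
```

Writing the character as "\x{263A}" avoids depending on the encoding of the source file.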

Exports a function "uniprops" on request.

Given a character, returns a list of properties which the character has.
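
One way such a lookup can work is to test the character against each name in a list of candidate properties. The sketch below is an illustration only, not this module's actual code: the function name "props_of_sketch" and the short candidate list are hypothetical.

```perl
use strict;
use warnings;

# Hypothetical, deliberately short candidate list. A real list would be
# much longer.
my @candidates = qw(InMiscellaneousSymbols Any Assigned Common Latin Balinese);

# Return the candidate properties that the given character has.
sub props_of_sketch {
    my ($char) = @_;
    my @found;
    for my $name (@candidates) {
        my $pattern = '\p{' . $name . '}';
        # The eval guards against property names that this Perl's Unicode
        # tables do not know, which would otherwise be a fatal error.
        my $matched = eval { $char =~ /$pattern/ ? 1 : 0 };
        push @found, $name if $matched;
    }
    return @found;
}

my @props = props_of_sketch ("\x{263A}");
print "@props\n";
```

Because unknown property names are skipped rather than fatal, the same candidate list can be tried on Perls with different Unicode versions.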

Ben Kasmin Bullock, <benkasminbullock at gmail.com>

Please report any bugs or feature requests to bug-unicode-properties at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=Unicode-Properties. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.

You can find documentation for this module with the perldoc command.
perldoc Unicode::Properties

This module uses a list of property names taken from the "perlunicode" documentation. It would be better to get the list from Perl's internals, but I don't know how to do that.
The results depend on your Perl version and the Unicode version it supports. For example, "Balinese" was added in Unicode version 5.0.0. An unpatched Perl 5.8.8 supports Unicode version 4.1.0, so with that Perl "Balinese" will not appear in the results list.
Also, I don't know the behaviour of Unicode versions other than 4.1.0 and 5.0.0, so this module only covers those two. I couldn't get Perl 5.8.5 to install on my computer, so I've set the minimum version to 5.8.8 for this module.
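
To see which case applies to you, a short check like the following may help. It is a sketch using the core Unicode::UCD module. The choice of U+1B05 (BALINESE LETTER AKARA) as a test character is an assumption made for illustration.

```perl
use strict;
use warnings;
use Unicode::UCD ();

# Report the Unicode version that this Perl's tables were built from.
my $unicode_version = Unicode::UCD::UnicodeVersion ();
print "Perl $] uses Unicode $unicode_version\n";

# Check whether the "Balinese" property is known to this Perl. The pattern
# is built at run time and wrapped in eval, so an unknown property name
# gives a false result instead of a fatal error.
my $pattern = '\p{Balinese}';
my $has_balinese = eval { "\x{1B05}" =~ /$pattern/ ? 1 : 0 } || 0;
print "Balinese is ", ($has_balinese ? "" : "not "), "supported\n";
```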

Copyright 2008 Ben Kasmin Bullock, all rights reserved.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.