Andrea Benazzo > Lingua-AR-Word-1.5.7 > Lingua::AR::Word

Download:
Lingua-AR-Word-1.5.7.tar.gz

Dependencies

Annotate this POD

CPAN RT

Open  1
View/Report Bugs
Module Version: 1.5.7   Source  

NAME ^

Lingua::AR::Word - Perl extension to get the stem and ArabTeX encoding of Arabic words

SYNOPSIS ^

        use utf8;
        use Lingua::AR::Word;

        my $word=Lingua::AR::Word->new("ARABIC_WORD_IN_UTF8");

        open FOUTPUT, ">>:utf8", "output" or die "Cannot create output file: $!\n";
        

        print FOUTPUT $word->get_word();
        print FOUTPUT $word->get_stem();
        print FOUTPUT $word->get_arabtex();

        close FOUTPUT;

DESCRIPTION ^

In order to work on an Arabic word, you need to create the object Lingua::AR::Word, passing the Arabic word encoded in utf8 to the constructor. You will then be able to get the stem through get_stem(). You will get the ArabTeX translittered form through get_arabtex().

Remember that input-output directly to shell will not be useful as long as your shell doesn't support utf8 encoded characters. In the example above, for example, I piped the output to another file forcing its writing in utf8.

SEE ALSO ^

If you also want the translation, check out the Lingua::AR::Db module. You may find more info about ArabTeX encoding at ftp://ftp.informatik.uni-stuttgart.de/pub/arabtex/arabtex.htm

TODO ^

Add function which returns the Arabic form of a translitterated word.
Add function which analyzes the arabic word and returns info about it: gender, {gender,number} of person/people it refers to,..

AUTHOR ^

Andrea Benazzo, <andy@slacky.it>

COPYRIGHT AND LICENSE ^

Copyright (c) 2006 Andrea Benazzo. All rights reserved. This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

syntax highlighting: