Andy Lester > Lingua-JP-Kanjidic > Lingua::JP::Kanjidic

Download:
Lingua-JP-Kanjidic-1.1.tar.gz

Dependencies

Annotate this POD

CPAN RT

New  2
Open  0
View/Report Bugs
Module Version: 1.1   Source  

NAME ^

Lingua::JP::Kanjidic - Parse Jim Breen's kanji dictionary

SYNOPSIS ^

  use Lingua::JP::Kanjidic;
  system("wget http://ftp.monash.edu.au/pub/nihongo/kanjidic.gz");
  # Please see the license at
  # http://www.csse.monash.edu.au/groups/edrdg/licence.html
  system("gunzip kanjidic.gz");

  my $x = Lingua::JP::Kanjidic->new();

DESCRIPTION ^

This module parses the kanjidic file for information about individual Japanese kanji characters. It can be used as a random-access reader or as an iterator.

METHODS

new

Creates a new kanjidic reader; you must pass in the path to a kanjidic file, or it assumes the "kanjidic" file in the current directory.

reset

Resets the iterator to the beginning of file.

next

Returns the next line in the file as a Lingua::JP::Kanjidic::Kanji object (see below).

return_line($i)

Returns line $i in the file; does not affect the iterator. The line is returned as a Lingua::JP::Kanjidic::Kanji object.

lookup($kanji)

Looks up a particular kanji, returning the ::Kanji object. May be slow. The kanji should be specified as a Unicode character.

KANJI OBJECT METHODS

The following methods are available on ::Kanji objects.

kanji

The Unicode character itself.

jis

The JIS encoding, as a hex string.

unicode

The Unicode codepoint for the character, as a hex string.

nelson

The index in Nelson Modern Reader's Japanese-English Character Dictionary.

radical_nelson

The radical number, as given in Nelson.

radical

The classical radical number.

grade

The school grade in which this kanji is learnt.

strokes

The number of strokes in the kanji

halpern

The index in Halpern New Japanese-English Character Dictionary.

frequency

The frequency index of this kanji's occurrence.

new_nelson

The index number in The New Nelson Japanese-English Character Dictionary.

henshall

The index number used in "A Guide To Remembering Japanese Characters" by Kenneth G. Henshall.

gakken

The index number in the Gakken Kanji Dictionary ("A New Dictionary of Kanji Usage").

heiseg

The index number used in "Remembering The Kanji" by James Heisig.

oneill

The index number in "Japanese Names", by P.G. O'Neill.

morohashi

The index number in the 13-volume Morohashi Daikanwajiten.

tuttle

The index number in The Kanji Dictionary (Tuttle 1996).

skip

Jack Halpern's SKIP code. Note that separate copyrights apply to commercial utilization of this code.

morohashi_page

The volume and page number in the Morohashi Daikanwajiten.

four_corner

Wang Chen's four corner code.

korean / pinyin

An array reference of the Korean and Chinese readings of the kanji

meaning

An array reference of the English meanings of the kanji.

hiragana

Kun-yomi readings for the character, returned as an array reference of Unicode strings.

katakana

On-yomi readings for the character, returned as an array reference of Unicode strings.

joyo

A binary flag indicating whether or not the kanji is joyo.

SEE ALSO ^

http://www.csse.monash.edu.au/~jwb/kanjidic_doc.html

AUTHOR ^

Simon Cozens, <simon@cpan.org>

COPYRIGHT AND LICENSE ^

Copyright (C) 2004 by Simon Cozens

syntax highlighting: