MHonArc::UTF8 - UTF-8 routines for MHonArc
<CharsetConverters override> plain; mhonarc::htmlize; default; MHonArc::UTF8::str2sgml; MHonArc/UTF8.pm </CharsetConverters> <TextClipFunc> MHonArc::UTF8::clip; MHonArc/UTF8.pm </TextClipFunc>
MHonArc::UTF8 provides UTF-8 related routines for use in MHonArc. The main use of the routines provided is to generate mail archives encoded in Unicode UTF-8.
MHonArc::UTF8::to_utf8($data, $from_charset, $to_charset)
$data encoded in
$from_charset into UTF-8.
$to_charset is ignored since it assumed to be
This function is designed to be registered to the TEXTENCODE resource:
<TextEncode> utf-8; MHonArc::UTF8::to_utf8; MHonArc/UTF8.pm </TextEncode>
This function is designed to be registered to the CHARSETCONVERTERS resource:
<CharsetConverters override> plain; mhonarc::htmlize; us-ascii; mhonarc::htmlize; default; MHonArc::UTF8::str2sgml; MHonArc/UTF8.pm </CharsetConverters>
All data passed in is converted to utf-8 with HTML specials converted into entity references.
MHonArc::UTF8::clip($text, $clip_len, $is_html, $has_tags)
This function is designed to be registered to the TEXTCLIPFUNC resource to have utf-8 strings safely clipped in resource variable expansion:
<TextClipFunc> MHonArc::UTF8::clip; MHonArc/UTF8.pm </TextClipFunc>
Note: Since it is unclear about the future of Unicode::MapUTF8, it is possible that support for it may be dropped in the future. It appears to not have been updated in awhile since Perl's Encode module will probably become the standard module to use for handling text encodings.
The CHARSETCONVERTERS, TEXTCLIPFUNC, and TEXTENCODE resources in the MHonArc documentation.
$Id: UTF8.pm,v 1.6 2003/03/05 22:17:15 ehood Exp $
Earl Hood, email@example.com
MHonArc comes with ABSOLUTELY NO WARRANTY and MHonArc may be copied only under the terms of the GNU General Public License, which may be found in the MHonArc distribution.