MHonArc::CharEnt - HTML Character routines for MHonArc.
    use MHonArc::CharEnt;

MHonArc resource file:

    <CharsetConverters>
    ...
    iso-8859-15; MHonArc::CharEnt::str2sgml; MHonArc/CharEnt.pm
    ...
    </CharsetConverters>
MHonArc::CharEnt provides the main character conversion routine used by MHonArc for converting non-ASCII encoded message header data and text/plain character data into HTML. The module was originally written to support only 8-bit charsets; it has since been extended to support multibyte charsets.
All characters are mapped to HTML 4.0 character entity references (e.g. &lt; &gt;) or to Unicode numeric character entity references (e.g. &#x203E;). Most modern browsers support the Unicode references directly.
UTF-8 conversion is done algorithmically.
Because non-ASCII characters are written as entity references, the raw HTML source for non-English text is difficult to read, but this may be a non-issue for most users.
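The mapping described above can be illustrated with a minimal sketch. It is written in Python purely for illustration; it is not the module's Perl code, and the names decode_utf8, utf8_to_html, and SPECIALS are hypothetical. It assumes that "algorithmically" means decoding UTF-8 byte sequences with bit arithmetic rather than lookup tables, and it assumes uppercase hexadecimal numeric references padded to four digits; the real module's output format may differ.

```python
# Illustrative sketch only: MHonArc::CharEnt is Perl and may behave differently.

# Characters that are special in HTML, mapped to named entity references.
SPECIALS = {ord("<"): "&lt;", ord(">"): "&gt;", ord("&"): "&amp;", ord('"'): "&quot;"}


def decode_utf8(data: bytes):
    """Yield code points from UTF-8 bytes using bit arithmetic only.

    Overlong encodings and surrogate code points are not rejected here.
    """
    i = 0
    while i < len(data):
        lead = data[i]
        if lead < 0x80:              # 0xxxxxxx: one-byte (ASCII) sequence
            cp, extra = lead, 0
        elif lead >> 5 == 0b110:     # 110xxxxx: two-byte sequence
            cp, extra = lead & 0x1F, 1
        elif lead >> 4 == 0b1110:    # 1110xxxx: three-byte sequence
            cp, extra = lead & 0x0F, 2
        elif lead >> 3 == 0b11110:   # 11110xxx: four-byte sequence
            cp, extra = lead & 0x07, 3
        else:
            raise ValueError(f"invalid UTF-8 lead byte at offset {i}")
        tail = data[i + 1 : i + 1 + extra]
        # Every continuation byte must look like 10xxxxxx.
        if len(tail) != extra or any(b >> 6 != 0b10 for b in tail):
            raise ValueError(f"invalid UTF-8 sequence at offset {i}")
        for b in tail:
            cp = (cp << 6) | (b & 0x3F)  # append six payload bits
        yield cp
        i += 1 + extra


def utf8_to_html(data: bytes) -> str:
    """Return ASCII-only HTML text for UTF-8 encoded input."""
    out = []
    for cp in decode_utf8(data):
        if cp in SPECIALS:
            out.append(SPECIALS[cp])       # named entity reference
        elif cp < 0x80:
            out.append(chr(cp))            # plain ASCII passes through
        else:
            out.append(f"&#x{cp:04X};")    # numeric character reference
    return "".join(out)
```

In this sketch, utf8_to_html("a<b".encode("utf-8")) returns a&lt;b, and the UTF-8 bytes for the overline character (E2 80 BE) become &#x203E;. The result is pure ASCII, which is what makes the raw source portable but harder to read.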
$Id: CharEnt.pm,v 1.17 2010/12/31 18:23:02 ehood Exp $
Earl Hood, firstname.lastname@example.org
MHonArc comes with ABSOLUTELY NO WARRANTY and MHonArc may be copied only under the terms of the GNU General Public License, which may be found in the MHonArc distribution.