<html>
<head>
<title>MHonArc Resources: CHARSETALIASES</title>
<link rel="stylesheet" type="text/css" href="../docstyles.css">
</head>
<body>
<!--x-rc-nav-->
<table border=0><tr valign="top">
<td align="left" width="50%">[Prev: <a href="botlinks.html">BOTLINKS</a>]</td><td><nobr>[<a href="../resources.html#charsetaliases">Resources</a>][<a href="../mhonarc.html">TOC</a>]</nobr></td><td align="right" width="50%">[Next: <a href="charsetconverters.html">CHARSETCONVERTERS</a>]</td></tr></table>
<!--/x-rc-nav-->
<hr>
<h1>CHARSETALIASES</h1>
<!--X-TOC-Start-->
<ul>
<li><a href="#syntax">Syntax</a>
<li><a href="#description">Description</a>
<li><a href="#default">Default Setting</a>
<li><a href="#rcvars">Resource Variables</a>
<li><a href="#examples">Examples</a>
<li><a href="#version">Version</a>
<li><a href="#seealso">See Also</a>
</ul>
<!--X-TOC-End-->
<!-- *************************************************************** -->
<hr>
<h2><a name="syntax">Syntax</a></h2>
<dl>
<dt><strong>Envariable</strong></dt>
<dd><p>N/A.
</p>
</dd>
<dt><strong>Element</strong></dt>
<dd><p>
<code><CHARSETALIASES></code><br>
<var>charset-name; alias, ...<br>
...</var><br>
<code></CHARSETALIASES></code><br>
</p>
</dd>
<dt><strong>Command-line Option</strong></dt>
<dd><p>N/A.
</p>
</dd>
</dl>
<!-- *************************************************************** -->
<hr>
<h2><a name="description">Description</a></h2>
<p>CHARSETALIASES defines aliases for character set names.
For example, the charset <tt>iso-8859-1</tt> is also known
by <tt>latin1</tt>. Hence, <tt>latin1</tt> is an alias for
<tt>iso-8859-1</tt> and can be defined as follows: </p>
<pre class="code">
<b><CharsetAliases></b>
iso-8859-1; latin1
<b></CharsetAliases></b>
</pre>
<p>Each line of the CHARSETALIASES element defines an alias definition.
The syntax of an alias definition is as follows,
</p>
<pre class="code">
<var>charset-name</var>; <var>alias</var>, ...
</pre>
<p>i.e. the character set name followed by a semi-colon followed by
a comma separated list of aliases.
</p>
<p>Specifying a character set multiple times is allowed. For example,
the following are equivalent:
</p>
<pre class="code">
<b><CharsetAliases></b>
iso-8859-1; latin1, l1, iso_8859_1
<b></CharsetAliases></b>
<b><CharsetAliases></b>
iso-8859-1; latin1
iso-8859-1; l1
iso-8859-1; iso_8859_1
<b></CharsetAliases></b>
</pre>
<p>If the same alias is specified for two different charsets, then
the last one defined is use. For example, if the following is defined,
</p>
<pre class="code">
<b><CharsetAliases></b>
iso-8859-1; x-foo
koi8-u; x-foo
<b></CharsetAliases></b>
</pre>
<p>then <tt>x-foo</tt> will be an alias for <tt>koi8-u</tt>.
</p>
<p>When MHonArc invokes
<a href="charsetconverters.html">CHARSETCONVERTERS</a> filters, MHonArc
maps aliases to real names before invoking the filters. Therefore,
it is not necessary for a filter to know all possible names for a given
character set.
</p>
<p>If the <tt>override</tt> attribute is specified for CHARSETALIASES,
then any previous settings will be cleared. Otherwise, each occurance
of CHARSETALIASES will augment existing settings.
</p>
<!-- *************************************************************** -->
<hr>
<h2><a name="default">Default Setting</a></h2>
<pre class="code">
<b><CharsetAliases></b>
us-ascii; ascii
us-ascii; ansi_x3.4-1968
us-ascii; iso646
us-ascii; iso646-us
us-ascii; iso646.irv:1991
us-ascii; cp367
us-ascii; ibm367
us-ascii; csascii
us-ascii; iso-ir-6
us-ascii; us
iso-8859-1; latin1
iso-8859-1; l1
iso-8859-1; iso_8859_1
iso-8859-1; iso_8859-1:1987
iso-8859-1; iso8859-1
iso-8859-1; iso8859_1
iso-8859-1; 8859-1
iso-8859-1; 8859_1
iso-8859-1; cp819
iso-8859-1; ibm819
iso-8859-1; x-mac-latin1
iso-8859-1; iso-ir-100
iso-8859-2; latin2
iso-8859-2; l2
iso-8859-2; iso_8859_2
iso-8859-2; iso_8859-2:1987
iso-8859-2; iso8859-2
iso-8859-2; iso8859_2
iso-8859-2; 8859-2
iso-8859-2; 8859_2
iso-8859-2; iso-ir-101
iso-8859-3; latin3
iso-8859-3; l3
iso-8859-3; iso_8859_3
iso-8859-3; iso_8859-3:1988
iso-8859-3; iso8859-3
iso-8859-3; iso8859_3
iso-8859-3; 8859-3
iso-8859-3; 8859_3
iso-8859-3; iso-ir-109
iso-8859-4; latin4
iso-8859-4; l4
iso-8859-4; iso_8859_4
iso-8859-4; iso_8859-4:1988
iso-8859-4; iso8859-4
iso-8859-4; iso8859_4
iso-8859-4; 8859-4
iso-8859-4; 8859_4
iso-8859-4; iso-ir-110
iso-8859-5; iso_8859-5:1988
iso-8859-5; cyrillic
iso-8859-5; iso-ir-144
iso-8859-6; iso_8859-6:1987
iso-8859-6; arabic
iso-8859-6; asmo-708
iso-8859-6; ecma-114
iso-8859-6; iso-ir-127
iso-8859-7; iso_8859-7:1987
iso-8859-7; greek
iso-8859-7; greek8
iso-8859-7; ecma-118
iso-8859-7; elot_928
iso-8859-7; iso-ir-126
iso-8859-8; iso-8859-8-i
iso-8859-8; iso_8859-8:1988
iso-8859-8; hebrew
iso-8859-8; iso-ir-138
iso-8859-9; latin5
iso-8859-9; l5
iso-8859-9; iso_8859_9
iso-8859-9; iso-8859_9:1989
iso-8859-9; iso8859-9
iso-8859-9; iso8859_9
iso-8859-9; 8859-9
iso-8859-9; 8859_9
iso-8859-9; iso-ir-148
iso-8859-10; latin6
iso-8859-10; l6
iso-8859-10; iso_8859_10
iso-8859-10; iso_8859-10:1993
iso-8859-10; iso8859-10
iso-8859-10; iso8859_10
iso-8859-10; 8859-10
iso-8859-10; 8859_10
iso-8859-10; iso-ir-157
iso-8859-13; latin7 ,l7
iso-8859-14; latin8 ,l8
iso-8859-15; latin9
iso-8859-15; latin0
iso-8859-15; l9
iso-8859-15; l0
iso-8859-15; iso_8859_15
iso-8859-15; iso8859-15
iso-8859-15; iso8859_15
iso-8859-15; 8859-15
iso-8859-15; 8859_15
iso-2022-jp; iso-2022-jp-1
utf-8; utf8
cp932; shiftjis
cp932; shift_jis
cp932; shift-jis
cp932; x-sjis
cp932; ms_kanji
cp932; csshiftjis
cp936; gbk
cp936; ms936
cp936; windows-936
cp949: euc-kr
cp949: ks_c_5601-1987
cp949: ks_c_5601-1989
cp949: ksc_5601
cp949: iso-ir-149
cp949: windows-949
cp949: ms949
cp949: korean
cp950; windows-950
cp1250; windows-1250
cp1251; windows-1251
cp1252; windows-1252
cp1253; windows-1253
cp1254; windows-1254
cp1255; windows-1255
cp1256; windows-1256
cp1257; windows-1257
cp1258; windows-1258
koi-0; gost-13052
koi8-e; iso-ir-111
koi8-e; ecma-113:1986
koi8-r; cp878
gost-19768-87; ecma-cyrillic
gost-19768-87; ecma-113
gost-19768-87; ecma-113:1988
big5-eten; big5
big5-eten; csbig5
big5-eten; tcs-big5
big5-eten; tcsbig5
big5-hkscs; big5hk
big5-hkscs; big5hkscs
big5-hkscs; hkscs-big5
big5-hkscs; hk-big5
gb2312; gb_2312-80
gb2312; csgb2312
gb2312; hz-gb-2312
gb2312; iso-ir-58
gb2312; euc-cn
gb2312; chinese
gb2312; csiso58gb231280
macarabic; apple-arabic
maccentraleurroman; apple-centeuro
maccroatian; apple-croatian
maccyrillic; apple-cyrillic
macgreek; apple-greek
machebrew; apple-hebrew
macicelandic; apple-iceland
macromanian; apple-romanian
macroman; apple-roman
macthai; apple-thai
macturkish; apple-turkish
macarabic; x-mac-arabic
maccentraleurroman; x-mac-centraleurroman
maccroatian; x-mac-croatian
maccyrillic; x-mac-cyrillic
macgreek; x-mac-greek
machebrew; x-mac-hebrew
macicelandic; x-mac-icelandic
macromanian; x-mac-romanian
macroman; x-mac-roman
macthai; x-mac-thai
macturkish; x-mac-turkish
<b></CharsetAliases></b>
</pre>
<!-- *************************************************************** -->
<hr>
<h2><a name="rcvars">Resource Variables</a></h2>
<p>N/A
</p>
<!-- *************************************************************** -->
<hr>
<h2><a name="examples">Examples</a></h2>
<p>CHARSETALIASES is generally useful for resolving
"<tt>unknown charset</tt>" warnings that MHonArc generates since
some MUAs can specify non-standard names for charsets.
</p>
<p>Another use is to fool MHonArc into thinking that data labeled
with one charset is actual data in another charset. For example, in
some locales, MUAs improperly set the <tt>charset="..."</tt> parameter
in text messages. CHARSETALIASES can be used to tell MHonArc to treat
the improperly labeled data in another charset during conversion.
For example,
</p>
<pre class="code">
<b><CharsetAliases></b>
iso-8859-8; us-ascii
<b></CharsetAliases></b>
</pre>
<p>tells MHonArc to treat US-ASCII data as Hebrew.
</p>
<!-- *************************************************************** -->
<hr>
<h2><a name="version">Version</a></h2>
<p>2.6.0
</p>
<!-- *************************************************************** -->
<hr>
<h2><a name="seealso">See Also</a></h2>
<p>
<a href="charsetconverters.html">CHARSETCONVERTERS</a>
</p>
<!-- *************************************************************** -->
<hr>
<!--x-rc-nav-->
<table border=0><tr valign="top">
<td align="left" width="50%">[Prev: <a href="botlinks.html">BOTLINKS</a>]</td><td><nobr>[<a href="../resources.html#charsetaliases">Resources</a>][<a href="../mhonarc.html">TOC</a>]</nobr></td><td align="right" width="50%">[Next: <a href="charsetconverters.html">CHARSETCONVERTERS</a>]</td></tr></table>
<!--/x-rc-nav-->
<hr>
<address>
$Date: 2003/10/06 22:04:16 $<br>
<img align="top" src="../monicon.png" alt="">
<a href="http://www.mhonarc.org/"><strong>MHonArc</strong></a><br>
Copyright © 2002, <a href="http://www.earlhood.com/"
>Earl Hood</a>, <a href="mailto:mhonarc%40mhonarc.org"
>mhonarc<!--
-->@<!--
-->mhonarc.org</a><br>
</address>
</body>
</html>