Skip to content
Permalink
Browse files

[giow] (2) List the default encodings by locale.

git-svn-id: http://svn.whatwg.org/webapps@4126 340c8d12-0b0e-0410-8428-c7bf67bfef74
  • Loading branch information...
Hixie committed Oct 13, 2009
1 parent b2e3aca commit 96374b61a1c8898de97c7083c891b5d456ebcd2e
Showing with 380 additions and 26 deletions.
  1. +114 −8 complete.html
  2. +114 −8 index
  3. +152 −10 source

</li>

<li><p>Otherwise, return an implementation-defined or
user-specified default character encoding, with the <a href=#concept-encoding-confidence title=concept-encoding-confidence>confidence</a>
<i>tentative</i>. In controlled environments or in environments
where the encoding of documents can be prescribed (for example, for
user agents intended for dedicated use in new networks), the more
comprehensive <code title="">UTF-8</code> encoding is
suggested. Due to its use in legacy content, <code title="">windows-1252</code> is suggested as a default in
predominantly Western locales instead.</li>
<li>

<p>Otherwise, return an implementation-defined or user-specified
default character encoding, with the <a href=#concept-encoding-confidence title=concept-encoding-confidence>confidence</a>
<i>tentative</i>.</p>

<p>In controlled environments or in environments where the
encoding of documents can be prescribed (for example, for user
agents intended for dedicated use in new networks), the
comprehensive <code title="">UTF-8</code> encoding is
suggested.</p>

<p>In other environments, the default encoding is typically
dependent on the user's locale (an approximation of the languages,
and thus typically encodings, of the pages that the user is likely
to frequent). The following table gives suggested defaults based
on the user's locale, for compatibility with legacy content:</p>

<!-- based on mozilla 1.9.1 localizations:
http://mxr.mozilla.org/l10n-mozilla1.9.1/find?string=global%2Fintl.properties&tree=l10n-mozilla1.9.1&hint= -->

<table><thead><tr><th>Locale
<th>Suggested default encoding
<tbody><tr><td>ar
<td>UTF-8

<tr><td>be
<td>ISO-8859-5

<tr><td>bg
<td>windows-1251

<tr><td>cs<!-- -CZ -->
<td>ISO-8859-2

<tr><td>cy
<td>UTF-8

<tr><td>fa<!-- -IR -->
<td>UTF-8

<tr><td>he<!-- -IL -->
<td>windows-1255

<tr><td>hr
<td>UTF-8

<tr><td>hu<!-- -HU -->
<td>ISO-8859-2

<tr><td>ja <!-- and ja-JP-mac -->
<td>windows-31J <!-- Shift_JIS -->

<tr><td>kk
<td>UTF-8

<tr><td>ko<!-- -KR -->
<td>windows-949 <!-- EUC-KR -->

<tr><td>ku
<td>windows-1254 <!-- ISO-8859-9 -->

<tr><td>lt
<td>windows-1257

<tr><td>lv<!-- -LV -->
<td>ISO-8859-13

<tr><td>mk<!-- -MK -->
<td>UTF-8

<tr><td>or
<td>UTF-8

<tr><td>pl<!-- -PL -->
<td>ISO-8859-2

<tr><td>ro
<td>UTF-8

<tr><td>ru
<td>windows-1251

<tr><td>sk
<td>windows-1250

<tr><td>sl
<td>ISO-8859-2

<tr><td>sr
<td>UTF-8

<tr><td>th
<td>windows-874 <!-- TIS-620 -->

<tr><td>tr<!-- -TR -->
<td>windows-1254 <!-- ISO-8859-9 -->

<tr><td>uk
<td>windows-1251

<tr><td>vi
<td>UTF-8

<tr><td>zh-CN
<td>GB18030

<tr><td>zh-TW
<td>Big5

<tr><td>All other locales
<td>windows-1252

</table></li>

</ol><p>The <a href="#document's-character-encoding">document's character encoding</a> must immediately
be set to the value returned from this algorithm, at the same time
122 index

</li>

<li><p>Otherwise, return an implementation-defined or
user-specified default character encoding, with the <a href=#concept-encoding-confidence title=concept-encoding-confidence>confidence</a>
<i>tentative</i>. In controlled environments or in environments
where the encoding of documents can be prescribed (for example, for
user agents intended for dedicated use in new networks), the more
comprehensive <code title="">UTF-8</code> encoding is
suggested. Due to its use in legacy content, <code title="">windows-1252</code> is suggested as a default in
predominantly Western locales instead.</li>
<li>

<p>Otherwise, return an implementation-defined or user-specified
default character encoding, with the <a href=#concept-encoding-confidence title=concept-encoding-confidence>confidence</a>
<i>tentative</i>.</p>

<p>In controlled environments or in environments where the
encoding of documents can be prescribed (for example, for user
agents intended for dedicated use in new networks), the
comprehensive <code title="">UTF-8</code> encoding is
suggested.</p>

<p>In other environments, the default encoding is typically
dependent on the user's locale (an approximation of the languages,
and thus typically encodings, of the pages that the user is likely
to frequent). The following table gives suggested defaults based
on the user's locale, for compatibility with legacy content:</p>

<!-- based on mozilla 1.9.1 localizations:
http://mxr.mozilla.org/l10n-mozilla1.9.1/find?string=global%2Fintl.properties&tree=l10n-mozilla1.9.1&hint= -->

<table><thead><tr><th>Locale
<th>Suggested default encoding
<tbody><tr><td>ar
<td>UTF-8

<tr><td>be
<td>ISO-8859-5

<tr><td>bg
<td>windows-1251

<tr><td>cs<!-- -CZ -->
<td>ISO-8859-2

<tr><td>cy
<td>UTF-8

<tr><td>fa<!-- -IR -->
<td>UTF-8

<tr><td>he<!-- -IL -->
<td>windows-1255

<tr><td>hr
<td>UTF-8

<tr><td>hu<!-- -HU -->
<td>ISO-8859-2

<tr><td>ja <!-- and ja-JP-mac -->
<td>windows-31J <!-- Shift_JIS -->

<tr><td>kk
<td>UTF-8

<tr><td>ko<!-- -KR -->
<td>windows-949 <!-- EUC-KR -->

<tr><td>ku
<td>windows-1254 <!-- ISO-8859-9 -->

<tr><td>lt
<td>windows-1257

<tr><td>lv<!-- -LV -->
<td>ISO-8859-13

<tr><td>mk<!-- -MK -->
<td>UTF-8

<tr><td>or
<td>UTF-8

<tr><td>pl<!-- -PL -->
<td>ISO-8859-2

<tr><td>ro
<td>UTF-8

<tr><td>ru
<td>windows-1251

<tr><td>sk
<td>windows-1250

<tr><td>sl
<td>ISO-8859-2

<tr><td>sr
<td>UTF-8

<tr><td>th
<td>windows-874 <!-- TIS-620 -->

<tr><td>tr<!-- -TR -->
<td>windows-1254 <!-- ISO-8859-9 -->

<tr><td>uk
<td>windows-1251

<tr><td>vi
<td>UTF-8

<tr><td>zh-CN
<td>GB18030

<tr><td>zh-TW
<td>Big5

<tr><td>All other locales
<td>windows-1252

</table></li>

</ol><p>The <a href="#document's-character-encoding">document's character encoding</a> must immediately
be set to the value returned from this algorithm, at the same time

0 comments on commit 96374b6

Please sign in to comment.
You can’t perform that action at this time.