Permalink
Find file
Fetching contributors…
Cannot retrieve contributors at this time
242 lines (215 sloc) 14.6 KB
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<!-- X-URL: http://www.ramsch.org/martin/uni/fmi-hp/iso8859-1.html -->
<!-- Date: Tue, 28 Dec 2004 20:24:09 GMT -->
<!-- Last-Modified: Mon, 15 May 2000 09:37:37 GMT -->
<HTML>
<HEAD>
<TITLE>Martin Ramsch - iso8859-1 table</TITLE>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<BASE HREF="http://www.ramsch.org/martin/uni/fmi-hp/iso8859-1.html">
</HEAD>
<BODY>
<H1 ALIGN=center>iso8859-1 table</H1>
<PRE>
Description Code Entity name
=================================== ============ ==============
quotation mark &amp;#34; --> &#34; &amp;quot; --> &quot;
ampersand &amp;#38; --> &#38; &amp;amp; --> &amp;
less-than sign &amp;#60; --> &#60; &amp;lt; --> &lt;
greater-than sign &amp;#62; --> &#62; &amp;gt; --> &gt;
Description Char Code Entity name
=================================== ==== ============ ==============
non-breaking space   &amp;#160; --> &#160; &amp;nbsp; --> &nbsp;
inverted exclamation ¡ &amp;#161; --> &#161; &amp;iexcl; --> &iexcl;
cent sign ¢ &amp;#162; --> &#162; &amp;cent; --> &cent;
pound sterling £ &amp;#163; --> &#163; &amp;pound; --> &pound;
general currency sign ¤ &amp;#164; --> &#164; &amp;curren; --> &curren;
yen sign ¥ &amp;#165; --> &#165; &amp;yen; --> &yen;
broken vertical bar ¦ &amp;#166; --> &#166; &amp;brvbar; --> &brvbar;
Non-standard &amp;brkbar; --> &brkbar;
section sign § &amp;#167; --> &#167; &amp;sect; --> &sect;
umlaut (dieresis) ¨ &amp;#168; --> &#168; &amp;uml; --> &uml;
Non-standard &amp;die; --> &die;
copyright © &amp;#169; --> &#169; &amp;copy; --> &copy;
feminine ordinal ª &amp;#170; --> &#170; &amp;ordf; --> &ordf;
left angle quote, guillemotleft « &amp;#171; --> &#171; &amp;laquo; --> &laquo;
not sign ¬ &amp;#172; --> &#172; &amp;not; --> &not;
soft hyphen ­ &amp;#173; --> &#173; &amp;shy; --> &shy;
registered trademark ® &amp;#174; --> &#174; &amp;reg; --> &reg;
macron accent ¯ &amp;#175; --> &#175; &amp;macr; --> &macr;
Non-standard &amp;hibar; --> &hibar;
degree sign ° &amp;#176; --> &#176; &amp;deg; --> &deg;
plus or minus ± &amp;#177; --> &#177; &amp;plusmn; --> &plusmn;
superscript two ² &amp;#178; --> &#178; &amp;sup2; --> &sup2;
superscript three ³ &amp;#179; --> &#179; &amp;sup3; --> &sup3;
acute accent ´ &amp;#180; --> &#180; &amp;acute; --> &acute;
micro sign µ &amp;#181; --> &#181; &amp;micro; --> &micro;
paragraph sign ¶ &amp;#182; --> &#182; &amp;para; --> &para;
middle dot · &amp;#183; --> &#183; &amp;middot; --> &middot;
cedilla ¸ &amp;#184; --> &#184; &amp;cedil; --> &cedil;
superscript one ¹ &amp;#185; --> &#185; &amp;sup1; --> &sup1;
masculine ordinal º &amp;#186; --> &#186; &amp;ordm; --> &ordm;
right angle quote, guillemotright » &amp;#187; --> &#187; &amp;raquo; --> &raquo;
fraction one-fourth ¼ &amp;#188; --> &#188; &amp;frac14; --> &frac14;
fraction one-half ½ &amp;#189; --> &#189; &amp;frac12; --> &frac12;
fraction three-fourths ¾ &amp;#190; --> &#190; &amp;frac34; --> &frac34;
inverted question mark ¿ &amp;#191; --> &#191; &amp;iquest; --> &iquest;
capital A, grave accent À &amp;#192; --> &#192; &amp;Agrave; --> &Agrave;
capital A, acute accent Á &amp;#193; --> &#193; &amp;Aacute; --> &Aacute;
capital A, circumflex accent  &amp;#194; --> &#194; &amp;Acirc; --> &Acirc;
capital A, tilde à &amp;#195; --> &#195; &amp;Atilde; --> &Atilde;
capital A, dieresis or umlaut mark Ä &amp;#196; --> &#196; &amp;Auml; --> &Auml;
capital A, ring Å &amp;#197; --> &#197; &amp;Aring; --> &Aring;
capital AE diphthong (ligature) Æ &amp;#198; --> &#198; &amp;AElig; --> &AElig;
capital C, cedilla Ç &amp;#199; --> &#199; &amp;Ccedil; --> &Ccedil;
capital E, grave accent È &amp;#200; --> &#200; &amp;Egrave; --> &Egrave;
capital E, acute accent É &amp;#201; --> &#201; &amp;Eacute; --> &Eacute;
capital E, circumflex accent Ê &amp;#202; --> &#202; &amp;Ecirc; --> &Ecirc;
capital E, dieresis or umlaut mark Ë &amp;#203; --> &#203; &amp;Euml; --> &Euml;
capital I, grave accent Ì &amp;#204; --> &#204; &amp;Igrave; --> &Igrave;
capital I, acute accent Í &amp;#205; --> &#205; &amp;Iacute; --> &Iacute;
capital I, circumflex accent Î &amp;#206; --> &#206; &amp;Icirc; --> &Icirc;
capital I, dieresis or umlaut mark Ï &amp;#207; --> &#207; &amp;Iuml; --> &Iuml;
capital Eth, Icelandic Ð &amp;#208; --> &#208; &amp;ETH; --> &ETH;
Non-standard &amp;Dstrok; --> &Dstrok;
capital N, tilde Ñ &amp;#209; --> &#209; &amp;Ntilde; --> &Ntilde;
capital O, grave accent Ò &amp;#210; --> &#210; &amp;Ograve; --> &Ograve;
capital O, acute accent Ó &amp;#211; --> &#211; &amp;Oacute; --> &Oacute;
capital O, circumflex accent Ô &amp;#212; --> &#212; &amp;Ocirc; --> &Ocirc;
capital O, tilde Õ &amp;#213; --> &#213; &amp;Otilde; --> &Otilde;
capital O, dieresis or umlaut mark Ö &amp;#214; --> &#214; &amp;Ouml; --> &Ouml;
multiply sign × &amp;#215; --> &#215; &amp;times; --> &times;
capital O, slash Ø &amp;#216; --> &#216; &amp;Oslash; --> &Oslash;
capital U, grave accent Ù &amp;#217; --> &#217; &amp;Ugrave; --> &Ugrave;
capital U, acute accent Ú &amp;#218; --> &#218; &amp;Uacute; --> &Uacute;
capital U, circumflex accent Û &amp;#219; --> &#219; &amp;Ucirc; --> &Ucirc;
capital U, dieresis or umlaut mark Ü &amp;#220; --> &#220; &amp;Uuml; --> &Uuml;
capital Y, acute accent Ý &amp;#221; --> &#221; &amp;Yacute; --> &Yacute;
capital THORN, Icelandic Þ &amp;#222; --> &#222; &amp;THORN; --> &THORN;
small sharp s, German (sz ligature) ß &amp;#223; --> &#223; &amp;szlig; --> &szlig;
small a, grave accent à &amp;#224; --> &#224; &amp;agrave; --> &agrave;
small a, acute accent á &amp;#225; --> &#225; &amp;aacute; --> &aacute;
small a, circumflex accent â &amp;#226; --> &#226; &amp;acirc; --> &acirc;
small a, tilde ã &amp;#227; --> &#227; &amp;atilde; --> &atilde;
small a, dieresis or umlaut mark ä &amp;#228; --> &#228; &amp;auml; --> &auml;
small a, ring å &amp;#229; --> &#229; &amp;aring; --> &aring;
small ae diphthong (ligature) æ &amp;#230; --> &#230; &amp;aelig; --> &aelig;
small c, cedilla ç &amp;#231; --> &#231; &amp;ccedil; --> &ccedil;
small e, grave accent è &amp;#232; --> &#232; &amp;egrave; --> &egrave;
small e, acute accent é &amp;#233; --> &#233; &amp;eacute; --> &eacute;
small e, circumflex accent ê &amp;#234; --> &#234; &amp;ecirc; --> &ecirc;
small e, dieresis or umlaut mark ë &amp;#235; --> &#235; &amp;euml; --> &euml;
small i, grave accent ì &amp;#236; --> &#236; &amp;igrave; --> &igrave;
small i, acute accent í &amp;#237; --> &#237; &amp;iacute; --> &iacute;
small i, circumflex accent î &amp;#238; --> &#238; &amp;icirc; --> &icirc;
small i, dieresis or umlaut mark ï &amp;#239; --> &#239; &amp;iuml; --> &iuml;
small eth, Icelandic ð &amp;#240; --> &#240; &amp;eth; --> &eth;
small n, tilde ñ &amp;#241; --> &#241; &amp;ntilde; --> &ntilde;
small o, grave accent ò &amp;#242; --> &#242; &amp;ograve; --> &ograve;
small o, acute accent ó &amp;#243; --> &#243; &amp;oacute; --> &oacute;
small o, circumflex accent ô &amp;#244; --> &#244; &amp;ocirc; --> &ocirc;
small o, tilde õ &amp;#245; --> &#245; &amp;otilde; --> &otilde;
small o, dieresis or umlaut mark ö &amp;#246; --> &#246; &amp;ouml; --> &ouml;
division sign ÷ &amp;#247; --> &#247; &amp;divide; --> &divide;
small o, slash ø &amp;#248; --> &#248; &amp;oslash; --> &oslash;
small u, grave accent ù &amp;#249; --> &#249; &amp;ugrave; --> &ugrave;
small u, acute accent ú &amp;#250; --> &#250; &amp;uacute; --> &uacute;
small u, circumflex accent û &amp;#251; --> &#251; &amp;ucirc; --> &ucirc;
small u, dieresis or umlaut mark ü &amp;#252; --> &#252; &amp;uuml; --> &uuml;
small y, acute accent ý &amp;#253; --> &#253; &amp;yacute; --> &yacute;
small thorn, Icelandic þ &amp;#254; --> &#254; &amp;thorn; --> &thorn;
small y, dieresis or umlaut mark ÿ &amp;#255; --> &#255; &amp;yuml; --> &yuml;
</PRE>
<!-- removed: second /PRE, a hack for HotJava 1.0 preBeta 1 -->
<HR>
<STRONG>How to read</STRONG> this table. The columns are
<DL COMPACT>
<DT>1st:<DD>textual <EM>description</EM> of the character
<DT>2nd:<DD>character inserted directly into the HTML page as <EM>one
byte</EM>
<DT>3rd:<DD>character written as <EM>numeric HTML entity</EM>, in the
format:<BR>"how it looks literally" <CODE>--&gt;</CODE>
"what your browser does with it"
<DT>4th:<DD>character written as <EM>symbolic HTML entity</EM>, in the
format:<BR>"how it looks literally" <CODE>--&gt;</CODE>
"what your browser does with it"
</DL>
So for example, if you see something like "<CODE>&amp;divide; -->
&amp;divide;</CODE>" in the 4th column, this means your browser
doesn't know about the entity name "divide" and just puts it
literally.
<P>
<STRONG>This table</STRONG> grew out of an overview of the "ISO
Latin-1 Character Set" overview related to the Hyper-G Text Format
(<A HREF="http://www.hyperwave.de/HTFdoc">HTF</A>).
The entity names <CODE>&amp;brkbar;</CODE> and <CODE>&amp;Dstrok;</CODE>
seem to be unique to HTF.
The entity name <CODE>&amp;hibar;</CODE> has been supported by X Mosaic
but seems to be replaced with <CODE>&amp;macr;</CODE>.
The entity names <CODE>&amp;uml;</CODE> and <CODE>&amp;die;</CODE> should
be equivalent.
<P><STRONG>The standards stuff:</STRONG>
The
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html-spec/">HTML 2.0 Standard</A>
includes a section on
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html-spec/html-spec_9.html#SEC99">Character Entity Sets</A>
and an overview on the
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html-spec/html-spec_13.html#SEC106">HTML Coded Character Set</A>
(The entity names are derived from <A HREF="http://www.ucc.ie/info/net/isolat1.html">ISO 8879</A>).
<BR>
Or have a look at the
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html3/latin1.html">Latin-1 Character Entities</A>
as listed in an draft for the
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html3/CoverPage.html">HTML 3.0 specification</A>.
<BR>
The
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/HTMLPlus/htmlplus_59.html">Appendix II</A>
of CERN's
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/HTMLPlus/htmlplus_1.html">HTML+ Discussion Document</A>
contains a
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/HTMLPlus/htmlplus_table.ps">table</A>
(in PostScript format) of the proposed character entities for HTML+ and their
corresponding character codes for Unicode and the Adobe Latin-1 &amp; Symbol
character sets.
<P>
<STRONG>Please note</STRONG> that there is nothing wrong with using
characters of ISO Latin-1 above 127: the normal transmission protocol
for the WWW,
<A HREF="http://www.w3.org/pub/WWW/Protocols/rfc1945/rfc1945">HTTP/1.0</A>,
uses the 8bit ISO latin-1 as default encoding.
(Thanks to Roman
Czyborra for pointing this out!)
<P>
<STRONG>Other information:</STRONG>
<UL>
<LI><STRONG>Kevin J. Brewer</STRONG> has done two very good pages on the subject:
<UL>
<LI><A HREF="http://www.bbsinc.com/iso8859.html">ASCII - ISO 8859-1 (Latin-1) with HTML 3.0 Entities Table</A> and
<LI><A HREF="http://www.bbsinc.com/iso8879.html">ISO 8879 Entities Gopher Menu</A>
</UL>
<LI>The excellent overview on the series of
<A HREF="http://czyborra.com/charsets/iso8859.html">ISO 8859
character sets</A> compiled by Roman Czyborra.
<LI>Also have a look on Alan Flavell's page of
<A HREF="http://ppewww.ph.gla.ac.uk/%7Eflavell/iso8859/iso8859-pointers.html">pointers
to information about ISO8859</A>. It's written very well!
<LI>Maybe also of interest to you is the
<A HREF="ftp://ftp.vlsivie.tuwien.ac.at/pub/8bit/FAQ-ISO-8859-1">ISO
8859-1 FAQ</A> by Michael Gschwind
(<A HREF="mailto:mike@vlsivie.tuwien.ac.at">mike@vlsivie.tuwien.ac.at</A>),
part of his page on
<A HREF="http://www.vlsivie.tuwien.ac.at/mike/i18n.html">Internationalization</A>.
<LI>For users of X11R5 on SunOS systems: the
<A HREF="Compose.txt">table over the compose combinations</A>
(also coded <A HREF="Compose.html">with entities</A> where possible).
It's taken from the MIT X sources in
<CODE>server/ddx/sun/Compose.list</CODE>.
<LI>Finally you could have a look at
<A HREF="ftp://ds.internic.net/rfc/rfc1345.txt">RFC 1345:
Character Mnemonics &amp; Character Sets</A>
by K. Simonsen (06/11/92, 103 pages, approx. 240 kbyte).
</UL>
<HR>
<ADDRESS><A HREF="http://ramsch.home.pages.de/">Martin Ramsch</A>, 16.02.1994, 07.01.1996, 01.07.1996, 1998-10-09, 2000-05-15</ADDRESS>
</BODY>
</HTML>