All Latin-1 Character Codes
ISO-8859-1 (ISO Latin 1) Character Encoding
Contents
- The characters at a glance
- Character codes and names
- Notes for HTML documents
- Other notes
- Additional references
The characters at a glance
Here are all the printable characters, in collating order:
! " # $ % & ' ( ) * + , - . /
0 1 2 3 4 5 6 7 8 9 : ; < = > ? @
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
[ \ ] ^ _ `
a b c d e f g h i j k l m n o p q r s t u v w x y z
{ | } ~
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
The first six rows are the ASCII character set.
Note the ordinary ASCII space (before `!') and the ISO Latin-1 non-breaking space (before `¡').
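The list above can be regenerated from the code values alone. The following is a minimal sketch in Python, assuming its built-in `latin-1` codec; the function name `printable_latin1` is illustrative and does not come from the original page.

```python
def printable_latin1() -> str:
    """Return the printable ISO-8859-1 characters in code order.

    Codes 0x20-0x7E are the ASCII printables (starting with the space);
    codes 0xA0-0xFF are the Latin-1 additions (starting with the
    non-breaking space). Codes 0x00-0x1F and 0x7F-0x9F are skipped.
    """
    codes = list(range(0x20, 0x7F)) + list(range(0xA0, 0x100))
    return bytes(codes).decode("latin-1")


chars = printable_latin1()
print(len(chars))      # number of printable characters
print(chars[1:16])     # the punctuation that follows the ASCII space
```

Because ISO-8859-1 maps each byte to the Unicode code point of the same value, decoding a single byte is enough to obtain each character.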
Character codes and names
The columns show, in order:
- HTML: the HTML notation (decimal);
- OCTL: the C/Modula-3 octal notation;
- HEX: the hexadecimal code, as used e.g. in MIME quoted-printable encoding;
- CMP: the Sun/X11 "Compose" key combinations;
- CHR: the character itself, when HTML-printable;
- MEANING: the ISO-8859-1 (and ASCII) definition.
HTML | OCTL | HEX | CMP | CHR | MEANING
------ + ---- + --- + --- + --- + ------------------------------
&#0; | \000 | =00 | | | Invalid (ASCII NUL)
&#1; | \001 | =01 | | | Unused (ASCII SOH)
&#2; | \002 | =02 | | | Unused (ASCII STX)
&#3; | \003 | =03 | | | Unused (ASCII ETX)
&#4; | \004 | =04 | | | Unused (ASCII EOT)
&#5; | \005 | =05 | | | Unused (ASCII ENQ)
&#6; | \006 | =06 | | | Unused (ASCII ACK)
&#7; | \007 | =07 | | | Unused (ASCII BEL, audible bell)
&#8; | \010 | =08 | | | Unused (ASCII BS, backspace)
&#9; | \011 | =09 | | | Horizontal tab (ASCII HT)
&#10; | \012 | =0A | | | Line feed (ASCII NL, newline)
&#11; | \013 | =0B | | | Unused (ASCII VT, vertical tab)
&#12; | \014 | =0C | | | Unused (ASCII NP, new page)
&#13; | \015 | =0D | | | Carriage Return (ASCII CR)
&#14; | \016 | =0E | | | Unused (ASCII SO)
&#15; | \017 | =0F | | | Unused (ASCII SI)
&#16; | \020 | =10 | | | Unused (ASCII DLE)
&#17; | \021 | =11 | | | Unused (ASCII DC1)
&#18; | \022 | =12 | | | Unused (ASCII DC2)
&#19; | \023 | =13 | | | Unused (ASCII DC3)
&#20; | \024 | =14 | | | Unused (ASCII DC4)
&#21; | \025 | =15 | | | Unused (ASCII NAK)
&#22; | \026 | =16 | | | Unused (ASCII SYN)
&#23; | \027 | =17 | | | Unused (ASCII ETB)
&#24; | \030 | =18 | | | Unused (ASCII CAN)
&#25; | \031 | =19 | | | Unused (ASCII EM)
&#26; | \032 | =1A | | | Unused (ASCII SUB)
&#27; | \033 | =1B | | | Unused (ASCII ESC, escape)
&#28; | \034 | =1C | | | Unused (ASCII FS)
&#29; | \035 | =1D | | | Unused (ASCII GS)
&#30; | \036 | =1E | | | Unused (ASCII RS)
&#31; | \037 | =1F | | | Unused (ASCII US)
&#32; | \040 | =20 | | ( ) | Space (ASCII SP)
&#33; | \041 | =21 | | (!) | Exclamation mark
&#34; | \042 | =22 | | (") | Quotation mark (&quot;)
&#35; | \043 | =23 | | (#) | Number sign
&#36; | \044 | =24 | | ($) | Dollar sign
&#37; | \045 | =25 | | (%) | Percent sign
&#38; | \046 | =26 | | (&) | Ampersand (&amp;)
&#39; | \047 | =27 | | (') | Apostrophe (right single quote)
&#40; | \050 | =28 | | (() | Left parenthesis
&#41; | \051 | =29 | | ()) | Right parenthesis
&#42; | \052 | =2A | | (*) | Asterisk
&#43; | \053 | =2B | | (+) | Plus sign
&#44; | \054 | =2C | | (,) | Comma
&#45; | \055 | =2D | | (-) | Hyphen
&#46; | \056 | =2E | | (.) | Period (fullstop)
&#47; | \057 | =2F | | (/) | Solidus (slash)
&#48; | \060 | =30 | | (0) | Digit 0
. . .
&#57; | \071 | =39 | | (9) | Digit 9
&#58; | \072 | =3A | | (:) | Colon
&#59; | \073 | =3B | | (;) | Semi-colon
&#60; | \074 | =3C | | (<) | Less than (&lt;)
&#61; | \075 | =3D | | (=) | Equals sign
&#62; | \076 | =3E | | (>) | Greater than (&gt;)
&#63; | \077 | =3F | | (?) | Question mark
&#64; | \100 | =40 | | (@) | Commercial at-sign
&#65; | \101 | =41 | | (A) | Uppercase letter A
. . .
&#90; | \132 | =5A | | (Z) | Uppercase letter Z
&#91; | \133 | =5B | | ([) | Left square bracket
&#92; | \134 | =5C | | (\) | Reverse solidus (backslash)
&#93; | \135 | =5D | | (]) | Right square bracket
&#94; | \136 | =5E | | (^) | Caret
&#95; | \137 | =5F | | (_) | Horizontal bar (underscore)
&#96; | \140 | =60 | | (`) | Reverse apostrophe (left single quote)
&#97; | \141 | =61 | | (a) | Lowercase letter a
. . .
&#122; | \172 | =7A | | (z) | Lowercase letter z
&#123; | \173 | =7B | | ({) | Left curly brace
&#124; | \174 | =7C | | (|) | Vertical bar
&#125; | \175 | =7D | | (}) | Right curly brace
&#126; | \176 | =7E | | (~) | Tilde
&#127; | \177 | =7F | | | Unused (ASCII DEL)
&#128; | \200 | =80 | | | Unused
&#129; | \201 | =81 | | | Unused
&#130; | \202 | =82 | | | Unused
&#131; | \203 | =83 | | | Unused
&#132; | \204 | =84 | | | Unused
&#133; | \205 | =85 | | | Unused
&#134; | \206 | =86 | | | Unused
&#135; | \207 | =87 | | | Unused
&#136; | \210 | =88 | | | Unused
&#137; | \211 | =89 | | | Unused
&#138; | \212 | =8A | | | Unused
&#139; | \213 | =8B | | | Unused
&#140; | \214 | =8C | | | Unused
&#141; | \215 | =8D | | | Unused
&#142; | \216 | =8E | | | Unused
&#143; | \217 | =8F | | | Unused
&#144; | \220 | =90 | | | Unused
&#145; | \221 | =91 | | | Unused
&#146; | \222 | =92 | | | Unused
&#147; | \223 | =93 | | | Unused
&#148; | \224 | =94 | | | Unused
&#149; | \225 | =95 | | | Unused
&#150; | \226 | =96 | | | Unused
&#151; | \227 | =97 | | | Unused
&#152; | \230 | =98 | | | Unused
&#153; | \231 | =99 | | | Unused
&#154; | \232 | =9A | | | Unused
&#155; | \233 | =9B | | | Unused
&#156; | \234 | =9C | | | Unused
&#157; | \235 | =9D | | | Unused
&#158; | \236 | =9E | | | Unused
&#159; | \237 | =9F | | | Unused
&#160; | \240 | =A0 | | ( ) | Non-breaking space (&nbsp;)
&#161; | \241 | =A1 | ! ! | (¡) | Inverted exclamation
&#162; | \242 | =A2 | c / | (¢) | Cent sign
&#163; | \243 | =A3 | l - | (£) | Pound sterling
&#164; | \244 | =A4 | o x | (¤) | General currency sign
&#165; | \245 | =A5 | y - | (¥) | Yen sign
&#166; | \246 | =A6 | | | | (¦) | Broken vertical bar
&#167; | \247 | =A7 | s o | (§) | Section sign
&#168; | \250 | =A8 | " " | (¨) | Umlaut (dieresis)
&#169; | \251 | =A9 | c o | (©) | Copyright
&#170; | \252 | =AA | - a | (ª) | Feminine ordinal
&#171; | \253 | =AB | < < | («) | Left angle quote, guillemotleft
&#172; | \254 | =AC | - , | (¬) | Not sign
&#173; | \255 | =AD | - - | () | Soft hyphen
&#174; | \256 | =AE | r o | (®) | Registered trademark
&#175; | \257 | =AF | ^ - | (¯) | Macron accent
&#176; | \260 | =B0 | ^ * | (°) | Degree sign
&#177; | \261 | =B1 | + - | (±) | Plus or minus
&#178; | \262 | =B2 | ^ 2 | (²) | Superscript two
&#179; | \263 | =B3 | ^ 3 | (³) | Superscript three
&#180; | \264 | =B4 | \ \ | (´) | Acute accent
&#181; | \265 | =B5 | / u | (µ) | Micro sign
&#182; | \266 | =B6 | P ! | (¶) | Paragraph sign
&#183; | \267 | =B7 | ^ . | (·) | Middle dot
&#184; | \270 | =B8 | , , | (¸) | Cedilla
&#185; | \271 | =B9 | ^ 1 | (¹) | Superscript one
&#186; | \272 | =BA | _ o | (º) | Masculine ordinal
&#187; | \273 | =BB | > > | (») | Right angle quote, guillemotright
&#188; | \274 | =BC | 1 4 | (¼) | Fraction one-fourth
&#189; | \275 | =BD | 1 2 | (½) | Fraction one-half
&#190; | \276 | =BE | 3 4 | (¾) | Fraction three-fourths
&#191; | \277 | =BF | ? ? | (¿) | Inverted question mark
&#192; | \300 | =C0 | A ` | (À) | Capital A, grave accent
&#193; | \301 | =C1 | A ' | (Á) | Capital A, acute accent
&#194; | \302 | =C2 | A ^ | (Â) | Capital A, circumflex accent
&#195; | \303 | =C3 | A ~ | (Ã) | Capital A, tilde
&#196; | \304 | =C4 | A " | (Ä) | Capital A, dieresis or umlaut mark
&#197; | \305 | =C5 | A * | (Å) | Capital A, ring
&#198; | \306 | =C6 | A E | (Æ) | Capital AE diphthong (ligature)
&#199; | \307 | =C7 | C , | (Ç) | Capital C, cedilla
&#200; | \310 | =C8 | E ` | (È) | Capital E, grave accent
&#201; | \311 | =C9 | E ' | (É) | Capital E, acute accent
&#202; | \312 | =CA | E ^ | (Ê) | Capital E, circumflex accent
&#203; | \313 | =CB | E " | (Ë) | Capital E, dieresis or umlaut mark
&#204; | \314 | =CC | I ` | (Ì) | Capital I, grave accent
&#205; | \315 | =CD | I ' | (Í) | Capital I, acute accent
&#206; | \316 | =CE | I ^ | (Î) | Capital I, circumflex accent
&#207; | \317 | =CF | I " | (Ï) | Capital I, dieresis or umlaut mark
&#208; | \320 | =D0 | D - | (Ð) | Capital Eth, Icelandic
&#209; | \321 | =D1 | N ~ | (Ñ) | Capital N, tilde
&#210; | \322 | =D2 | O ` | (Ò) | Capital O, grave accent
&#211; | \323 | =D3 | O ' | (Ó) | Capital O, acute accent
&#212; | \324 | =D4 | O ^ | (Ô) | Capital O, circumflex accent
&#213; | \325 | =D5 | O ~ | (Õ) | Capital O, tilde
&#214; | \326 | =D6 | O " | (Ö) | Capital O, dieresis or umlaut mark
&#215; | \327 | =D7 | x x | (×) | Multiply sign
&#216; | \330 | =D8 | O / | (Ø) | Capital O, slash
&#217; | \331 | =D9 | U ` | (Ù) | Capital U, grave accent
&#218; | \332 | =DA | U ' | (Ú) | Capital U, acute accent
&#219; | \333 | =DB | U ^ | (Û) | Capital U, circumflex accent
&#220; | \334 | =DC | U " | (Ü) | Capital U, dieresis or umlaut mark
&#221; | \335 | =DD | Y ' | (Ý) | Capital Y, acute accent
&#222; | \336 | =DE | P | | (Þ) | Capital THORN, Icelandic
&#223; | \337 | =DF | s s | (ß) | Small sharp s, German (sz ligature)
&#224; | \340 | =E0 | a ` | (à) | Small a, grave accent
&#225; | \341 | =E1 | a ' | (á) | Small a, acute accent
&#226; | \342 | =E2 | a ^ | (â) | Small a, circumflex accent
&#227; | \343 | =E3 | a ~ | (ã) | Small a, tilde
&#228; | \344 | =E4 | a " | (ä) | Small a, dieresis or umlaut mark
&#229; | \345 | =E5 | a * | (å) | Small a, ring
&#230; | \346 | =E6 | a e | (æ) | Small ae diphthong (ligature)
&#231; | \347 | =E7 | c , | (ç) | Small c, cedilla
&#232; | \350 | =E8 | e ` | (è) | Small e, grave accent
&#233; | \351 | =E9 | e ' | (é) | Small e, acute accent
&#234; | \352 | =EA | e ^ | (ê) | Small e, circumflex accent
&#235; | \353 | =EB | e " | (ë) | Small e, dieresis or umlaut mark
&#236; | \354 | =EC | i ` | (ì) | Small i, grave accent
&#237; | \355 | =ED | i ' | (í) | Small i, acute accent
&#238; | \356 | =EE | i ^ | (î) | Small i, circumflex accent
&#239; | \357 | =EF | i " | (ï) | Small i, dieresis or umlaut mark
&#240; | \360 | =F0 | d - | (ð) | Small eth, Icelandic
&#241; | \361 | =F1 | n ~ | (ñ) | Small n, tilde
&#242; | \362 | =F2 | o ` | (ò) | Small o, grave accent
&#243; | \363 | =F3 | o ' | (ó) | Small o, acute accent
&#244; | \364 | =F4 | o ^ | (ô) | Small o, circumflex accent
&#245; | \365 | =F5 | o ~ | (õ) | Small o, tilde
&#246; | \366 | =F6 | o " | (ö) | Small o, dieresis or umlaut mark
&#247; | \367 | =F7 | - : | (÷) | Division sign
&#248; | \370 | =F8 | o / | (ø) | Small o, slash
&#249; | \371 | =F9 | u ` | (ù) | Small u, grave accent
&#250; | \372 | =FA | u ' | (ú) | Small u, acute accent
&#251; | \373 | =FB | u ^ | (û) | Small u, circumflex accent
&#252; | \374 | =FC | u " | (ü) | Small u, dieresis or umlaut mark
&#253; | \375 | =FD | y ' | (ý) | Small y, acute accent
&#254; | \376 | =FE | p | | (þ) | Small thorn, Icelandic
&#255; | \377 | =FF | y " | (ÿ) | Small y, dieresis or umlaut mark
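The HTML, OCTL and HEX notations are all derived from the same numeric code, so any row can be recomputed. Here is a small sketch in Python; the helper name `notations` and its dictionary keys are hypothetical, chosen only for this illustration, and the standard `quopri` module is used as a cross-check of the hexadecimal form.

```python
import quopri


def notations(code: int) -> dict:
    """Return the HTML, octal and hex notations for one ISO-8859-1 code."""
    if not 0 <= code <= 0xFF:
        raise ValueError("ISO-8859-1 codes lie in the range 0..255")
    return {
        "html": f"&#{code};",       # decimal numeric character reference
        "octal": f"\\{code:03o}",   # C-style octal escape
        "hex": f"={code:02X}",      # MIME quoted-printable form
        "chr": bytes([code]).decode("latin-1"),
    }


row = notations(0xE9)  # small e, acute accent
print(row["html"], row["octal"], row["hex"], row["chr"])

# Cross-check the hex column against the standard library encoder.
print(quopri.encodestring("\u00e9".encode("latin-1")))
```

The octal form is padded to three digits so that a following digit in a C string literal cannot be absorbed into the escape.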
Notes for HTML documents
HTML entity names are given in the "MEANING" column only for ampersand, quote, less than, and greater than, which are significant in HTML syntax; and for the non-breaking space, which may be confused with ordinary space. HTML entity names exist for many other characters, but they are largely superfluous: the ISO-8859-1 eight-bit codes work directly in any browser, provided the document is served and declared as ISO-8859-1.
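The relationship between entity names, decimal references and the characters themselves can be checked with Python's standard `html` module. This is only a sketch; the helper `entity_for` is a hypothetical name introduced here, not part of any library.

```python
import html
from html.entities import codepoint2name

# The characters that are significant in HTML syntax must be escaped.
print(html.escape('<a href="x">R&D</a>'))

# A named reference and a decimal reference decode to the same character.
same = html.unescape("&eacute;") == html.unescape("&#233;") == "\u00e9"

# The non-breaking space is code 160, distinct from the ASCII space (32).
nbsp = html.unescape("&nbsp;")


def entity_for(code: int) -> str:
    """Prefer the entity name when one is defined, else the decimal form."""
    name = codepoint2name.get(code)
    return f"&{name};" if name else f"&#{code};"


print(entity_for(38), entity_for(160), entity_for(65))
```

Falling back to the decimal form keeps the helper usable for characters such as the ASCII letters, which have no entity name.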
The characters carriage return (ASCII CR) and line feed (ASCII NL, newline) are equivalent; they are treated as whitespace, except in <pre> contexts, where they force a line break. (However, a line feed is ignored if it immediately follows a carriage return.)
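The rule in parentheses (a line feed directly after a carriage return does not count separately) amounts to newline normalization. A minimal sketch, with `normalize_newlines` as an illustrative name:

```python
def normalize_newlines(text: str) -> str:
    """Map CR LF pairs and lone CRs to a single LF each."""
    # Replace the CR LF pair first so it yields one line break, not two.
    return text.replace("\r\n", "\n").replace("\r", "\n")


sample = "one\r\ntwo\rthree\nfour"
print(normalize_newlines(sample).split("\n"))
```

The order of the two replacements matters: handling lone carriage returns first would turn every CR LF pair into two line breaks.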
The horizontal tab character (ASCII HT) skips to the next tabbing column in <pre> contexts, and is treated as whitespace elsewhere.
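Skipping to the next tabbing column can be imitated with Python's `str.expandtabs`. The tab width of 8 columns used below is an assumption (a common default), since the text above does not state one, and `expand_pre_tabs` is a hypothetical helper name.

```python
def expand_pre_tabs(line: str, tab_width: int = 8) -> str:
    """Replace each tab with spaces up to the next multiple of tab_width."""
    return line.expandtabs(tab_width)


for line in ("a\tb", "abc\tb", "abcdefgh\tb"):
    print(repr(expand_pre_tabs(line)))
```

Each tab advances to the next column that is a multiple of the tab width, so the number of spaces inserted depends on what precedes the tab.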
The non-breaking space (&nbsp;) is honored even in non-<pre> contexts, and can be used to insert extra space between words, images, etc.
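The difference between ordinary whitespace and the non-breaking space can be modeled by collapsing only the ASCII whitespace characters. This is a simplified sketch of that behavior, not a browser's actual layout algorithm; `collapse_whitespace` is an illustrative name.

```python
import re

# ASCII whitespace that is collapsed outside <pre>; the non-breaking
# space (U+00A0) is deliberately left out of this character class.
_HTML_SPACE = re.compile(r"[ \t\n\r\f]+")


def collapse_whitespace(text: str) -> str:
    """Collapse each run of ASCII whitespace into one ordinary space."""
    return _HTML_SPACE.sub(" ", text)


print(repr(collapse_whitespace("a  \n\t b")))
print(repr(collapse_whitespace("a\u00a0\u00a0\u00a0b")))
```

An explicit character class is used because Python's own `str.split()` with no arguments treats U+00A0 as whitespace and would discard the non-breaking spaces.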
Other notes
Alternative Sun/X11 "Compose" sequences for the Icelandic "thorn" are "t h" (þ, lowercase) and "T H" (Þ, uppercase).
Note that the Sun/X11 "Compose" sequence for masculine ordinal (º) uses an underscore, while the feminine ordinal (ª) uses a minus sign. It takes a lot of imagination to come up with such ideas...
Additional references
- Martin Ramsch's iso8859-1 table.
- The HTML 2.0 Standard [Character Entity Sets] [HTML Coded Character Set]
- The HTML 3.0 specification [Latin-1 Character Entities]
- The HTML+ Discussion Document [Appendix II]
- An exhaustive entity table including HTML 2.0, HTML 3.0, HTML+, with PostScript equivalents.
Composed by J. Stolfi from several sources found throughout the net.
http://www.ic.unicamp.br/~stolfi/EXPORT/www/ISO-8859-1-Encoding.html