Latin1的所有字符编码
ISO-8859-1 (ISO Latin 1) Character Encoding
Contents
- The characters at a glance
- Character codes and names
- Notes for html documents
- Other notes
- Additional references
The characters at a glance
Here are all the printable characters, in collating order:
! " # $ % & ' ( ) * + , - . /
0 1 2 3 4 5 6 7 8 9 : ; < = > ? @
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
[ \ ] ^ _ `
a b c d e f g h i j k l m n o p q r s t u v w x y z
{ | } ~ ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö
×
Ø Ù Ú Û Ü Ý Þ
ß
à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö
÷
ø ù ú û ü ý þ
ÿ
The first six rows are the ASCII character set.
Note the ordinary ASCII space (before `!') and the ISO Latin-1 non-breaking space (before `¡')
Character codes and names
The columns show, in order:
- HTML: the HTML notation (decimal);
- OCTL: the C/Modula-3 octal notation;
- HEX: the hexadecimal code, as used e.g. in MIME quoted-printable encoding;
- CMP: the Sun/X11 "Compose" key combinations;
- CHR: the charater itself, when HTML-printable;
- MEANING: the ISO-8859-1 (and ASCII) definition.
HTML OCTL HEX CMP CHR MEANING
------ + ---- + --- + --- + --- + ------------------------------
� | \000 | =00 | | | Invalid (ASCII NUL)
 | \001 | =01 | | | Unused (ASCII SOH)
 | \002 | =02 | | | Unused (ASCII STX)
 | \003 | =03 | | | Unused (ASCII ETX)
 | \004 | =04 | | | Unused (ASCII EOT)
 | \005 | =05 | | | Unused (ASCII ENQ)
 | \006 | =06 | | | Unused (ASCII ACK)
 | \007 | =07 | | | Unused (ASCII BEL, audible bell)
 | \010 | =08 | | | Unused (ASCII BS, backspace)
| \011 | =09 | | | Horizontal tab (ASCII HT)
| \012 | =0A | | | Line feed (ASCII NL, newline)
 | \013 | =0B | | | Unused (ASCII VT, vertical tab)
 | \014 | =0C | | | Unused (ASCII NP, new page)
| \015 | =0D | | | Carriage Return (ASCII CR)
 | \016 | =0E | | | Unused (ASCII SO)
 | \017 | =0F | | | Unused (ASCII SI)
 | \020 | =10 | | | Unused (ASCII DLE)
 | \021 | =11 | | | Unused (ASCII DC1)
 | \022 | =12 | | | Unused (ASCII DC2)
 | \023 | =13 | | | Unused (ASCII DC3)
 | \024 | =14 | | | Unused (ASCII DC4)
 | \025 | =15 | | | Unused (ASCII NAK)
 | \026 | =16 | | | Unused (ASCII SYN)
 | \027 | =17 | | | Unused (ASCII ETB)
 | \030 | =18 | | | Unused (ASCII CAN)
 | \031 | =19 | | | Unused (ASCII EM)
 | \032 | =1A | | | Unused (ASCII SUB)
 | \033 | =1B | | | Unused (ASCII ESC, escape)
 | \034 | =1C | | | Unused (ASCII FS)
 | \035 | =1D | | | Unused (ASCII GS)
 | \036 | =1E | | | Unused (ASCII RS)
 | \037 | =1F | | | Unused (ASCII US)
| \040 | =20 | | ( ) | Space (ASCII SP)
! | \041 | =21 | | (!) | Exclamation mark
" | \042 | =22 | | (") | Quotation mark (")
# | \043 | =23 | | (#) | Number sign
$ | \044 | =24 | | ($) | Dollar sign
% | \045 | =25 | | (%) | Percent sign
& | \046 | =26 | | (&) | Ampersand (&)
' | \047 | =27 | | (') | Apostrophe (right single quote)
( | \050 | =28 | | (() | Left parenthesis
) | \051 | =29 | | ()) | Right parenthesis
* | \052 | =2A | | (*) | Asterisk
+ | \053 | =2B | | (+) | Plus sign
, | \054 | =2C | | (,) | Comma
- | \055 | =2D | | (-) | Hyphen
. | \056 | =2E | | (.) | Period (fullstop)
/ | \057 | =2F | | (/) | Solidus (slash)
0 | \060 | =30 | | (0) | Digit 0
. . .
9 | \071 | =39 | | (9) | Digit 9
: | \072 | =3A | | (:) | Colon
; | \073 | =3B | | (;) | Semi-colon
< | \074 | =3C | | (<) | Less than (<)
= | \075 | =3D | | (=) | Equals sign
> | \076 | =3E | | (>) | Greater than (>)
? | \077 | =3F | | (?) | Question mark
@ | \100 | =40 | | (@) | Commercial at-sign
A | \101 | =41 | | (A) | Uppercase letter A
. . .
Z | \132 | =5A | | (Z) | Uppercase letter Z
[ | \133 | =5B | | ([) | Left square bracket
\ | \134 | =5C | | (\) | Reverse solidus (backslash)
] | \135 | =5D | | (]) | Right square bracket
^ | \136 | =5E | | (^) | Caret
_ | \137 | =5F | | (_) | Horizontal bar (underscore)
` | \140 | =60 | | (`) | Reverse apostrophe (left single quote)
a | \141 | =61 | | (a) | Lowercase letter a
. . .
z | \172 | =7A | | (z) | Lowercase letter z
{ | \173 | =7B | | ({) | Left curly brace
| | \174 | =7C | | (|) | Vertical bar
} | \175 | =7D | | (}) | Right curly brace
~ | \176 | =7E | | (~) | Tilde
| \177 | =7F | | | Unused (ASCII DEL)
| \200 | =80 | | | Unused
| \201 | =81 | | | Unused
| \202 | =82 | | | Unused
| \203 | =83 | | | Unused
| \204 | =84 | | | Unused
| \205 | =85 | | | Unused
| \206 | =86 | | | Unused
| \207 | =87 | | | Unused
| \210 | =88 | | | Unused
| \211 | =89 | | | Unused
| \212 | =8A | | | Unused
| \213 | =8B | | | Unused
| \214 | =8C | | | Unused
| \215 | =8D | | | Unused
| \216 | =8E | | | Unused
| \217 | =8F | | | Unused
| \220 | =90 | | | Unused
| \221 | =91 | | | Unused
| \222 | =92 | | | Unused
| \223 | =93 | | | Unused
| \224 | =94 | | | Unused
| \225 | =95 | | | Unused
| \226 | =96 | | | Unused
| \227 | =97 | | | Unused
| \230 | =98 | | | Unused
| \231 | =99 | | | Unused
| \232 | =9A | | | Unused
| \233 | =9B | | | Unused
| \234 | =9C | | | Unused
| \235 | =9D | | | Unused
| \236 | =9E | | | Unused
| \237 | =9F | | | Unused
| \240 | =A0 | | ( ) | Non-breaking space ( )
¡ | \241 | =A1 | ! ! | (¡) | Inverted exclamation
¢ | \242 | =A2 | c / | (¢) | Cent sign
£ | \243 | =A3 | l - | (£) | Pound sterling
¤ | \244 | =A4 | o x | (¤) | General currency sign
¥ | \245 | =A5 | y - | (¥) | Yen sign
¦ | \246 | =A6 | | | | (¦) | Broken vertical bar
§ | \247 | =A7 | s o | (§) | Section sign
¨ | \250 | =A8 | " " | (¨) | Umlaut (dieresis)
© | \251 | =A9 | c o | (©) | Copyright
ª | \252 | =AA | - a | (ª) | Feminine ordinal
« | \253 | =AB | < < | («) | Left angle quote, guillemotleft
¬ | \254 | =AC | - , | (¬) | Not sign
| \255 | =AD | - - | () | Soft hyphen
® | \256 | =AE | r o | (®) | Registered trademark
¯ | \257 | =AF | ^ - | (¯) | Macron accent
° | \260 | =B0 | ^ * | (°) | Degree sign
± | \261 | =B1 | + - | (±) | Plus or minus
² | \262 | =B2 | ^ 2 | (²) | Superscript two
³ | \263 | =B3 | ^ 3 | (³) | Superscript three
´ | \264 | =B4 | \ \ | (´) | Acute accent
µ | \265 | =B5 | / u | (µ) | Micro sign
¶ | \266 | =B6 | P ! | (¶) | Paragraph sign
· | \267 | =B7 | ^ . | (·) | Middle dot
¸ | \270 | =B8 | , , | (¸) | Cedilla
¹ | \271 | =B9 | ^ 1 | (¹) | Superscript one
º | \272 | =BA | _ o | (º) | Masculine ordinal
» | \273 | =BB | > > | (») | Right angle quote, guillemotright
¼ | \274 | =BC | 1 4 | (¼) | Fraction one-fourth
½ | \275 | =BD | 1 2 | (½) | Fraction one-half
¾ | \276 | =BE | 3 4 | (¾) | Fraction three-fourths
¿ | \277 | =BF | ? ? | (¿) | Inverted question mark
À | \300 | =C0 | A ` | (À) | Capital A, grave accent
Á | \301 | =C1 | A ' | (Á) | Capital A, acute accent
 | \302 | =C2 | A ^ | (Â) | Capital A, circumflex accent
à | \303 | =C3 | A ~ | (Ã) | Capital A, tilde
Ä | \304 | =C4 | A " | (Ä) | Capital A, dieresis or umlaut mark
Å | \305 | =C5 | A * | (Å) | Capital A, ring
Æ | \306 | =C6 | A E | (Æ) | Capital AE dipthong (ligature)
Ç | \307 | =C7 | C , | (Ç) | Capital C, cedilla
È | \310 | =C8 | E ` | (È) | Capital E, grave accent
É | \311 | =C9 | E ' | (É) | Capital E, acute accent
Ê | \312 | =CA | E ^ | (Ê) | Capital E, circumflex accent
Ë | \313 | =CB | E " | (Ë) | Capital E, dieresis or umlaut mark
Ì | \314 | =CC | I ` | (Ì) | Capital I, grave accent
Í | \315 | =CD | I ' | (Í) | Capital I, acute accent
Î | \316 | =CE | I ^ | (Î) | Capital I, circumflex accent
Ï | \317 | =CF | I " | (Ï) | Capital I, dieresis or umlaut mark
Ð | \320 | =D0 | D - | (Ð) | Capital Eth, Icelandic
Ñ | \321 | =D1 | N ~ | (Ñ) | Capital N, tilde
Ò | \322 | =D2 | O ` | (Ò) | Capital O, grave accent
Ó | \323 | =D3 | O ' | (Ó) | Capital O, acute accent
Ô | \324 | =D4 | O ^ | (Ô) | Capital O, circumflex accent
Õ | \325 | =D5 | O ~ | (Õ) | Capital O, tilde
Ö | \326 | =D6 | O " | (Ö) | Capital O, dieresis or umlaut mark
× | \327 | =D7 | x x | (×) | Multiply sign
Ø | \330 | =D8 | O / | (Ø) | Capital O, slash
Ù | \331 | =D9 | U ` | (Ù) | Capital U, grave accent
Ú | \332 | =DA | U ' | (Ú) | Capital U, acute accent
Û | \333 | =DB | U ^ | (Û) | Capital U, circumflex accent
Ü | \334 | =DC | U " | (Ü) | Capital U, dieresis or umlaut mark
Ý | \335 | =DD | Y ' | (Ý) | Capital Y, acute accent
Þ | \336 | =DE | P | | (Þ) | Capital THORN, Icelandic
ß | \337 | =DF | s s | (ß) | Small sharp s, German (sz ligature)
à | \340 | =E0 | a ` | (à) | Small a, grave accent
á | \341 | =E1 | a ' | (á) | Small a, acute accent
â | \342 | =E2 | a ^ | (â) | Small a, circumflex accent
ã | \343 | =E3 | a ~ | (ã) | Small a, tilde
ä | \344 | =E4 | a " | (ä) | Small a, dieresis or umlaut mark
å | \345 | =E5 | a * | (å) | Small a, ring
æ | \346 | =E6 | a e | (æ) | Small ae dipthong (ligature)
ç | \347 | =E7 | c , | (ç) | Small c, cedilla
è | \350 | =E8 | e ` | (è) | Small e, grave accent
é | \351 | =E9 | e ' | (é) | Small e, acute accent
ê | \352 | =EA | e ^ | (ê) | Small e, circumflex accent
ë | \353 | =EB | e " | (ë) | Small e, dieresis or umlaut mark
ì | \354 | =EC | i ` | (ì) | Small i, grave accent
í | \355 | =ED | i ' | (í) | Small i, acute accent
î | \356 | =EE | i ^ | (î) | Small i, circumflex accent
ï | \357 | =EF | i " | (ï) | Small i, dieresis or umlaut mark
ð | \360 | =F0 | d - | (ð) | Small eth, Icelandic
ñ | \361 | =F1 | n ~ | (ñ) | Small n, tilde
ò | \362 | =F2 | o ` | (ò) | Small o, grave accent
ó | \363 | =F3 | o ' | (ó) | Small o, acute accent
ô | \364 | =F4 | o ^ | (ô) | Small o, circumflex accent
õ | \365 | =F5 | o ~ | (õ) | Small o, tilde
ö | \366 | =F6 | o " | (ö) | Small o, dieresis or umlaut mark
÷ | \367 | =F7 | - : | (÷) | Division sign
ø | \370 | =F8 | o / | (ø) | Small o, slash
ù | \371 | =F9 | u ` | (ù) | Small u, grave accent
ú | \372 | =FA | u ' | (ú) | Small u, acute accent
û | \373 | =FB | u ^ | (û) | Small u, circumflex accent
ü | \374 | =FC | u " | (ü) | Small u, dieresis or umlaut mark
ý | \375 | =FD | y ' | (ý) | Small y, acute accent
þ | \376 | =FE | p | | (þ) | Small thorn, Icelandic
ÿ | \377 | =FF | y " | (ÿ) | Small y, dieresis or umlaut mark
Notes for HTML documents
HTML entity names are given in the "MEANING" column only for ampersand, quote, less than, and greater than, which are significant in HTML syntax; and for the non-breaking space, which may be confused with ordinary space. HTML entity names exist for many other characters, but they are superfluous: the ISO-8859-1 eight-bit codes will work, by definition, on any browser.
The characters carriage return (ASCII CR) and line feed (ASCII NL, newline) are equivalent; they are treated as whitespace, except in <pre> contexts, where they force a line break. (However, a line feed is ignored if it immediately follows a carriage return.)
The horizontal tab character (ASCII HT) skips to the next tabbing column in <pre> contexts, and is treated as whitespace elsewhere.
The non-breaking space ( ) is honored even in non-<pre> contexts, and can be used to insert extra space between words, images, etc., like this: | |.
Other notes
Alternative Sun/X11 "Compose" sequences for the Icelandic "thorn" are "t h" (þ, lowercase) and "T H" (Þ, uppercase).
Note that the Sun/X11 "Compose" sequence for masculine ordinal (º) uses an underscore, while the feminine ordinal (ª) uses a minus sign. It takes a lot of imagination to come up with such ideas...
Additional references
- Martin Ramsch's iso8859-1 table.
- The HTML 2.0 Standard [Character Entity Sets] [HTML Coded Character Set]
- The HTML 3.0 specification [Latin-1 Character Entities]
- The HTML+ Discussion Document [Appendix II]
- An exhaustive entity table including HTML 2.0, HTML3.0, HTML+, with Postscript equivalents.
Composed by J. Stolfi from several sources found throughout the net.
http://www.ic.unicamp.br/~stolfi/EXPORT/www/ISO-8859-1-Encoding.html
Latin1的所有字符编码的更多相关文章
- mysql命令行修改字符编码
1.修改数据库字符编码 mysql> alter database mydb character set utf8 ; 2.创建数据库时,指定数据库的字符编码 mysql> create ...
- mysql修改数据库编码(数据库字符集)和表的字符编码的方法
Mysql数据库是一个开源的数据库,应用非常广泛.以下是修改mysql数据库的字符编码的操作过程和将表的字符编码转换成utf-8的方法,需要的朋友可以参考下. mysql将表的字符编码转换成utf-8 ...
- mysql字符编码集(乱码)问题解决
1.创建数据库 CREATE DATABASE `test` CHARACTER SET 'utf8' COLLATE 'utf8_general_ci'; 创建表 CREATE TABLE tp_w ...
- 彻底解决mysql中文乱码的办法,修改mysql解压缩版(免安装版或zip版)字符编码
MySQL会出现中文乱码的原因不外乎下列几点:1.server本身设定问题,例如server字符编码还停留在latin12.table的语系设定问题(包含character与collation)3.客 ...
- 彻底搞清楚字符编码: ASCII, ISO_8859, GB2312,UCS, Unicode, Utf-8
彻底搞清楚字符编码: ASCII, ISO_8859, GB2312,UCS, Unicode, U 1.ASCII: 0-127(128-255未使用),美国标准 2.IS0-8859-1(lati ...
- 修改数据库mysql字符编码为UTF8
Mysql数据库是一个开源的数据库,应用非常广泛.以下是修改mysql数据库的字符编码的操作过程. 步骤1:查看当前的字符编码方法 mysql> show variables like'char ...
- Linux 下查看文件字符编码和转换编码
Linux 下查看文件字符编码和转换编码 如果你需要在Linux中操作windows下的文件,那么你可能会经常遇到文件编码转换的问题.Windows中默认的文件格式是GBK(gb2312),而Linu ...
- vim字符编码设置
vim 编码方式的设置 和所有的流行文本编辑器一样,Vim 可以很好的编辑各种字符编码的文件,这当然包括UCS-2.UTF-8 等流行的 Unicode 编码方式.然而不幸的是,和很多来自 Linux ...
- linux mysql字符编码问题
发布:thatboy 来源:脚本学堂 [大 中 小] 本文介绍下,linux环境中mysql字符编码问题的解决办法,有遇到mysql编码问题的朋友,可以参考下本文的介绍,希望对你有一定的帮 ...
随机推荐
- Solr搜索结果说明
在admin页面,输入相关内容后,会返回xml格式的内容.说明如下: [html] view plaincopy <?xml version="1.0" encoding=& ...
- ios网络学习------1get post异步请求
网络请求的步骤: get请求: #pragma mark - 这是私有方法,尽量不要再方法中直接使用属性,由于一般来说属性都是和界面关联的,我们能够通过參数的方式来使用属性 #pragma mark ...
- 超详细SDK Hello World
Windows应用程序的基本运行机制与HelloWin程序详细解 总的来说最基本的Windows应用程序的运行执行顺序总是以如下的基本顺序执行的. 顺序结构: 调用WinMain函数开始执行--à定义 ...
- 基于visual Studio2013解决C语言竞赛题之1041反向打印
题目 解决代码及点评 /* 功能:将一个整数(最多是10位数)从低位到高位打印出来, 如该数是12345时,输出应是54321(只占一行) 时间:19:18 201 ...
- CentOS下利用sshpass不用手动输入密码远程执行命令
在测试的时候要同时操作多台机器,每次都要挨个去执行几乎相同的命令或者修改一些设置,这样很影响工作效率也很烦,所以就想写一个脚本,远程自动去做这些操作.远程执行命令很简单,但是不能在执行命令加上命 ...
- 对于一颗完全二叉树,要求给所有节点加上一个pNext指针,指向同一层的相邻节点-----层序遍历的应用题
题目:对于一颗完全二叉树,要求给所有节点加上一个pNext指针,指向同一层的相邻节点:如果当前节点已经是该层的最后一个节点,则将pNext指针指向NULL:给出程序实现,并分析时间复杂度和空间复杂度. ...
- boost::asio async_write也不能保证一次发完所有数据 二
只有看boost源码才能弄明白发生了什么.首先我是将vector里面写入了数据,然后用boost::asio::buffer将vector构造成了mutable_buffer_1对象. 参考该文档的重 ...
- POJ3189_Steady Cow Assignment(二分图多重匹配/网络流+二分构图)
解题报告 http://blog.csdn.net/juncoder/article/details/38340447 题目传送门 题意: B个猪圈,N头猪.每头猪对每一个猪圈有一个惬意值.要求安排这 ...
- <转载>CSS解决图片过大撑破DIV的方法
DIV+CSS网页内容中如果插入大于DIV层宽度显示,过大的图片将会撑破网页宽度显示从而网页严重变形,您是否遇到过?这里DIVCSS5给大家介绍几种解决图片撑破撑开网页DIV层方法. 图片撑破宽度解决 ...
- 解决Ajax.BeginForm还是刷新页面的问题
在.net mvc中用Ajax.BeginForm来实现异步提交,在Ajax.BeginForm里面还是可以用submit按钮,一般来说 submit按钮是提交整个页面的数据.但是在Ajax.Begi ...