Latin1的所有字符编码
ISO-8859-1 (ISO Latin 1) Character Encoding
Contents
- The characters at a glance
- Character codes and names
- Notes for html documents
- Other notes
- Additional references
The characters at a glance
Here are all the printable characters, in collating order:
! " # $ % & ' ( ) * + , - . /
0 1 2 3 4 5 6 7 8 9 : ; < = > ? @
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
[ \ ] ^ _ `
a b c d e f g h i j k l m n o p q r s t u v w x y z
{ | } ~ ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö
×
Ø Ù Ú Û Ü Ý Þ
ß
à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö
÷
ø ù ú û ü ý þ
ÿ
The first six rows are the ASCII character set.
Note the ordinary ASCII space (before `!') and the ISO Latin-1 non-breaking space (before `¡')
Character codes and names
The columns show, in order:
- HTML: the HTML notation (decimal);
- OCTL: the C/Modula-3 octal notation;
- HEX: the hexadecimal code, as used e.g. in MIME quoted-printable encoding;
- CMP: the Sun/X11 "Compose" key combinations;
- CHR: the charater itself, when HTML-printable;
- MEANING: the ISO-8859-1 (and ASCII) definition.
HTML OCTL HEX CMP CHR MEANING
------ + ---- + --- + --- + --- + ------------------------------
� | \000 | =00 | | | Invalid (ASCII NUL)
 | \001 | =01 | | | Unused (ASCII SOH)
 | \002 | =02 | | | Unused (ASCII STX)
 | \003 | =03 | | | Unused (ASCII ETX)
 | \004 | =04 | | | Unused (ASCII EOT)
 | \005 | =05 | | | Unused (ASCII ENQ)
 | \006 | =06 | | | Unused (ASCII ACK)
 | \007 | =07 | | | Unused (ASCII BEL, audible bell)
 | \010 | =08 | | | Unused (ASCII BS, backspace)
| \011 | =09 | | | Horizontal tab (ASCII HT)
| \012 | =0A | | | Line feed (ASCII NL, newline)
 | \013 | =0B | | | Unused (ASCII VT, vertical tab)
 | \014 | =0C | | | Unused (ASCII NP, new page)
| \015 | =0D | | | Carriage Return (ASCII CR)
 | \016 | =0E | | | Unused (ASCII SO)
 | \017 | =0F | | | Unused (ASCII SI)
 | \020 | =10 | | | Unused (ASCII DLE)
 | \021 | =11 | | | Unused (ASCII DC1)
 | \022 | =12 | | | Unused (ASCII DC2)
 | \023 | =13 | | | Unused (ASCII DC3)
 | \024 | =14 | | | Unused (ASCII DC4)
 | \025 | =15 | | | Unused (ASCII NAK)
 | \026 | =16 | | | Unused (ASCII SYN)
 | \027 | =17 | | | Unused (ASCII ETB)
 | \030 | =18 | | | Unused (ASCII CAN)
 | \031 | =19 | | | Unused (ASCII EM)
 | \032 | =1A | | | Unused (ASCII SUB)
 | \033 | =1B | | | Unused (ASCII ESC, escape)
 | \034 | =1C | | | Unused (ASCII FS)
 | \035 | =1D | | | Unused (ASCII GS)
 | \036 | =1E | | | Unused (ASCII RS)
 | \037 | =1F | | | Unused (ASCII US)
| \040 | =20 | | ( ) | Space (ASCII SP)
! | \041 | =21 | | (!) | Exclamation mark
" | \042 | =22 | | (") | Quotation mark (")
# | \043 | =23 | | (#) | Number sign
$ | \044 | =24 | | ($) | Dollar sign
% | \045 | =25 | | (%) | Percent sign
& | \046 | =26 | | (&) | Ampersand (&)
' | \047 | =27 | | (') | Apostrophe (right single quote)
( | \050 | =28 | | (() | Left parenthesis
) | \051 | =29 | | ()) | Right parenthesis
* | \052 | =2A | | (*) | Asterisk
+ | \053 | =2B | | (+) | Plus sign
, | \054 | =2C | | (,) | Comma
- | \055 | =2D | | (-) | Hyphen
. | \056 | =2E | | (.) | Period (fullstop)
/ | \057 | =2F | | (/) | Solidus (slash)
0 | \060 | =30 | | (0) | Digit 0
. . .
9 | \071 | =39 | | (9) | Digit 9
: | \072 | =3A | | (:) | Colon
; | \073 | =3B | | (;) | Semi-colon
< | \074 | =3C | | (<) | Less than (<)
= | \075 | =3D | | (=) | Equals sign
> | \076 | =3E | | (>) | Greater than (>)
? | \077 | =3F | | (?) | Question mark
@ | \100 | =40 | | (@) | Commercial at-sign
A | \101 | =41 | | (A) | Uppercase letter A
. . .
Z | \132 | =5A | | (Z) | Uppercase letter Z
[ | \133 | =5B | | ([) | Left square bracket
\ | \134 | =5C | | (\) | Reverse solidus (backslash)
] | \135 | =5D | | (]) | Right square bracket
^ | \136 | =5E | | (^) | Caret
_ | \137 | =5F | | (_) | Horizontal bar (underscore)
` | \140 | =60 | | (`) | Reverse apostrophe (left single quote)
a | \141 | =61 | | (a) | Lowercase letter a
. . .
z | \172 | =7A | | (z) | Lowercase letter z
{ | \173 | =7B | | ({) | Left curly brace
| | \174 | =7C | | (|) | Vertical bar
} | \175 | =7D | | (}) | Right curly brace
~ | \176 | =7E | | (~) | Tilde
| \177 | =7F | | | Unused (ASCII DEL)
| \200 | =80 | | | Unused
| \201 | =81 | | | Unused
| \202 | =82 | | | Unused
| \203 | =83 | | | Unused
| \204 | =84 | | | Unused
| \205 | =85 | | | Unused
| \206 | =86 | | | Unused
| \207 | =87 | | | Unused
| \210 | =88 | | | Unused
| \211 | =89 | | | Unused
| \212 | =8A | | | Unused
| \213 | =8B | | | Unused
| \214 | =8C | | | Unused
| \215 | =8D | | | Unused
| \216 | =8E | | | Unused
| \217 | =8F | | | Unused
| \220 | =90 | | | Unused
| \221 | =91 | | | Unused
| \222 | =92 | | | Unused
| \223 | =93 | | | Unused
| \224 | =94 | | | Unused
| \225 | =95 | | | Unused
| \226 | =96 | | | Unused
| \227 | =97 | | | Unused
| \230 | =98 | | | Unused
| \231 | =99 | | | Unused
| \232 | =9A | | | Unused
| \233 | =9B | | | Unused
| \234 | =9C | | | Unused
| \235 | =9D | | | Unused
| \236 | =9E | | | Unused
| \237 | =9F | | | Unused
| \240 | =A0 | | ( ) | Non-breaking space ( )
¡ | \241 | =A1 | ! ! | (¡) | Inverted exclamation
¢ | \242 | =A2 | c / | (¢) | Cent sign
£ | \243 | =A3 | l - | (£) | Pound sterling
¤ | \244 | =A4 | o x | (¤) | General currency sign
¥ | \245 | =A5 | y - | (¥) | Yen sign
¦ | \246 | =A6 | | | | (¦) | Broken vertical bar
§ | \247 | =A7 | s o | (§) | Section sign
¨ | \250 | =A8 | " " | (¨) | Umlaut (dieresis)
© | \251 | =A9 | c o | (©) | Copyright
ª | \252 | =AA | - a | (ª) | Feminine ordinal
« | \253 | =AB | < < | («) | Left angle quote, guillemotleft
¬ | \254 | =AC | - , | (¬) | Not sign
| \255 | =AD | - - | () | Soft hyphen
® | \256 | =AE | r o | (®) | Registered trademark
¯ | \257 | =AF | ^ - | (¯) | Macron accent
° | \260 | =B0 | ^ * | (°) | Degree sign
± | \261 | =B1 | + - | (±) | Plus or minus
² | \262 | =B2 | ^ 2 | (²) | Superscript two
³ | \263 | =B3 | ^ 3 | (³) | Superscript three
´ | \264 | =B4 | \ \ | (´) | Acute accent
µ | \265 | =B5 | / u | (µ) | Micro sign
¶ | \266 | =B6 | P ! | (¶) | Paragraph sign
· | \267 | =B7 | ^ . | (·) | Middle dot
¸ | \270 | =B8 | , , | (¸) | Cedilla
¹ | \271 | =B9 | ^ 1 | (¹) | Superscript one
º | \272 | =BA | _ o | (º) | Masculine ordinal
» | \273 | =BB | > > | (») | Right angle quote, guillemotright
¼ | \274 | =BC | 1 4 | (¼) | Fraction one-fourth
½ | \275 | =BD | 1 2 | (½) | Fraction one-half
¾ | \276 | =BE | 3 4 | (¾) | Fraction three-fourths
¿ | \277 | =BF | ? ? | (¿) | Inverted question mark
À | \300 | =C0 | A ` | (À) | Capital A, grave accent
Á | \301 | =C1 | A ' | (Á) | Capital A, acute accent
 | \302 | =C2 | A ^ | (Â) | Capital A, circumflex accent
à | \303 | =C3 | A ~ | (Ã) | Capital A, tilde
Ä | \304 | =C4 | A " | (Ä) | Capital A, dieresis or umlaut mark
Å | \305 | =C5 | A * | (Å) | Capital A, ring
Æ | \306 | =C6 | A E | (Æ) | Capital AE dipthong (ligature)
Ç | \307 | =C7 | C , | (Ç) | Capital C, cedilla
È | \310 | =C8 | E ` | (È) | Capital E, grave accent
É | \311 | =C9 | E ' | (É) | Capital E, acute accent
Ê | \312 | =CA | E ^ | (Ê) | Capital E, circumflex accent
Ë | \313 | =CB | E " | (Ë) | Capital E, dieresis or umlaut mark
Ì | \314 | =CC | I ` | (Ì) | Capital I, grave accent
Í | \315 | =CD | I ' | (Í) | Capital I, acute accent
Î | \316 | =CE | I ^ | (Î) | Capital I, circumflex accent
Ï | \317 | =CF | I " | (Ï) | Capital I, dieresis or umlaut mark
Ð | \320 | =D0 | D - | (Ð) | Capital Eth, Icelandic
Ñ | \321 | =D1 | N ~ | (Ñ) | Capital N, tilde
Ò | \322 | =D2 | O ` | (Ò) | Capital O, grave accent
Ó | \323 | =D3 | O ' | (Ó) | Capital O, acute accent
Ô | \324 | =D4 | O ^ | (Ô) | Capital O, circumflex accent
Õ | \325 | =D5 | O ~ | (Õ) | Capital O, tilde
Ö | \326 | =D6 | O " | (Ö) | Capital O, dieresis or umlaut mark
× | \327 | =D7 | x x | (×) | Multiply sign
Ø | \330 | =D8 | O / | (Ø) | Capital O, slash
Ù | \331 | =D9 | U ` | (Ù) | Capital U, grave accent
Ú | \332 | =DA | U ' | (Ú) | Capital U, acute accent
Û | \333 | =DB | U ^ | (Û) | Capital U, circumflex accent
Ü | \334 | =DC | U " | (Ü) | Capital U, dieresis or umlaut mark
Ý | \335 | =DD | Y ' | (Ý) | Capital Y, acute accent
Þ | \336 | =DE | P | | (Þ) | Capital THORN, Icelandic
ß | \337 | =DF | s s | (ß) | Small sharp s, German (sz ligature)
à | \340 | =E0 | a ` | (à) | Small a, grave accent
á | \341 | =E1 | a ' | (á) | Small a, acute accent
â | \342 | =E2 | a ^ | (â) | Small a, circumflex accent
ã | \343 | =E3 | a ~ | (ã) | Small a, tilde
ä | \344 | =E4 | a " | (ä) | Small a, dieresis or umlaut mark
å | \345 | =E5 | a * | (å) | Small a, ring
æ | \346 | =E6 | a e | (æ) | Small ae dipthong (ligature)
ç | \347 | =E7 | c , | (ç) | Small c, cedilla
è | \350 | =E8 | e ` | (è) | Small e, grave accent
é | \351 | =E9 | e ' | (é) | Small e, acute accent
ê | \352 | =EA | e ^ | (ê) | Small e, circumflex accent
ë | \353 | =EB | e " | (ë) | Small e, dieresis or umlaut mark
ì | \354 | =EC | i ` | (ì) | Small i, grave accent
í | \355 | =ED | i ' | (í) | Small i, acute accent
î | \356 | =EE | i ^ | (î) | Small i, circumflex accent
ï | \357 | =EF | i " | (ï) | Small i, dieresis or umlaut mark
ð | \360 | =F0 | d - | (ð) | Small eth, Icelandic
ñ | \361 | =F1 | n ~ | (ñ) | Small n, tilde
ò | \362 | =F2 | o ` | (ò) | Small o, grave accent
ó | \363 | =F3 | o ' | (ó) | Small o, acute accent
ô | \364 | =F4 | o ^ | (ô) | Small o, circumflex accent
õ | \365 | =F5 | o ~ | (õ) | Small o, tilde
ö | \366 | =F6 | o " | (ö) | Small o, dieresis or umlaut mark
÷ | \367 | =F7 | - : | (÷) | Division sign
ø | \370 | =F8 | o / | (ø) | Small o, slash
ù | \371 | =F9 | u ` | (ù) | Small u, grave accent
ú | \372 | =FA | u ' | (ú) | Small u, acute accent
û | \373 | =FB | u ^ | (û) | Small u, circumflex accent
ü | \374 | =FC | u " | (ü) | Small u, dieresis or umlaut mark
ý | \375 | =FD | y ' | (ý) | Small y, acute accent
þ | \376 | =FE | p | | (þ) | Small thorn, Icelandic
ÿ | \377 | =FF | y " | (ÿ) | Small y, dieresis or umlaut mark
Notes for HTML documents
HTML entity names are given in the "MEANING" column only for ampersand, quote, less than, and greater than, which are significant in HTML syntax; and for the non-breaking space, which may be confused with ordinary space. HTML entity names exist for many other characters, but they are superfluous: the ISO-8859-1 eight-bit codes will work, by definition, on any browser.
The characters carriage return (ASCII CR) and line feed (ASCII NL, newline) are equivalent; they are treated as whitespace, except in <pre> contexts, where they force a line break. (However, a line feed is ignored if it immediately follows a carriage return.)
The horizontal tab character (ASCII HT) skips to the next tabbing column in <pre> contexts, and is treated as whitespace elsewhere.
The non-breaking space ( ) is honored even in non-<pre> contexts, and can be used to insert extra space between words, images, etc., like this: | |.
Other notes
Alternative Sun/X11 "Compose" sequences for the Icelandic "thorn" are "t h" (þ, lowercase) and "T H" (Þ, uppercase).
Note that the Sun/X11 "Compose" sequence for masculine ordinal (º) uses an underscore, while the feminine ordinal (ª) uses a minus sign. It takes a lot of imagination to come up with such ideas...
Additional references
- Martin Ramsch's iso8859-1 table.
- The HTML 2.0 Standard [Character Entity Sets] [HTML Coded Character Set]
- The HTML 3.0 specification [Latin-1 Character Entities]
- The HTML+ Discussion Document [Appendix II]
- An exhaustive entity table including HTML 2.0, HTML3.0, HTML+, with Postscript equivalents.
Composed by J. Stolfi from several sources found throughout the net.
http://www.ic.unicamp.br/~stolfi/EXPORT/www/ISO-8859-1-Encoding.html
Latin1的所有字符编码的更多相关文章
- mysql命令行修改字符编码
1.修改数据库字符编码 mysql> alter database mydb character set utf8 ; 2.创建数据库时,指定数据库的字符编码 mysql> create ...
- mysql修改数据库编码(数据库字符集)和表的字符编码的方法
Mysql数据库是一个开源的数据库,应用非常广泛.以下是修改mysql数据库的字符编码的操作过程和将表的字符编码转换成utf-8的方法,需要的朋友可以参考下. mysql将表的字符编码转换成utf-8 ...
- mysql字符编码集(乱码)问题解决
1.创建数据库 CREATE DATABASE `test` CHARACTER SET 'utf8' COLLATE 'utf8_general_ci'; 创建表 CREATE TABLE tp_w ...
- 彻底解决mysql中文乱码的办法,修改mysql解压缩版(免安装版或zip版)字符编码
MySQL会出现中文乱码的原因不外乎下列几点:1.server本身设定问题,例如server字符编码还停留在latin12.table的语系设定问题(包含character与collation)3.客 ...
- 彻底搞清楚字符编码: ASCII, ISO_8859, GB2312,UCS, Unicode, Utf-8
彻底搞清楚字符编码: ASCII, ISO_8859, GB2312,UCS, Unicode, U 1.ASCII: 0-127(128-255未使用),美国标准 2.IS0-8859-1(lati ...
- 修改数据库mysql字符编码为UTF8
Mysql数据库是一个开源的数据库,应用非常广泛.以下是修改mysql数据库的字符编码的操作过程. 步骤1:查看当前的字符编码方法 mysql> show variables like'char ...
- Linux 下查看文件字符编码和转换编码
Linux 下查看文件字符编码和转换编码 如果你需要在Linux中操作windows下的文件,那么你可能会经常遇到文件编码转换的问题.Windows中默认的文件格式是GBK(gb2312),而Linu ...
- vim字符编码设置
vim 编码方式的设置 和所有的流行文本编辑器一样,Vim 可以很好的编辑各种字符编码的文件,这当然包括UCS-2.UTF-8 等流行的 Unicode 编码方式.然而不幸的是,和很多来自 Linux ...
- linux mysql字符编码问题
发布:thatboy 来源:脚本学堂 [大 中 小] 本文介绍下,linux环境中mysql字符编码问题的解决办法,有遇到mysql编码问题的朋友,可以参考下本文的介绍,希望对你有一定的帮 ...
随机推荐
- 清华集训2014 day1 task1 玛里苟斯
题目 这可算是描述很简单的一道题了!但是不简单. \(S\)是一个可重集合,\(S = \{a_1, a_2, \dots, a_n \}\). 等概率随机取\(S\)的一个子集\(A = \{a_{ ...
- HNCU1741:算法3-2:行编辑程序
http://hncu.acmclub.com/index.php?app=problem_title&id=111&problem_id=1741 题目描述 一个简单的行编辑程序的功 ...
- 浅析SSH核心原理(二)
Hibernate是一个开放源代码的ORM(对象-关系映射)框架,它对JDBC进行了非常轻量级的对象封装,使得Java程序员可以随心所欲的使用对象编程思维来操纵数据库. Hibernate可以应用在任 ...
- 基于visual Studio2013解决C语言竞赛题之1013字符串查找
题目 解决代码及点评 /* 功能:编写函数IND,让它判断一个字符串是否为另一个字符串的子串的功能,若是则返回第一次出现的起始位置,否则返回0 时间:13:55 2013 ...
- 刘德华夏日Fiesta演唱会上那个表演探戈舞的演员是谁啊?_百度知道
刘德华夏日Fiesta演唱会上那个表演探戈舞的演员是谁啊?_百度知道 刘德华夏日Fiesta演唱会上那个表演探戈舞的演员是谁啊? 2008-05-28 00:04 topofhill | ...
- HTML+CSS - 前端设计的小技巧(持续更新......)
2015年7月6日20:28:20 1.设置文字的居中,非控件内的. :text-alain:center 2.图片在ASP.NET中,可以直接拖放到界面,自动形成img控件. 3.CSS直接在全局样 ...
- 用c#实现单链表(程序代码已经验证,完全正确)
1.程序的大致结构如下图: 2.下面依次列出各个类的代码 ①ILISTDs.cs 这是一个接口类,列出单链表的方法 using System; using System.Collections.Ge ...
- perl .*?和.*
redis01:/root# cat x2.pl my $str="212121a19823a456123"; if ($str =~/.*a(.*)23/){print &quo ...
- 请问,如何在windows系统下面同时使用中文和英文的cmd?_百度知道
请问,如何在windows系统下面同时使用中文和英文的cmd?_百度知道 在批处理开始加一行chcp 437就是英文的cmdchcp 936就是中文的cmd
- gridView自己定义做时间排版
公司有需求要做一个时间排版,原型例如以下 由于要用的gridView,曾经就是它的排版非常多,最看是想用一个checkbox搞定. 后来证实功能可以达到.可是排版是乱的.还是老老实实多写点吧(直接上代 ...