几种 HtmlEncode 的区别(转发）

问题：

HttpUtility.HtmlDecode ，HttpUtility.HtmlEncode 与 Server.HtmlDecode ，Server.HtmlEncode 与 HttpServerUtility.HtmlDecode ， HttpServerUtility.HtmlEncode 有什么区别？

他们与下面一般手工写的代码有什么不一样的？

public static string htmlencode(string str)

 {

        if (str == null || str == "")

            return "";

        str = str.Replace(">", "&gt;");

        str = str.Replace(" <", "&lt;");

        str = str.Replace(" ", "&nbsp;");

        str = str.Replace("  ", " &nbsp;");

        str = str.Replace("\"", "&quot;");

        str = str.Replace("\'", "'");

        str = str.Replace("\n", " <br/> ");

        return str;

}

答案：

HtmlEncode：将 Html 源文件中不允许出现的字符进行编码，通常是编码以下字符"<"、">"、"&" 等。

HtmlDecode：刚好跟 HtmlEncode 相关，解码出来原本的字符。

HttpServerUtility 实体类的 HtmlEncode 方法是一种简便方式，用于在运行时从 ASP.NET Web 应用程序访问 System.Web.HttpUtility.HtmlEncode 方法。HttpServerUtility 实体类的 HtmlEncode 方法在内部使用 System.Web.HttpUtility.HtmlEncode 对字符串进行编码。

Server.HtmlEncode 其实就是 System.Web.UI.Page 类封装的 HttpServerUtility 实体类的 HtmlEncode 方法； System.Web.UI.Page 类有这样的一个属性： public HttpServerUtility Server { get; }

所以我们可以认为：

Server.HtmlDecode = HttpServerUtility 实体类的 HtmlDecode 方法 = HttpUtility.HtmlDecode ;

Server.HtmlEncode = HttpServerUtility 实体类的 HtmlEncode 方法 = HttpUtility.HtmlEncode ;

他们只不过是为了调用方便，做了封装而已。

在 ASP 中， Server.HTMLEncode Method 过滤的字符描述如下：

如果字符串不是 DBCS 编码。这个方法将转换下面字符：

less-than character (<)	<
greater-than character (>)	>
ampersand character (&)	&
double-quote character (")	"
Any ASCII code character whose code is greater-than or equal to 0x80	&#<number>, where <number> is the ASCII character value.

如果是 DBCS 编码

All extended characters are converted.
Any ASCII code character whose code is greater-than or equal to 0x80 is converted to &#<number>, where <number> is the ASCII character value.
Half-width Katakana characters in the Japanese code page are not converted.

相关资料：

Server.HTMLEncode Method

http://msdn.microsoft.com/en-us/library/ms525347.aspx

在ASP.net 中情况也类似

下面是一个简单的替换测试代码，测试结果看之后的注释：

protected void Page_Load(object sender, EventArgs e)

{

    TestChar("<"); // 小于号    替换   &lt;

    TestChar(">"); // 大于号    替换   &gt;

    TestChar("'"); // 单引号    替换   '

    TestChar(" "); // 半角英文空格    不做替换

    TestChar(" "); // 全角中文空格    不做替换

    TestChar("&"); // &    替换   &amp;

    TestChar("\""); // 英文双引号    替换   &quot;

    TestChar("\n"); // 回车    不做替换

    TestChar("\r"); // 回车    不做替换

    TestChar("\r\n"); // 回车    不做替换

}

public void TestChar(string t)

{

    Response.Write(Server.HtmlEncode(t));

    Response.Write("__");

    Response.Write(HttpUtility.HtmlEncode(t));

    Response.Write("<br />");

}

所以上面我们提到的常用替换方式还是非常有用的，他还处理了一些 HttpUtility.HtmlEncode 不支持的替换。

public static string htmlencode(string str)

{

    if (str == null || str == "")

        return "";

    str = str.Replace(">", "&gt;");

    str = str.Replace(" <", "&lt;");

    str = str.Replace(" ", "&nbsp;");       // HttpUtility.HtmlEncode( 并不支持这个替换

    str = str.Replace("  ", " &nbsp;");     // HttpUtility.HtmlEncode( 并不支持这个替换

    str = str.Replace("\"", "&quot;");

    str = str.Replace("\'", "'");

    str = str.Replace("\n", " <br/> ");     // HttpUtility.HtmlEncode( 并不支持这个替换

    return str;

}

我们使用 Reflector 查看 HttpUtility.HtmlEncode 的实现，我们就可以看到，它只考虑的五种情况，空格，回车是没有处理的：

使用 Reflector 查看 HttpUtility.HtmlEncode 实现代码其中最重要的代码如下：

public static unsafe void HtmlEncode(string value, TextWriter output)

{

    if (value != null)

    {

        if (output == null)

        {

            throw new ArgumentNullException("output");

        }

        int num = IndexOfHtmlEncodingChars(value, );

        if (num == -)

        {

            output.Write(value);

        }

        else

        {

            int num2 = value.Length - num;

            fixed (char* str = ((char*) value))

            {

                char* chPtr = str;

                char* chPtr2 = chPtr;

                while (num-- > )

                {

                    chPtr2++;

                    output.Write(chPtr2[]);

                }

                while (num2-- > )

                {

                    chPtr2++;

                    char ch = chPtr2[];

                    if (ch <= '>')

                    {

                        switch (ch)

                        {

                            case '&':

                            {

                                output.Write("&amp;");

                                continue;

                            }

                            case '\'':

                            {

                                output.Write("'");

                                continue;

                            }

                            case '"':

                            {

                                output.Write("&quot;");

                                continue;

                            }

                            case '<':

                            {

                                output.Write("&lt;");

                                continue;

                            }

                            case '>':

                            {

                                output.Write("&gt;");

                                continue;

                            }

                        }

                        output.Write(ch);

                        continue;

                    }

                    if ((ch >= '\x00a0') && (ch < 'ā'))

                    {

                        output.Write("&#");

                        output.Write(((int) ch).ToString(NumberFormatInfo.InvariantInfo));

                        output.Write(';');

                    }

                    else

                    {

                        output.Write(ch);

                    }

                }

            }

        }

    }

}

参考资料：

HttpUtility.HtmlDecode与Server.HtmlDecode区别

http://topic.csdn.net/u/20090220/11/110c8079-1632-418a-b43b-3ddb2f0a06e2.html

詳細解說幾個建置網站時常用的編碼方法

http://blog.miniasp.com/?tag=/htmlencode

用于 Silverlight 的 .NET Framework 类库HttpUtility.HtmlEncode 方法

http://msdn.microsoft.com/zh-cn/library/system.windows.browser.httputility.htmlencode(VS.95).aspx

HttpUtility.HtmlEncode() and HttpServerUtility.HtmlEncode() do not encode all non-ASCII characters

https://connect.microsoft.com/VisualStudio/feedback/details/102251/httputility-htmlencode-and-httpserverutility-htmlencode-do-not-encode-all-non-ascii-characters?wa=wsignin1.0

转自：http://blog.joycode.com/ghj/archives/2010/02/26/115894.joy

几种 HtmlEncode 的区别(转发）的更多相关文章

几种HtmlEncode的区别(转)
一.C#中的编码 HttpUtility.HtmlDecode.HttpUtility.HtmlEncode与Server.HtmlDecode.Server.HtmlEncode与HttpServe ...
(转)几种HtmlEncode的区别
一.C#中的编码 HttpUtility.HtmlDecode.HttpUtility.HtmlEncode与Server.HtmlDecode.Server.HtmlEncode与HttpServe ...
Java中serialVersionUID的解释及两种生成方式的区别（转载）
转载自:http://blog.csdn.net/xuanxiaochuan/article/details/25052057 serialVersionUID作用: 序列化时为了保持版 ...
链接属性rel=’external’、rel=’nofollow’、rel=’external nofollow’三种写法的区别
链接属性rel='external'.rel='nofollow'.rel='external nofollow'三种写法的区别大家应该都知道rel='nofllow'的作用,它是告诉搜索引擎, ...
jsp中两种include的区别【转】
引用文章:http://www.ibm.com/developerworks/cn/java/j-jsp04293/ http://www.cnblogs.com/lazycoding/archive ...
UIImage两种初始化的区别
UIImage可以通过以下两种方式进行初始化: //第一种初始化方式:[注意使用这种初始化的时候如果是png格式的可以不给后缀名,根据屏幕的的分辨率去匹配图片] UIImage *image = [U ...
Linux 下Shell 脚本几种基本命令替换区别
Shell 脚本几种基本命令替换区别前言:因为工作需要,需要编写 shell script .编写大量 shell script 时,累计了大量经验,也让自己开始迷糊几种函数输出调用的区别.后面和 ...
PHP中数组合并的两种方法及区别介绍
PHP数组合并两种方法及区别如果是关联数组,如下: 复制代码代码如下: $a = array( 'where' => 'uid=1', 'order' => 'uid', ); $b = ...
执行shell脚本的几种方法及区别
执行shell脚本的几种方法及区别 http://blog.csdn.net/lanxinju/article/details/6032368 (认真看) 注意:如果涉及到脚本之间的调用一定要用 . ...

随机推荐

怎么去掉Xcode工程中的某种类型的警告
XCode警告问题描述在我们的项目中,通常使用了大量的第三方代码,这些代码可能很复杂,我们不敢改动他们,可是作者已经停止更新了,当sdk升级或者是编译器升级后,这些遗留的代码可能会出现许许多 ...
jsp页面元素和内置对象
java server pages其根本是一个简化的servlet设计.实现了在java当中使用html标签.javaEE标准一.页面元素 1.静态内容 html.js.css相关标签元素. 2.指 ...
Hadoop MapReduce概念学习系列之mr程序组件全貌（二十）
其实啊,spilt是,控制Apache Hadoop Mapreduce的map并发任务数,详细见http://www.cnblogs.com/zlslch/p/5713652.html map,是m ...
linux极点五笔无法输入词组_ibus设置
菜鸟学linux——用的是ubuntu 不知道是不是按个哪些快捷键,极点五笔突然无法输入词组.那个抓狂啊没关系,设置一下就ok 第一步:右上角输入法,右键——>首选项——>常规——> ...
VPN 隧道协议PPTP、L2TP、IPSec和SSLVPN的区别
最近软矿频繁地介绍了各种VPN,有免费的PacketiX.NET和Hotspot Shield,有付费的Astrill VPN,iVPN和PureVPN.在介绍这些VPN的时候,常常会说到PPTP.L ...
oracle常见小问题解答ORA-01008，ORA-01036
第一个问题,参数传的空值,需要检查参数们有没有空值的情况第二个问题,与MSSQL不同的是,.net使用参数化调用oracle不加@加的是:,然后在参数化语句里面可以省略:冒号,如果不这么写,就会出现 ...
HDU题目分类
基础题: 1000.1001.1004.1005.1008.1012.1013.1014.1017.1019.1021.1028.1029.1032.1037.1040.1048.1056.1058. ...
CENTOS LINUX查询内存大小、频率
more /proc/meminfo dmidecode [root@barcode-mcs ~]# dmidecode -t memory linux下查看主板内存槽与内存信息 1.查看内存槽数.那 ...
Could not load file or assembly 'MagickNet.dll'
1 确定项目中bin目录下存在该DLL文件 2 安装 VC++发布组件_缩略图用_x86(1).exe
SCCM 2007 R2部署、操作详解系列之概念
站点类型在安装站点时,您决定它将是主站点还是辅助站点.然后,在安装其他站点时,您可以选择将其安排到层次结构关系中,以便父站点管理子站点,中央站点收集所有站点信息,从而进行集中式管理.也可以根据业务和 ...

几种 HtmlEncode 的区别(转发）

几种 HtmlEncode 的区别(转发）的更多相关文章

随机推荐

热门专题