asp.net C# 获取网页源代码的几种方式

1 方法

               System.Net.WebClient aWebClient = new System.Net.WebClient();

               aWebClient.Encoding = System.Text.Encoding.Default;

               Byte[] pageData = aWebClient.DownloadData(url);

               string nhtml = Encoding.GetEncoding("utf-8").GetString(pageData);

2方法

                System.Net.WebClient aWebClient = new System.Net.WebClient();

                aWebClient.Encoding = System.Text.Encoding.Default;

                string nhtml = aWebClient.DownloadString(goodstidurl);

3方法

               WebBrowser webbrowser = new WebBrowser();

                StreamReader sr = new StreamReader(this.webBTaobao.DocumentStream, Encoding.Default);

                html = sr.ReadToEnd();

                html = html.Replace("\r\n", "");

                html = html.Replace("\n", "");

                html = html.Replace("  ", "");

                html = html.Replace("(", "");

                html = html.Replace(")", "");

                string nurl = Regex.Match(html, "(?<=data-url=\").*?(?=\")").Value;

                //新建一个WebBrowser

                WebBrowser webAddress = new WebBrowser();

                webAddress.Navigate(nurl);

                //等待载入完毕

                while (webAddress.ReadyState < WebBrowserReadyState.Complete) Application.DoEvents();

                StreamReader sraddress = new StreamReader(webAddress.DocumentStream, Encoding.Default);

                jsonaddress = sraddress.ReadToEnd();

4方法

            WebRequest hwr = WebRequest.Create(@"http://item.taobao.com/item.htm?

id=" + row["urlId"].ToString());//向指定Url发出请求

            HttpWebResponse hwp = hwr.GetResponse() as HttpWebResponse;//将hwr对HTTP的请求

            string text;

            StreamReader sr;

            string code = hwp.ContentType;//请求响应得到的内容类型

            //得到编码了

            code = code.Split('=')[1];

            Stream rep = hwp.GetResponseStream();//将请求得到的内容以流的形式读出

            sr = new StreamReader(rep, Encoding.GetEncoding(code));//用指定的字符编码为指定的流初始化

asp.net C# 获取网页源代码的几种方式的更多相关文章

Python 2.7获取网站源代码的几种方式_20160924
#coding:utf-8 import urllib2,cookielib if __name__ == '__main__': root_url='https://www.baidu.com/' ...
c#利用WebClient和WebRequest获取网页源代码的比较
前几天举例分析了用asp+xmlhttp获取网页源代码的方法,但c#中一般是可以利用WebClient类和WebRequest类获取网页源代码.下面分别说明这两种方法的实现. WebClient类获取 ...
Java 网络爬虫获取网页源代码原理及实现
Java 网络爬虫获取网页源代码原理及实现 1.网络爬虫是一个自动提取网页的程序,它为搜索引擎从万维网上下载网页,是搜索引擎的重要组成.传统爬虫从一个或若干初始网页的URL开始,获得初始网页上的URL ...
delphi 获取网页源代码
//获取网页源代码 var s: string; begin s := WebBrowser1.OleObject.document.body.innerHTML; //body内的所有代码 ...
JS远程获取网页源代码的例子
js代码获取网页源代码. 代码: <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> < ...
c#利用WebClient和WebRequest获取网页源代码
C#中一般是可以利用WebClient类和WebRequest类获取网页源代码.下面分别说明这两种方法的实现. WebClient类获取网页源代码 WebClient类 WebClient ...
c#利用HttpWebRequest获取网页源代码
c#利用HttpWebRequest获取网页源代码,搞了好几天终于解决了,直接获取网站编码进行数据读取,再也不用担心乱码了! 命名空间:Using System.Net private static ...
js技术要点---JS 获取网页源代码
JS 获取网页源代码 <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html& ...
C# 获取网页源代码
/// <summary> /// 获取网页源代码 /// </summary> /// <param name="url"></para ...

随机推荐

BZOJ 4864 [BJWC2017]神秘物质 (splay)
题目大意: 让你维护一个序列,支持: 1.合并两个相邻的数为一个新的数 2.在某个位置插入一个数 3.查询一个区间的任意子区间极差的最大值 4.查询一个区间的任意子区间极差的最小值前两个操作可以用$ ...
echarts图表属性说明
参考博客: https://blog.csdn.net/luanpeng825485697/article/details/76691965
Django REST Framework - 分页 - 渲染器 - 解析器
为什么要使用分页? 我们数据表中可能会有成千上万条数据,当我们访问某张表的所有数据时,我们不太可能需要一次把所有的数据都展示出来,因为数据量很大,对服务端的内存压力比较大还有就是网络传输过程中耗时也会 ...
如何让myeclipse左边选中文件后自动关联右边树
在左侧项目树的右上角下拉菜单里有link with editor 点击即可
linux中fork（）函数详解（搬砖）
一.fork入门知识一个进程,包括代码.数据和分配给进程的资源.fork()函数通过系统调用创建一个与原来进程几乎完全相同的进程,也就是两个进程可以做完全相同的事,但如果初始参数或者传入的变量不同, ...
关于thinkpadU盘系统盘启动不了解决方法
http://www.laomaotao.org/softhelp/bios/382.html(原文章地址,比较全面) thinkpad笔记本uefi无法启动详细解决教程最近有个别用户反映说thin ...
QT5 OpenGL (六，键盘事件，开关灯，放大缩小综合运用)
概要实例效果图立体图放大图立体图缩小图不加矢量开灯图不加矢量关灯图加矢量关灯图1 加矢量关灯图2 部分代码展示主要内容解析 QT键盘事件立体图形的放大和缩小上下左右键以及A键D争键控 ...
vue 路由demo2
<!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8&quo ...
NOIP卡常数技巧
NOIP卡常数技巧 https://blog.csdn.net/a1351937368/article/details/78162078 http://www.mamicode.com/info-de ...
TYVJ1415 差分约束
思路: i–>i+1连一条边权为0的边 i–>i-1连一条边权为-1的边 start-1 ->end 连一条边权为w的边求0->n的最长路即可 //By SiriusRen ...

asp.net C# 获取网页源代码的几种方式

asp.net C# 获取网页源代码的几种方式的更多相关文章

随机推荐

热门专题