0.目录

1.参考

2. pool_connections 默认值为10，一个站点主机host对应一个pool
　　(4)分析
　　host A>>host B>>host A page2>>host A page3
　　限定只保留一个pool（host），根据TCP源端口可知，第四次get才能复用连接。

3. pool_maxsize 默认值为10，一个站点主机host对应一个pool，该pool内根据多线程需求可保留到某一相同主机host的多条连接
　　(4)分析
　　多线程启动时到特定主机host的连接数没有收到 pool_maxsize 的限制，但是之后只有min(线程数，pool_maxsize ) 的连接数能够保留。
　　后续线程(应用层)并不关心实际会使用到的具体连接(传输层源端口)

1.参考

【转载-译文】requests库连接池说明

Requests' secret: pool_connections and pool_maxsize

Python - 体验urllib3 -- HTTP连接池的应用

　　通过wireshark抓取包：

　　所有http://ent.qq.com/a/20111216/******.htm对应的src port都是13136，可见端口重用了

2. pool_connections 默认值为10，一个站点主机host对应一个pool

(1)代码

#!/usr/bin/env python

# -*- coding: UTF-8 -*

import time

import requests

from threading import Thread

import logging

logging.basicConfig()

logging.getLogger().setLevel(logging.DEBUG)

requests_log = logging.getLogger("requests.packages.urllib3")

requests_log.setLevel(logging.DEBUG)

requests_log.propagate = True

url_sohu_1 = 'http://www.sohu.com/sohu/1.html'

url_sohu_2 = 'http://www.sohu.com/sohu/2.html'

url_sohu_3 = 'http://www.sohu.com/sohu/3.html'

url_sohu_4 = 'http://www.sohu.com/sohu/4.html'

url_sohu_5 = 'http://www.sohu.com/sohu/5.html'

url_sohu_6 = 'http://www.sohu.com/sohu/6.html'

url_news_1 = 'http://news.163.com/air/'

url_news_2 = 'http://news.163.com/domestic/'

url_news_3 = 'http://news.163.com/photo/'

url_news_4 = 'http://news.163.com/shehui/'

url_news_5 = 'http://news.163.com/uav/5/'

url_news_6 = 'http://news.163.com/world/6/'

s = requests.Session()

s.mount('http://', requests.adapters.HTTPAdapter(pool_connections=1))

s.get(url_sohu_1)

s.get(url_news_1)

s.get(url_sohu_2)

s.get(url_sohu_3)

(2)log输出

DEBUG:urllib3.connectionpool:Starting new HTTP connection (1): www.sohu.com              #host A

DEBUG:urllib3.connectionpool:http://www.sohu.com:80 "GET /sohu/1.html HTTP/1.1" 404 None

DEBUG:urllib3.connectionpool:Starting new HTTP connection (1): news.163.com　　　　　　　　 #host B

DEBUG:urllib3.connectionpool:http://news.163.com:80 "GET /air/ HTTP/1.1" 200 None

DEBUG:urllib3.connectionpool:Starting new HTTP connection (1): www.sohu.com　　　　　　　　 #host A　　

DEBUG:urllib3.connectionpool:http://www.sohu.com:80 "GET /sohu/2.html HTTP/1.1" 404 None  #host A page2

DEBUG:urllib3.connectionpool:http://www.sohu.com:80 "GET /sohu/3.html HTTP/1.1" 404 None  #host A page3

(3)wireshark抓包 https过滤方法？用tcp syn？ ping m.10010.com 然后 tcp.flags == 0x0002 and ip.dst == 157.255.128.111

(4)分析

host A>>host B>>host A page2>>host A page3

限定只保留一个pool（host），根据TCP源端口可知，第四次get才能复用连接。

3. pool_maxsize 默认值为10，一个站点主机host对应一个pool，该pool内根据多线程需求可保留到某一相同主机host的多条连接

(1)代码

def thread_get(url):

    s.get(url)

def thread_get_wait_3s(url):

    s.get(url)

    time.sleep(3)

    s.get(url)

def thread_get_wait_5s(url):

    s.get(url)

    time.sleep(5)

    s.get(url)

s = requests.Session()

s.mount('http://', requests.adapters.HTTPAdapter(pool_maxsize=2))

t1 = Thread(target=thread_get_wait_5s, args=(url_sohu_1,))

t2 = Thread(target=thread_get, args=(url_news_1,))

t3 = Thread(target=thread_get_wait_3s, args=(url_sohu_2,))

t4 = Thread(target=thread_get_wait_5s, args=(url_sohu_3,))

t1.start()

t2.start()

t3.start()

t4.start()

t1.join()

t2.join()

t3.join()

t4.join()

(2)log输出

DEBUG:urllib3.connectionpool:Starting new HTTP connection (1): www.sohu.com    #pool_sohu_connection_1_port_54805

DEBUG:urllib3.connectionpool:Starting new HTTP connection (1): news.163.com  　 #pool_163_connection_1_port_54806

DEBUG:urllib3.connectionpool:Starting new HTTP connection (2): www.sohu.com    #pool_sohu_connection_2_port_54807

DEBUG:urllib3.connectionpool:Starting new HTTP connection (3): www.sohu.com    #pool_sohu_connection_3_port_54808

DEBUG:urllib3.connectionpool:http://news.163.com:80 "GET /air/ HTTP/1.1" 200 None

DEBUG:urllib3.connectionpool:http://www.sohu.com:80 "GET /sohu/3.html HTTP/1.1" 404 None

DEBUG:urllib3.connectionpool:http://www.sohu.com:80 "GET /sohu/2.html HTTP/1.1" 404 None

DEBUG:urllib3.connectionpool:http://www.sohu.com:80 "GET /sohu/1.html HTTP/1.1" 404 None

WARNING:urllib3.connectionpool:Connection pool is full, discarding connection: www.sohu.com  #pool_sohu_connection_1_port_54805 被丢弃？最初host sohu能够建立3条连接，之后终究只能保存2条？？？

DEBUG:urllib3.connectionpool:http://www.sohu.com:80 "GET /sohu/2.html HTTP/1.1" 404 None     #pool_sohu_connection_2_port_54807 3秒后sohu/2复用了原来sohu/2的端口

DEBUG:urllib3.connectionpool:http://www.sohu.com:80 "GET /sohu/3.html HTTP/1.1" 404 None     #pool_sohu_connection_2_port_54807 5秒后sohu/3复用了原来sohu/2的端口

DEBUG:urllib3.connectionpool:http://www.sohu.com:80 "GET /sohu/1.html HTTP/1.1" 404 None     #pool_sohu_connection_3_port_54807 5秒后sohu/1复用了原来sohu/3的端口

(3)wireshark抓包

(4)分析

多线程启动时到特定主机host的连接数没有收到 pool_maxsize 的限制，但是之后只有min(线程数，pool_maxsize ) 的连接数能够保留。

后续线程(应用层)并不关心实际会使用到的具体连接(传输层源端口)

python之requests urllib3 连接池的更多相关文章

python中实现mysql连接池
python中实现mysql连接池 import pymysql from DBUtils.PooledDB import PooledDB MYSQL_HOST = 'localhost' USER ...
【转载-译文】requests库连接池说明
转译自:https://laike9m.com/blog/requests-secret-pool_connections-and-pool_maxsize,89/ Requests' secret: ...
python socketpool：通用连接池（转）
简介在软件开发中经常要管理各种“连接”资源,通常我们会使用对应的连接池来管理,比如mysql数据库连接可以用sqlalchemy中的池来管理,thrift连接可以通过thriftpool管理,red ...
python socketpool：通用连接池
简介在软件开发中经常要管理各种“连接”资源,通常我们会使用对应的连接池来管理,比如mysql数据库连接可以用sqlalchemy中的池来管理,thrift连接可以通过thriftpool管理,red ...
Python下Mysql数据连接池——单例
# coding:utf-8 import threading import pymysql from DBUtils.PooledDB import PooledDB from app.common ...
python操作Redis安装、支持存储类型、普通连接、连接池
一.python操作redis安装和支持存储类型安装redis模块 pip3 install redis 二.Python操作Redis之普通连接 redis-py提供两个类Redis和Strict ...
Python使用urllib,urllib3,requests库+beautifulsoup爬取网页
Python使用urllib/urllib3/requests库+beautifulsoup爬取网页 urllib urllib3 requests 笔者在爬取时遇到的问题 1.结果不全 2.'抓取失 ...
urllib3使用池管理发送请求和requests常用方法的基本使用+session使用
使用urllib3的池管理器 urllib3是在urllib进行更加深入的改进,最大的好处就是在urllib的基础上添加了池管理,以至于我们不需要再去注意我们需要由那个链接去发送请求,而只需要获取到链 ...
用python自定义实现db2的连接池
想要模仿zabbix的oracle插件orabix来实现对db2的监控,但是Java能力有限,就用python来实现了.但是python常用的连接池PooledDB似乎并不支持db2,一直报这样的错误 ...

随机推荐

使用命令行解析php文件
使用命令行解析php文件,这样可以调用Log4PHP库中的一些demo,因为默认的输出使用命令行作为输出. 建一个bat文件: echo 以下是使用命令行解析php文件 C:\xampp\php\ph ...
jquery $.trim()去除字符串空格
语法jQuery.trim()函数用于去除字符串两端的空白字符. 作用该函数可以去除字符串开始和末尾两端的空白字符(直到遇到第一个非空白字符串为止).它会清除包括换行符.空格.制表符等常见的空白字符. ...
VS2017编译LevelDB
环境: 操作系统:Win7 x64 编译器:VS2017 需要Boost库支持,需要先将Boost库编译成为64位版本. 一.项目文件导入 1. 下载leveldb-windows,https://c ...
POJ 1305
毕达哥斯三元组的模板题练习练习 #include<iostream> #include<cstring> #include<cstdio> #include< ...
HNUOJ 13341
题目给你一个串, 串是严格的 1 – n 的排列,里面的数是随机的把这个串里面的数字分别输出//先预处理,对于给出的串能找到里面的最大数,再 DFS 处理 #include<iostream& ...
python第13天
装饰器装饰器本质上就是一个python函数,他可以让其他函数在不需要做任何改动的前提下,增加额外的功能,装饰器的返回值也是一个函数对象. 装饰器的应用场景:比如插入日志,性能测试,事务处理,缓存等等 ...
[转] 为什么说 Java 程序员必须掌握 Spring Boot ？
Spring Boot 2.0 的推出又激起了一阵学习 Spring Boot 热,那么, Spring Boot 诞生的背景是什么?Spring 企业又是基于什么样的考虑创建 Spring Boot ...
Codeforces 1093E Intersection of Permutations [CDQ分治]
洛谷 Codeforces 思路一开始想到莫队+bitset,发现要T. 再想到分块+bitset,脑子一抽竟然直接开始写了,当然也T了. 最后发现这就是个裸的CDQ分治-- 发现$a$不变,可 ...
解决Javascript中$(window).resize()多次执行(转)
https://www.cnblogs.com/shuilangyizu/p/6816756.html 有些时候,我们需要在浏览器窗口发生变化的时候,动态的执行一些操作,比如做自适应页面时的适配.这个 ...
Vue1.0到2.0变化
一.生命周期二.代码片段在vue1.0中可以在template编写时出现: <template> <div>第一行</div> <div>第二行&l ...

python之requests urllib3 连接池

0.目录

1.参考

2. pool_connections 默认值为10，一个站点主机host对应一个pool

(1)代码

(2)log输出

(3)wireshark抓包 https过滤方法？用tcp syn？ ping m.10010.com 然后 tcp.flags == 0x0002 and ip.dst == 157.255.128.111

(4)分析

3. pool_maxsize 默认值为10，一个站点主机host对应一个pool， 该pool内根据多线程需求可保留到某一相同主机host的多条连接

(1)代码

(2)log输出

(3)wireshark抓包

(4)分析

python之requests urllib3 连接池的更多相关文章

随机推荐

热门专题

3. pool_maxsize 默认值为10，一个站点主机host对应一个pool，该pool内根据多线程需求可保留到某一相同主机host的多条连接