How Selenium launches the browser

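A few days ago a student asked how Selenium launches the browser (that is, the mechanism behind the launch). I gave a brief explanation at the time, but it did not feel concrete enough. This post ties the startup mechanism together with source code and a series of hands-on steps, which should make it easier to understand.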

Taking Chrome as the example, this is the Selenium code (the constructor of the Chrome driver class) that launches the browser:



    def __init__(self, executable_path="chromedriver", port=0,
                 options=None, service_args=None,
                 desired_capabilities=None, service_log_path=None,
                 chrome_options=None):
        """
        Creates a new instance of the chrome driver.

        Starts the service and then creates new instance of chrome driver.

        :Args:
         - executable_path - path to the executable. If the default is used it assumes the executable is in the $PATH
         - port - port you would like the service to run, if left as 0, a free port will be found.
         - desired_capabilities: Dictionary object with non-browser specific
           capabilities only, such as "proxy" or "loggingPref".
         - options: this takes an instance of ChromeOptions
        """
        if chrome_options:
            warnings.warn('use options instead of chrome_options', DeprecationWarning)
            options = chrome_options

        if options is None:
            # desired_capabilities stays as passed in
            if desired_capabilities is None:
                desired_capabilities = self.create_options().to_capabilities()
        else:
            if desired_capabilities is None:
                desired_capabilities = options.to_capabilities()
            else:
                desired_capabilities.update(options.to_capabilities())

        self.service = Service(
            executable_path,
            port=port,
            service_args=service_args,
            log_path=service_log_path)
        self.service.start()

        try:
            RemoteWebDriver.__init__(
                self,
                command_executor=ChromeRemoteConnection(
                    remote_server_addr=self.service.service_url),
                desired_capabilities=desired_capabilities)
        except Exception:
            self.quit()
            raise
        self._is_remote = False

The lines most closely tied to launching the browser are these:

self.service = Service(
    executable_path,
    port=port,
    service_args=service_args,
    log_path=service_log_path)
self.service.start()

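Reading through the Service code reveals the startup logic: it invokes the chromedriver executable to start chromedriver as a child process. This is also why chromedriver has to be on the system PATH when the default executable_path is used.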
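The startup logic described above can be sketched roughly as follows. This is a simplified illustration, not Selenium's actual Service implementation: the helper names `free_port`, `build_command` and `start_driver` are made up for this example, and it assumes the driver is told which port to use through chromedriver's `--port=` argument.

```python
import shutil
import socket
import subprocess


def free_port():
    """Ask the OS for an unused TCP port (the meaning of port=0)."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind(("127.0.0.1", 0))
        return sock.getsockname()[1]


def build_command(executable_path="chromedriver", port=0, service_args=None):
    """Build the argument list used to launch the driver (hypothetical helper)."""
    if port == 0:
        port = free_port()
    return [executable_path, "--port=%d" % port] + list(service_args or [])


def start_driver(executable_path="chromedriver", port=0, service_args=None):
    """Launch the driver as a child process and return the Popen handle."""
    # A bare name such as "chromedriver" is only found if it is on PATH.
    if shutil.which(executable_path) is None:
        raise FileNotFoundError(
            "%r not found; is it on PATH?" % executable_path)
    return subprocess.Popen(build_command(executable_path, port, service_args))
```

Calling `start_driver()` with no arguments only works when a `chromedriver` binary can be resolved through PATH, which matches the requirement mentioned above.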

So Selenium starts chromedriver first. We can start chromedriver by hand to simulate this step.

Run the following command on the command line:

chromedriver

You should see output similar to this:

Starting ChromeDriver 2.38.552518 (183d19265345f54ce39cbb94cf81ba5f15905011) on port 9515

Only local connections are allowed.

chromedriver is now running, started by hand. The driver is listening on port 9515.
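Before a client can send any request, the driver has to be accepting connections on its port. A minimal sketch of such a check is shown below; the function names `is_connectable` and `wait_until_listening` and the timeout values are illustrative choices for this example, not a copy of Selenium's code.

```python
import socket
import time


def is_connectable(port, host="127.0.0.1"):
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=1):
            return True
    except OSError:
        return False


def wait_until_listening(port, timeout=10.0, interval=0.1):
    """Poll until the port accepts connections or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if is_connectable(port):
            return True
        time.sleep(interval)
    return False
```

With the manually started driver from above, `wait_until_listening(9515)` should return True once the startup banner has been printed.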

Once the driver is running, we need to tell it to open the browser. In the Selenium source, this step looks like this:



    def start_session(self, capabilities, browser_profile=None):
        """
        Creates a new session with the desired capabilities.

        :Args:
         - browser_name - The name of the browser to request.
         - version - Which browser version to request.
         - platform - Which platform to request the browser on.
         - javascript_enabled - Whether the new session should support JavaScript.
         - browser_profile - A selenium.webdriver.firefox.firefox_profile.FirefoxProfile object. Only used if Firefox is requested.
        """
        if not isinstance(capabilities, dict):
            raise InvalidArgumentException("Capabilities must be a dictionary")
        if browser_profile:
            if "moz:firefoxOptions" in capabilities:
                capabilities["moz:firefoxOptions"]["profile"] = browser_profile.encoded
            else:
                capabilities.update({'firefox_profile': browser_profile.encoded})
        w3c_caps = _make_w3c_caps(capabilities)
        parameters = {"capabilities": w3c_caps,
                      "desiredCapabilities": capabilities}
        response = self.execute(Command.NEW_SESSION, parameters)
        if 'sessionId' not in response:
            response = response['value']
        self.session_id = response['sessionId']
        self.capabilities = response.get('value')

        # if capabilities is none we are probably speaking to
        # a W3C endpoint
        if self.capabilities is None:
            self.capabilities = response.get('capabilities')

        # Double check to see if we have a W3C Compliant browser
        self.w3c = response.get('status') is None
        self.command_executor.w3c = self.w3c

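The core of this step is a POST request to the driver's /session endpoint (localhost:9515/session for the manually started driver above) with a JSON object as the body. By default, the object looks like this: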

{
    "capabilities": {
        "alwaysMatch": {
            "browserName": "chrome",
            "goog:chromeOptions": {
                "args": [],
                "extensions": []
            },
            "platformName": "any"
        },
        "firstMatch": [
            {}
        ]
    },
    "desiredCapabilities": {
        "browserName": "chrome",
        "goog:chromeOptions": {
            "args": [],
            "extensions": []
        },
        "platform": "ANY",
        "version": ""
    }
}

Put simply, this tells the remote driver which browser to open. In the example above, that browser is Chrome.
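The same request body can be assembled in code. The sketch below only reproduces the shape of the default JSON shown above; `new_session_payload` is a hypothetical helper written for this example, not a Selenium function, and its `args` and `extensions` parameters are an assumption about how one might want to vary the body.

```python
import json


def new_session_payload(browser_name="chrome", args=None, extensions=None):
    """Build a request body with the same shape as the default JSON above."""

    def chrome_options():
        # Build a fresh dict each time so the two sections do not share state.
        return {"args": list(args or []), "extensions": list(extensions or [])}

    return {
        # W3C-style capabilities
        "capabilities": {
            "alwaysMatch": {
                "browserName": browser_name,
                "goog:chromeOptions": chrome_options(),
                "platformName": "any",
            },
            "firstMatch": [{}],
        },
        # Legacy-style capabilities, sent alongside for older drivers
        "desiredCapabilities": {
            "browserName": browser_name,
            "goog:chromeOptions": chrome_options(),
            "platform": "ANY",
            "version": "",
        },
    }


# Serialize to the JSON text that would be sent as the POST body.
body = json.dumps(new_session_payload(), indent=4)
```

Passing something like `args=["--headless"]` would place the flag in both the W3C and the legacy section of the body.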

We can reproduce this step by hand.

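Make sure chromedriver is running, then open Postman and build a POST request to localhost:9515/session. In the Body tab, choose raw and JSON (application/json), then paste in the JSON above.

[Figure: the POST request configured in Postman]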

Click Send. After a few seconds Chrome should start normally, and the response in Postman will look roughly like this:

{
    "sessionId": "ad4407e133cfd5f3f49bff4c2f1f087a",
    "status": 0,
    "value": {
        "acceptInsecureCerts": false,
        "acceptSslCerts": false,
        "applicationCacheEnabled": false,
        "browserConnectionEnabled": false,
        "browserName": "chrome",
        "chrome": {
            "chromedriverVersion": "2.38.552518 (183d19265345f54ce39cbb94cf81ba5f15905011)",
            "userDataDir": "/var/folders/s6/f2_brc114wv2g8w0qggk_m2c0000gn/T/.org.chromium.Chromium.NMsAKJ"
        },
        "cssSelectorsEnabled": true,
        "databaseEnabled": false,
        "handlesAlerts": true,
        "hasTouchScreen": false,
        "javascriptEnabled": true,
        "locationContextEnabled": true,
        "mobileEmulationEnabled": false,
        "nativeEvents": true,
        "networkConnectionEnabled": false,
        "pageLoadStrategy": "normal",
        "platform": "Mac OS X",
        "rotatable": false,
        "setWindowRect": true,
        "takesHeapSnapshot": true,
        "takesScreenshot": true,
        "unexpectedAlertBehaviour": "",
        "version": "66.0.3359.181",
        "webStorageEnabled": true
    }
}

The most important field in this response is sessionId, because every later interaction with the browser is based on that id.
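The Postman request can also be sent from Python using only the standard library. The following is a minimal sketch, assuming chromedriver is listening on localhost:9515 as above; `post_json`, `extract_session_id` and `command_url` are names invented for this example. The session-id lookup mirrors the fallback in start_session, and the `/session/{id}/...` URL layout is the one defined by the WebDriver protocol.

```python
import json
import urllib.request


def post_json(url, payload, timeout=30):
    """POST payload as JSON and return the decoded JSON response."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def extract_session_id(response):
    """Pull the session id out of a legacy or W3C-style response."""
    # Legacy responses carry sessionId at the top level;
    # W3C responses nest it under "value".
    if "sessionId" not in response:
        response = response["value"]
    return response["sessionId"]


def command_url(base_url, session_id, command=""):
    """Build the URL of a command that belongs to an existing session."""
    url = "%s/session/%s" % (base_url.rstrip("/"), session_id)
    if command:
        url = url + "/" + command.lstrip("/")
    return url
```

With the driver running, `post_json("http://localhost:9515/session", payload)` (where `payload` is the JSON body shown earlier, as a dict) should open Chrome. Posting `{"url": "https://example.com"}` to `command_url("http://localhost:9515", session_id, "url")` would then navigate that browser, which is one instance of a later interaction keyed by the session id.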

Summary

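In Selenium, the Selenium client first starts chromedriver.
chromedriver opens the browser when it creates a session, so the Selenium client never opens the browser itself; that is entirely chromedriver's job.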

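In fact, in the example above we manually invoked the New Session command of the WebDriver protocol and created a WebDriver session. See the protocol specification for the full details.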
