Selenium RemoteWebDriver 利用CDP修改User-Agent

地球人都知道，如果使用selenium时要修改user-agent可以在启动浏览器时添加配置项，如chromeOptions.addArguments("user-agent=xxx");。但是如何在每次请求的时候动态更改user-agent呢？

经过我的不懈努力，终于在网上找到一个相关的信息使用python3和selenium4修改chrome的user-agent。

这里面提到了使用driver.execute_cdp_cmd来切换，这让我了解了一下cdp命令。简单的来说，cdp命令时chrome支持的一种基于websocket的协议，通过这个协议可以与浏览器内核通信。平常使用的F12浏览器开发工具就是基于cdp的。cpd命令可以实现的功能很多，可以参考Chrome DevTools Protocol。再这里面我找到了一个Network.setUserAgentOverride命令可以修改请求user-agent。

命令的参数如下：

但是，在我的项目中目前使用的selenium-java的版本是3.141.59，这个版本还没用提供对于cdp命令的支持，前面信息中提到了是在selenium4中使用使用的cdp命令。于是我又去maven仓库搜索有没有selenium4的jar包可以用。

这里面已经有5个alpha测试的版本了，虽然还不是稳定版本，但是为了实现新功能先试一试，在maven中添加依赖：

<!-- https://mvnrepository.com/artifact/org.seleniumhq.selenium/selenium-java -->

<dependency>

    <groupId>org.seleniumhq.selenium</groupId>

    <artifactId>selenium-java</artifactId>

    <version>4.0.0-alpha-5</version>

</dependency>

然后尝试调用ChromeDriver的executeCdpCommand方法

可以看到，第一个参数是commandName，对于修改user-agent的需求，这个地方应该填写Network.setUserAgentOverride，后面是参数的键值对，现在只需要填写userAgent就可以了，其他都是不需要的可选参数。

一般情况下，问题到这里就解决了，但是在我的项目中却没有这么简单。因为各种原因，我的项目中使用的是Selenium-Server来提供浏览器环境的，也就是说，创建的都是RemoteWebDriver，虽然ChromeDriver是继承自RemoteWebDriver的，但是cdp命令是chrome浏览器独有的，因此RemoteWebDriver也就没有提供相关的支持了。那么如何在确定RemoteWebDriver调用chrome浏览器的情况下提供cpd命令的支持呢？为了实现这个功能还是费了一些时间，因此在这里把过程记录下来。

首先看一下ChromeWebDriver是如何实现cdp命令的：

public Map<String, Object> executeCdpCommand(String commandName, Map<String, Object> parameters) {

    Objects.requireNonNull(commandName, "Command name must be set.");

    Objects.requireNonNull(parameters, "Parameters for command must be set.");

    Map<String, Object> toReturn = (Map)this.getExecuteMethod().execute("executeCdpCommand", ImmutableMap.of("cmd", commandName, "params", parameters));

    return ImmutableMap.copyOf(toReturn);

}

在Selenium4中，ChromeDriver继承自ChromiumDriver，二者其实是一模一样的。ChromiumDriver提供了cdp命令的支持，利用executeMethod运行命令executeCdpCommand，将要运行的具体命令和参数一并传入。于是我又开始找这个ExecuteMethod是什么东西，发现ChromiumWebDriver并没有对这个参数进行任何设置，因此应该是在ChromiumDriver继承的RemoteWebDriver来设置的。果然，在RemoteWebDriver中有this.executeMethod = new RemoteExecuteMethod(this);，在ChromiumWebDriver中获取到的一定也是这个对象。那么很容易想到，继承一个RemoteWebDriver并编写一个方法调用这个executeMethod不就行了吗？

public class CdpRemoteWebDriver extends RemoteWebDriver {

    public CdpRemoteWebDriver(URL remoteAddress, Capabilities capabilities) {

        super(remoteAddress, capabilities);

    }

    public Map<String, Object> executeCdpCommand(String commandName, Map<String, Object> parameters) {

        Objects.requireNonNull(commandName, "Command name must be set.");

        Objects.requireNonNull(parameters, "Parameters for command must be set.");

        Map<String, Object> toReturn = (Map)this.getExecuteMethod().execute("executeCdpCommand", ImmutableMap.of("cmd", commandName, "params", parameters));

        return ImmutableMap.copyOf(toReturn);

    }

}

然后再创建CdpRemoteWebDriver实例，在访问网页之前设置user-agent

Map uaMap = new HashMap(){{

    put("userAgent", "customUserAgent");

}};

((CdpRemoteWebDriver) driver).executeCdpCommand("Network.setUserAgentOverride",

        uaMap

        );

driver.get(url);

运行试一下！

org.openqa.selenium.UnsupportedCommandException: executeCdpCommand

Build info: version: '4.0.0-alpha-5', revision: 'b3a0d621cc'

System info: host: 'DESKTOP-BM176Q1', ip: '192.168.137.1', os.name: 'Windows 10', os.arch: 'amd64', os.version: '10.0', java.version: '1.8.0_161'

Driver info: driver.version: CdpRemoteWebDriver

	at org.openqa.selenium.remote.codec.AbstractHttpCommandCodec.encode(AbstractHttpCommandCodec.java:246)

	at org.openqa.selenium.remote.codec.AbstractHttpCommandCodec.encode(AbstractHttpCommandCodec.java:129)

	at org.openqa.selenium.remote.HttpCommandExecutor.execute(HttpCommandExecutor.java:155)

	at org.openqa.selenium.remote.RemoteWebDriver.execute(RemoteWebDriver.java:582)

	at org.openqa.selenium.remote.RemoteWebDriver.execute(RemoteWebDriver.java:639)

	at org.openqa.selenium.remote.RemoteExecuteMethod.execute(RemoteExecuteMethod.java:36)

	at com.zju.edu.eagle.accessibilitycheck.a11ycheck.executor.impl.CdpRemoteWebDriver.executeCdpCommand(CdpRemoteWebDriver.java:24)

结果不行，说executeCdpCommand是不支持的命令，为什么一样的executeMethod结果不一样呢？定位到错误的地方看一下

public HttpRequest encode(Command command) {

    String name = (String)this.aliases.getOrDefault(command.getName(), command.getName());

    AbstractHttpCommandCodec.CommandSpec spec = (AbstractHttpCommandCodec.CommandSpec)this.nameToSpec.get(name);

    if (spec == null) {

        throw new UnsupportedCommandException(command.getName());

    }

    ...

}

运行命令时，先从(AbstractHttpCommandCodec.CommandSpec)this.nameToSpec中获取命令的相关信息了，而要运行的executeCdpCommand没有事先定义，所以就出现异常了。

public AbstractHttpCommandCodec() {

    this.defineCommand("status", get("/status"));

    this.defineCommand("getAllSessions", get("/sessions"));

    this.defineCommand("newSession", post("/session"));

    this.defineCommand("getCapabilities", get("/session/:sessionId"));

    ...

}

这些命令是在AbstractHttpCommandCodec中定义的，而executeCdpCommand不在其中。这说明虽然ChromeDriver和RemoteWebDriver有相同的executeMethod，但后续调用还是涉及到了不同的类，于是我又回头查看ChromeDriver中的代码，发现有这样一个构造函数

public ChromeDriver(ChromeDriverService service, Capabilities capabilities) {

    super(new ChromiumDriverCommandExecutor(service), capabilities, "goog:chromeOptions");

}

这里面创建了一个ChromiumDriverCommandExecutor，再点进来看一下

static {

    CHROME_COMMAND_NAME_TO_URL.put("launchApp", new CommandInfo("/session/:sessionId/chromium/launch_app", HttpMethod.POST));

    CHROME_COMMAND_NAME_TO_URL.put("getNetworkConditions", new CommandInfo("/session/:sessionId/chromium/network_conditions", HttpMethod.GET));

    CHROME_COMMAND_NAME_TO_URL.put("setNetworkConditions", new CommandInfo("/session/:sessionId/chromium/network_conditions", HttpMethod.POST));

    CHROME_COMMAND_NAME_TO_URL.put("deleteNetworkConditions", new CommandInfo("/session/:sessionId/chromium/network_conditions", HttpMethod.DELETE));

    CHROME_COMMAND_NAME_TO_URL.put("executeCdpCommand", new CommandInfo("/session/:sessionId/goog/cdp/execute", HttpMethod.POST));

    CHROME_COMMAND_NAME_TO_URL.put("getCastSinks", new CommandInfo("/session/:sessionId/goog/cast/get_sinks", HttpMethod.GET));

    CHROME_COMMAND_NAME_TO_URL.put("selectCastSink", new CommandInfo("/session/:sessionId/goog/cast/set_sink_to_use", HttpMethod.POST));

    CHROME_COMMAND_NAME_TO_URL.put("startCastTabMirroring", new CommandInfo("/session/:sessionId/goog/cast/start_tab_mirroring", HttpMethod.POST));

    CHROME_COMMAND_NAME_TO_URL.put("getCastIssueMessage", new CommandInfo("/session/:sessionId/goog/cast/get_issue_message", HttpMethod.GET));

    CHROME_COMMAND_NAME_TO_URL.put("stopCasting", new CommandInfo("/session/:sessionId/goog/cast/stop_casting", HttpMethod.POST));

    CHROME_COMMAND_NAME_TO_URL.put("setPermission", new CommandInfo("/session/:sessionId/permissions", HttpMethod.POST));

}

可以看到这里面也定义了一些命令，executeCdpCommand也在其中。而RemoteWebDriver没有这个命令的信息，自然也就无法执行了。经过进一步查看源码，我发现ChromiumDriverCommandExecutor是HttpCommandExecutor的子类，HttpCommandExecutor是RemoteWebDriver中真正的命令执行者。

ChromeWebDriver能够提供自定义的CommandExecutor来增加额外命令，自然我们自己继承的类也可以。在HttpCommandExecutor中有这样一个构造函数HttpCommandExecutor(Map<String, CommandInfo> additionalCommands, URL addressOfRemoteServer)，只要把添加的命令的键值对传入，就可以支持额外的命令了。

最终版本的代码如下

public class CdpRemoteWebDriver extends RemoteWebDriver {

    private static final HashMap<String, CommandInfo> CHROME_COMMAND_NAME_TO_URL = new HashMap();

    public CdpRemoteWebDriver(URL remoteAddress, Capabilities capabilities) {

        super((CommandExecutor)(new HttpCommandExecutor(ImmutableMap.copyOf(CHROME_COMMAND_NAME_TO_URL), remoteAddress)), capabilities);

    }

    public Map<String, Object> executeCdpCommand(String commandName, Map<String, Object> parameters) {

        Objects.requireNonNull(commandName, "Command name must be set.");

        Objects.requireNonNull(parameters, "Parameters for command must be set.");

        Map<String, Object> toReturn = (Map)this.getExecuteMethod().execute("executeCdpCommand", ImmutableMap.of("cmd", commandName, "params", parameters));

        return ImmutableMap.copyOf(toReturn);

    }

    static {

        CHROME_COMMAND_NAME_TO_URL.put("launchApp", new CommandInfo("/session/:sessionId/chromium/launch_app", HttpMethod.POST));

        CHROME_COMMAND_NAME_TO_URL.put("getNetworkConditions", new CommandInfo("/session/:sessionId/chromium/network_conditions", HttpMethod.GET));

        CHROME_COMMAND_NAME_TO_URL.put("setNetworkConditions", new CommandInfo("/session/:sessionId/chromium/network_conditions", HttpMethod.POST));

        CHROME_COMMAND_NAME_TO_URL.put("deleteNetworkConditions", new CommandInfo("/session/:sessionId/chromium/network_conditions", HttpMethod.DELETE));

        CHROME_COMMAND_NAME_TO_URL.put("executeCdpCommand", new CommandInfo("/session/:sessionId/goog/cdp/execute", HttpMethod.POST));

        CHROME_COMMAND_NAME_TO_URL.put("getCastSinks", new CommandInfo("/session/:sessionId/goog/cast/get_sinks", HttpMethod.GET));

        CHROME_COMMAND_NAME_TO_URL.put("selectCastSink", new CommandInfo("/session/:sessionId/goog/cast/set_sink_to_use", HttpMethod.POST));

        CHROME_COMMAND_NAME_TO_URL.put("startCastTabMirroring", new CommandInfo("/session/:sessionId/goog/cast/start_tab_mirroring", HttpMethod.POST));

        CHROME_COMMAND_NAME_TO_URL.put("getCastIssueMessage", new CommandInfo("/session/:sessionId/goog/cast/get_issue_message", HttpMethod.GET));

        CHROME_COMMAND_NAME_TO_URL.put("stopCasting", new CommandInfo("/session/:sessionId/goog/cast/stop_casting", HttpMethod.POST));

        CHROME_COMMAND_NAME_TO_URL.put("setPermission", new CommandInfo("/session/:sessionId/permissions", HttpMethod.POST));

    }

}

再测试一下效果

可以看到user-agent已经被成功替换了。

总结一下解决问题的流程

继承RemoteWebDriver类
参考ChromeDriver实现executeCdpCommand方法
参考ChromeDriver创建自定义的commandExecutor增加命令

事实上，因为cdp命令是chrome浏览器提供的支持，与selenium无关，在selenium4中只是内置了这个命令的参数和地址，调用的原理与原来支持的方法是一样的。在自己实现的CdpRemoteWebDriver中已经自己添加了参数，并不需要将依赖升级到4.0.0就可以调用cdp命令了。

Selenium RemoteWebDriver 利用CDP修改User-Agent的更多相关文章

利用Photoshop修改图片以达到投稿要求
摘自:http://www.dxy.cn/bbs/thread/8602152#8602152 利用Photoshop修改图片以达到投稿要求软件版本为Photoshop CS V8.0.1(中文版) ...
利用phpmyadmin修改mysql的root密码及如何进入修改密码后的phpmyadmin
1.利用phpmyadmin修改mysql的root密码很多人利用phpmyadmin或者命令行来修改了mysql的root密码,重启后发现mysql登录错误,这是为什么呢?修改mysql的root ...
UIWebView使用时的问题,包含修改user agent
1.①像普通controller那样实现跳转到webview的效果,而不是直接加到当前controller②隐藏webview的某些元素③webview跳往原生app④给webview添加进度条解决 ...
利用脚本修改SQL SERVER排序规则
利用脚本修改SQL SERVER排序规则编写人:CC阿爸 2014-3-1 l 今年的一项重要工作是对公司所用系统进行繁简的转换,程序转成简体基本很容易解决,但数据库转换成简体,就没那么容易了.经 ...
Selenium之利用Excel实现参数化
Selenium之利用Excel实现参数化说明:我是通过Workbook方式来读取excel文件的,这次以登陆界面为例备注:使用Workbook读取excel文件,前提是excel需要2003版本 ...
[Python爬虫] 之二十：Selenium +phantomjs 利用 pyquery通过搜狗搜索引擎数据
一.介绍本例子用Selenium +phantomjs 利用 pyquery通过搜狗搜索引擎数据()的资讯信息,输入给定关键字抓取资讯信息. 给定关键字:数字:融合:电视抓取信息内如下: 1.资讯 ...
利用反射修改final数据域
当final修饰一个数据域时,意义是声明该数据域是最终的,不可修改的.常见的使用场景就是eclipse自动生成的serialVersionUID一般都是final的. 另外还可以构造线程安全(thre ...
shell编程系列12--文本处理三剑客之sed利用sed修改文件内容
shell编程系列12--文本处理三剑客之sed利用sed修改文件内容修改命令对照表编辑命令 1s/old/new/ 替换第1行内容old为new ,10s/old/new/ 替换第1行到10行的 ...
[唐胡璐]Selenium技巧 - 利用MonteScreenRecorder录制视频
我们可以用以下方式在Selenium Webdriver中capture video. 基本步骤：从 http://www.randelshofer.ch/monte/，下载“MonteScreen ...

随机推荐

非阻塞同步机制和CAS
目录什么是非阻塞同步悲观锁和乐观锁 CAS 非阻塞同步机制和CAS 我们知道在java 5之前同步是通过Synchronized关键字来实现的,在java 5之后,java.util.concur ...
初篇：我与Linux
据悉,红帽认证将于本年的8月份更换Rhel7为Rhel8.所以我想趁这次机会搏一搏. 我个人是初中就神仰Linux已久,只不过那个时候的我只知道Linux系统,不知道有什么区分.奈何那 ...
JavaSE——装饰设计模式+简单加密解密工程
2019独角兽企业重金招聘Python工程师标准>>> 声明:本栏目所使用的素材都是凯哥学堂VIP学员所写,学员有权匿名,对文章有最终解释权:凯哥学堂旨在促进VIP学员互相学习的基础 ...
nginx 反向代理转发导致css，js，图片失效
为什么80%的码农都做不了架构师?>>> 需要添加以下配置 location ~ .*\.(gif|jpg|jpeg|png|bmp|swf)$ { proxy_pass htt ...
java中for循环和while循环，哪个更快？--一道面试题
for的 while的
C语言基础知识总结
知识点的回忆与巩固一. 条件分支结构 1.if分支语句 2.switch语句二.循环体部分知识点整理 1.for循环 2.while循环-适合不确定循环次数时使用三.字符串与数组数组的操作 1 ...
MySQL 8.0.20 源码安装数据库软件
官方支持的平台: https://www.mysql.com/support/supportedplatforms/database.html
nginx代理路径配置总结
一.发现问题配置nginx代理的时候,发现location配置的路径和代理的上下文路径的组合不同,服务端接收到的uri的路径不同,导致了controller的RequestMapping匹配出现问题 ...
Web概念
目录 Web概念概述 Web概念概述 JavaWeb 使用 Java 语言开发基于互联网的项目软件架构 C / S:Client / Server 客户端 / 服务器端在用户本地有一个客户端程序, ...
Spring源码阅读之配置的读取，解析
在上文中我们已经知道了Spring如何从我们给定的位置加载到配置文件,并将文件包装成一个Resource对象.这篇文章我们将要探讨的就是,如何从这个Resouce对象中加载到我们的容器?加载到容器后又 ...

Selenium RemoteWebDriver 利用CDP修改User-Agent

Selenium RemoteWebDriver 利用CDP修改User-Agent的更多相关文章

随机推荐

热门专题