Automatically recognizing captchas and logging in with Python + Selenium

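I have recently been learning to automate website logins with Python + Selenium and ran into the problem of having to enter a captcha. After searching on Baidu, I found several ways to solve captchas.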

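Method 1) Other users pointed me to ddddocr, a niche but very practical third-party library. With only a few lines of code it solves most captchas made of digits and letters. (PS: using this library also requires installing the latest opencv-python library.)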

First install the library with pip install ddddocr. After that, the sample code below returns the captcha string:

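import ddddocr

ocr = ddddocr.DdddOcr()
# Read the captcha image as raw bytes (the path is the author's local example)
with open(r'F:\Test\venv\vfi_code.png', 'rb') as f:
    img_bytes = f.read()
res = ocr.classification(img_bytes)  # recognized captcha text
print(res)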
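Since ddddocr solves most, but not all, digit+letter captchas, it can help to sanity-check the recognized string before submitting it. The following is a minimal sketch, not part of the original post: the helper name `clean_captcha_text` and the default `expected_length=4` are assumptions and should be adapted to the captcha of the target site.

```python
import string

# Characters a digit+letter captcha is assumed to contain.
ALLOWED = set(string.ascii_letters + string.digits)


def clean_captcha_text(raw, expected_length=4):
    """Return the cleaned OCR text, or None if it does not look plausible.

    expected_length is an assumption; set it to the captcha length of your site.
    """
    # Drop whitespace and any character outside the assumed alphabet.
    text = "".join(ch for ch in raw if ch in ALLOWED)
    # A result of the wrong length is most likely a misrecognition.
    if len(text) != expected_length:
        return None
    return text


print(clean_captcha_text(" a3Xk\n"))
print(clean_captcha_text("a3"))
```

When the helper returns None, the caller could refresh the captcha and run the recognition again instead of submitting a string that is probably wrong.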

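The idea behind the whole automatic login flow is:

Open the login page, take a screenshot of the whole page, crop out the captcha area, and recognize it with the third-party library ddddocr. Then locate the username, password, and captcha input boxes by XPath, simulate typing into them, and simulate a click on the login button to log in. The complete code follows: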

from selenium import webdriver
from selenium.webdriver.common.by import By
import time
import ddddocr
from PIL import Image

driver = webdriver.Chrome()
driver.maximize_window()

def get_img_code():
    driver.get('http://192.168.11.55:12345/#/login')
    time.sleep(1)  # give the page and the captcha image time to load before the screenshot
    driver.save_screenshot('web_screen.png')
    page_snap_obj = Image.open('web_screen.png')
    # XPath of the captcha image (in DevTools: right-click the element, Copy -> Copy XPath).
    # find_element_by_xpath was removed in Selenium 4.3, so find_element(By.XPATH, ...) is used.
    img = driver.find_element(By.XPATH, '//*[@id="userLayout"]/div/div[1]/form/div[3]/div/div[1]/div[2]/img')
    location = img.location
    size = img.size
    left = location['x']
    top = location['y']
    right = left + size['width']
    bottom = top + size['height']
    # Crop the captcha area out of the full-page screenshot
    image_obj = page_snap_obj.crop((left, top, right, bottom))
    image_obj.save('vfi_code.png')  # save() returns None, so there is nothing to assign
    # image_obj.show()
    ocr1 = ddddocr.DdddOcr()
    # Read back the file saved above (same relative path)
    with open('vfi_code.png', 'rb') as f:
        img_bytes = f.read()
    res = ocr1.classification(img_bytes)
    print(res)
    return res

if __name__ == '__main__':
    res = get_img_code()
    # The XPath from "Copy XPath" contained random numbers that change on every page load;
    # choosing "Copy full XPath" avoids this.
    username_input = driver.find_element(By.XPATH, '/html/body/div/div/div/div/div[1]/form/div[1]/div/div[1]/div/input')
    username_input.clear()
    username_input.send_keys('username')
    password_input = driver.find_element(By.XPATH, '/html/body/div/div/div/div/div[1]/form/div[2]/div/div[1]/div/input')
    password_input.clear()
    password_input.send_keys('password')
    code_input = driver.find_element(By.XPATH, '/html/body/div/div/div/div/div[1]/form/div[3]/div/div[1]/div[1]/div/input')
    code_input.clear()
    code_input.send_keys(res)
    time.sleep(3)
    # Click the login button
    driver.find_element(By.XPATH, '//*[@id="userLayout"]/div/div[1]/form/div[4]/div/button[2]/span').click()
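One caveat with the cropping step in get_img_code: the element's location and size are reported in CSS pixels, while on a high-DPI display the screenshot can be larger by the device pixel ratio, so the crop may miss the captcha. Below is a minimal sketch of a helper that scales the crop box. The names `crop_box` and `scale` are hypothetical and do not appear in the original code.

```python
def crop_box(location, size, scale=1.0):
    """Convert an element's location and size (CSS pixels) into a PIL crop box.

    scale is the assumed ratio of screenshot pixels to CSS pixels
    (1.0 on a standard display, often 2.0 on a high-DPI display).
    """
    left = int(location["x"] * scale)
    top = int(location["y"] * scale)
    right = int((location["x"] + size["width"]) * scale)
    bottom = int((location["y"] + size["height"]) * scale)
    return (left, top, right, bottom)


# Example values in the shape Selenium returns for img.location and img.size.
location = {"x": 100, "y": 50}
size = {"width": 80, "height": 30}
print(crop_box(location, size))             # box on a standard display
print(crop_box(location, size, scale=2.0))  # box on a 2x high-DPI display
```

The returned tuple can be passed straight to page_snap_obj.crop(). The scale could probably be read with driver.execute_script('return window.devicePixelRatio'), assuming the browser reports it accurately. Another option is the element's own screenshot(filename) method in Selenium, which captures only that element and makes the manual crop unnecessary.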

Method 2) To be added.

Reference: https://blog.csdn.net/leenhem/article/details/121507694
