Python实现网页截图（PyQT5）

方案说明

功能要求：实现网页加载后将页面截取成长图片
涉及模块：PyQT5 PIL
逻辑说明：

1：完成窗口设置，利用PyQT5 QWebEngineView加载网页地址，待网页加载完成后，调用check_pag；

class MainWindow(QMainWindow):

    def __init__(self, parent=None):

        super(MainWindow, self).__init__(parent)

        self.setWindowTitle('易哈佛')

        self.temp_height = 0

        self.setWindowFlag(Qt.WindowMinMaxButtonsHint, False)  # 禁用最大化，最小化

        # self.setWindowFlag(Qt.WindowStaysOnTopHint, True)  # 窗口顶置

        self.setWindowFlag(Qt.FramelessWindowHint, True)  # 窗口无边框

    def urlScreenShot(self, url):

        self.browser = QWebEngineView()

        self.browser.load(QUrl(url))

        geometry = self.chose_screen()

        self.setGeometry(geometry)

        self.browser.loadFinished.connect(self.check_page)

        self.setCentralWidget(self.browser)

    def get_page_size(self):

        size = self.browser.page().contentsSize()

        self.set_height = size.height()

        self.set_width = size.width()

        return size.width(), size.height()

    def chose_screen(self):

        width, height = 750, 1370

        desktop = QApplication.desktop()

        screen_count = desktop.screenCount()

        for i in range(0, screen_count):

            rect = desktop.availableGeometry(i)

            s_width, s_height = rect.width(), rect.height()

            if s_width > width and s_height > height:

                return QRect(rect.left(), rect.top(), width, height)

        return QRect(0, 0, width, height)

if __name__ == '__main__':

    app = QApplication(sys.argv)

    win = MainWindow()

    win.show()

    app.exit(app.exec_())

2：收集页面高度，并计算分次截屏的次数和余量高度；实例化图片合并工具，设置定时器，超时信号发出后，执行exe_command；

    def check_page(self):

        p_width, p_height = self.get_page_size()

        self.page, self.over_flow_size = divmod(p_height, self.height())

        if self.page == 0:

            self.page = 1

        self.ssm = ScreenShotMerge(self.page, self.over_flow_size)

        self.timer = QTimer(self)

        self.timer.timeout.connect(self.exe_command)

        self.timer.setInterval(400)

        self.timer.start()

3：exe_command用来控制截图次数，并在每次截图完成后控制网页向下滑屏幕的高度；所有的页面都已截取时，完成图片合并。

    def exe_command(self):

        if self.page > 0:

            self.screen_shot()

            self.run_js()

        elif self.page < 0:

            self.timer.stop()

            self.ssm.image_merge()

            self.close()

        elif self.over_flow_size > 0:

            self.screen_shot()

        self.page -= 1

    def run_js(self):

        script = """

            var scroll = function (dHeight) {

            var t = document.documentElement.scrollTop

            var h = document.documentElement.scrollHeight

            dHeight = dHeight || 0

            var current = t + dHeight

            if (current > h) {

                window.scrollTo(0, document.documentElement.clientHeight)

              } else {

                window.scrollTo(0, current)

              }

            }

        """

        command = script + '\n scroll({})'.format(self.height())

        self.browser.page().runJavaScript(command)

4：screen_shot在每次截图完成后将图片保存，并将图片对象由图片合并根据保存到列表中。

   def screen_shot(self):

        screen = QApplication.primaryScreen()

        winid = self.browser.winId()

        pix = screen.grabWindow(int(winid))

        name = '{}/temp.png'.format(self.ssm.root_path)

        pix.save(name)

        self.ssm.add_im(name)

5：截图合并工具，在每次截图完成后将图片对象保存，完成余量截图的重绘和截图的合并。

class ScreenShotMerge():

    def __init__(self, page, over_flow_size):

        self.im_list = []

        self.page = page

        self.over_flow_size = over_flow_size

        self.get_path()

    def get_path(self):

        self.root_path = Path(__file__).parent.joinpath('temp')

        if not self.root_path.exists():

            self.root_path.mkdir(parents=True)

        self.save_path = self.root_path.joinpath('merge.png')

    def add_im(self, path):

        if len(self.im_list) == self.page:

            im = self.reedit_image(path)

        else:

            im = Image.open(path)

        im.save('{}/{}.png'.format(self.root_path, len(self.im_list) + 1))

        self.im_list.append(im)

    def get_new_size(self):

        max_width = 0

        total_height = 0

        # 计算合成后图片的宽度（以最宽的为准）和高度

        for img in self.im_list:

            width, height = img.size

            if width > max_width:

                max_width = width

            total_height += height

        return max_width, total_height

    def image_merge(self, ):

        if len(self.im_list) > 1:

            max_width, total_height = self.get_new_size()

            # 产生一张空白图

            new_img = Image.new('RGB', (max_width - 15, total_height), 255)

            x = y = 0

            for img in self.im_list:

                width, height = img.size

                new_img.paste(img, (x, y))

                y += height

            new_img.save(self.save_path)

            print('截图成功:', self.save_path)

        else:

            obj = self.im_list[0]

            width, height = obj.size

            left, top, right, bottom = 0, 0, width, height

            box = (left, top, right, bottom)

            region = obj.crop(box)

            new_img = Image.new('RGB', (width, height), 255)

            new_img.paste(region, box)

            new_img.save(self.save_path)

            print('截图成功:', self.save_path)

    def reedit_image(self, path):

        obj = Image.open(path)

        width, height = obj.size

        left, top, right, bottom = 0, height - self.over_flow_size, width, height

        box = (left, top, right, bottom)

        region = obj.crop(box)

        return region

截图功能完整代码

#!/usr/bin/env python

# -*- coding:UTF-8 -*-

# Author:Leslie-x

import sys

from PyQt5.QtCore import *

from PyQt5.QtWidgets import *

from PyQt5.QtWebEngineWidgets import *

from PIL import Image

from pathlib import Path

class ScreenShotMerge():

    def __init__(self, page, over_flow_size):

        self.im_list = []

        self.page = page

        self.over_flow_size = over_flow_size

        self.get_path()

    def get_path(self):

        self.root_path = Path(__file__).parent.joinpath('temp')

        if not self.root_path.exists():

            self.root_path.mkdir(parents=True)

        self.save_path = self.root_path.joinpath('merge.png')

    def add_im(self, path):

        if len(self.im_list) == self.page:

            im = self.reedit_image(path)

        else:

            im = Image.open(path)

        im.save('{}/{}.png'.format(self.root_path, len(self.im_list) + 1))

        self.im_list.append(im)

    def get_new_size(self):

        max_width = 0

        total_height = 0

        # 计算合成后图片的宽度（以最宽的为准）和高度

        for img in self.im_list:

            width, height = img.size

            if width > max_width:

                max_width = width

            total_height += height

        return max_width, total_height

    def image_merge(self, ):

        if len(self.im_list) > 1:

            max_width, total_height = self.get_new_size()

            # 产生一张空白图

            new_img = Image.new('RGB', (max_width - 15, total_height), 255)

            x = y = 0

            for img in self.im_list:

                width, height = img.size

                new_img.paste(img, (x, y))

                y += height

            new_img.save(self.save_path)

            print('截图成功:', self.save_path)

        else:

            obj = self.im_list[0]

            width, height = obj.size

            left, top, right, bottom = 0, 0, width, height

            box = (left, top, right, bottom)

            region = obj.crop(box)

            new_img = Image.new('RGB', (width, height), 255)

            new_img.paste(region, box)

            new_img.save(self.save_path)

            print('截图成功:', self.save_path)

    def reedit_image(self, path):

        obj = Image.open(path)

        width, height = obj.size

        left, top, right, bottom = 0, height - self.over_flow_size, width, height

        box = (left, top, right, bottom)

        region = obj.crop(box)

        return region

class MainWindow(QMainWindow):

    def __init__(self, parent=None):

        super(MainWindow, self).__init__(parent)

        self.setWindowTitle('易哈佛')

        self.temp_height = 0

        self.setWindowFlag(Qt.WindowMinMaxButtonsHint, False)  # 禁用最大化，最小化

        # self.setWindowFlag(Qt.WindowStaysOnTopHint, True)  # 窗口顶置

        self.setWindowFlag(Qt.FramelessWindowHint, True)  # 窗口无边框

    def urlScreenShot(self, url):

        self.browser = QWebEngineView()

        self.browser.load(QUrl(url))

        geometry = self.chose_screen()

        self.setGeometry(geometry)

        self.browser.loadFinished.connect(self.check_page)

        self.setCentralWidget(self.browser)

    def get_page_size(self):

        size = self.browser.page().contentsSize()

        self.set_height = size.height()

        self.set_width = size.width()

        return size.width(), size.height()

    def chose_screen(self):

        width, height = 750, 1370

        desktop = QApplication.desktop()

        screen_count = desktop.screenCount()

        for i in range(0, screen_count):

            rect = desktop.availableGeometry(i)

            s_width, s_height = rect.width(), rect.height()

            if s_width > width and s_height > height:

                return QRect(rect.left(), rect.top(), width, height)

        return QRect(0, 0, width, height)

    def check_page(self):

        p_width, p_height = self.get_page_size()

        self.page, self.over_flow_size = divmod(p_height, self.height())

        if self.page == 0:

            self.page = 1

        self.ssm = ScreenShotMerge(self.page, self.over_flow_size)

        self.timer = QTimer(self)

        self.timer.timeout.connect(self.exe_command)

        self.timer.setInterval(400)

        self.timer.start()

    def exe_command(self):

        if self.page > 0:

            self.screen_shot()

            self.run_js()

        elif self.page < 0:

            self.timer.stop()

            self.ssm.image_merge()

            self.close()

        elif self.over_flow_size > 0:

            self.screen_shot()

        self.page -= 1

    def run_js(self):

        script = """

            var scroll = function (dHeight) {

            var t = document.documentElement.scrollTop

            var h = document.documentElement.scrollHeight

            dHeight = dHeight || 0

            var current = t + dHeight

            if (current > h) {

                window.scrollTo(0, document.documentElement.clientHeight)

              } else {

                window.scrollTo(0, current)

              }

            }

        """

        command = script + '\n scroll({})'.format(self.height())

        self.browser.page().runJavaScript(command)

    def screen_shot(self):

        screen = QApplication.primaryScreen()

        winid = self.browser.winId()

        pix = screen.grabWindow(int(winid))

        name = '{}/temp.png'.format(self.ssm.root_path)

        pix.save(name)

        self.ssm.add_im(name)

if __name__ == '__main__':

    url = 'http://blog.sina.com.cn/lm/rank/focusbang//'

    app = QApplication(sys.argv)

    win = MainWindow()

    win.urlScreenShot(url)

    win.show()

    app.exit(app.exec_())

Python实现网页截图（PyQT5）的更多相关文章

Python中使用 Selenium 实现网页截图实例
Selenium 是一个可以让浏览器自动化地执行一系列任务的工具,常用于自动化测试.不过,也可以用来给网页截图.目前,它支持 Java.C#.Ruby 以及 Python 四种客户端语言.如果你使用 ...
Python各种花式截图工具，截到你手软
前言: 最近,项目中遇到了一个关于实现通过给定URL,实现对网页屏幕进行截图的一个功能,前面代码中已经用python的第三方库实现了截图功能,但在上线以后出现了一些bug,所以就改bug的任务就落在了 ...
Python下载网页的几种方法
get和post方式总结 get方式:以URL字串本身传递数据参数,在服务器端可以从'QUERY_STRING'这个变量中直接读取,效率较高,但缺乏安全性,也无法来处理复杂的数据(只能是字符串,比如在 ...
使用PhantomJS实现网页截图服务
这是上半年遇到的一个小需求,想实现网页的抓取,并保存为图片.研究了不少工具,效果都不理想,不是显示太差了(Canvas.Html2Image.Cobra),就是性能不怎么样(如SWT的Brower). ...
html2canvas 网页截图下载上传
利用html2canvas插件对网页截图并下载和上传图片. <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//E ...
Python编写网页爬虫爬取oj上的代码信息
OJ升级,代码可能会丢失. 所以要事先备份. 一開始傻傻的复制粘贴, 后来实在不能忍, 得益于大潇的启示和聪神的原始代码, 网页爬虫走起! 已经有段时间没看Python, 这次网页爬虫的原始代码是 p ...
爬虫学习笔记（1）-- 利用Python从网页抓取数据
最近想从一个网站上下载资源,懒得一个个的点击下载了,想写一个爬虫把程序全部下载下来,在这里做一个简单的记录 Python的基础语法在这里就不多做叙述了,黑马程序员上有一个基础的视频教学,可以跟着学习一 ...
iPhone 收藏网址[添加到书签] 和 [添加到主屏幕] 显示自定义图标,而不是网页截图
iPhone 收藏网址[添加到书签] 和 [添加到主屏幕] 显示自定义图标,而不是网页截图:  <link rel="sh ...
chrome也可以整张网页截图,保存完整网页为图片
转自:http://www.webkaka.com/blog/archives/chrome-save-a-webpage.html 关于浏览器截图,一直以为Chrome无能为力,最近发现,原来Chr ...

随机推荐

C#代码分析--阅读程序，回答问题
阅读下面程序,请回答如下问题: 问题1:这个程序要找的是符合什么条件的数? 问题2:这样的数存在么?符合这一条件的最小的数是什么? 问题3:在电脑上运行这一程序,你估计多长时间才能输出第一个结果?时间 ...
实现项目WC
软件的需求分析程序处理用户需求的模式为: wc.exe [parameter][filename] 在[parameter]中,用户通过输入参数与程序交互,需实现的功能如下: 1.基本功能支持 - ...
ElasticSearch 2 (34) - 信息聚合系列之多值排序
ElasticSearch 2 (34) - 信息聚合系列之多值排序摘要多值桶(terms.histogram 和 date_histogram)动态生成很多桶,Elasticsearch 是如何 ...
php artisan 命令列表
php artisan 命令列表命令获取上面的翻译内容命令说明备注 php artisan make:resource ? 创建api返回格式化资源 >=5.4版本可用 php ar ...
[转帖]在VMware ESXi服务器上配置NAT上网需要学习一下。
http://blog.51cto.com/boytnt/1292487 在使用VMware workstation的时候,我们经常以NAT的方式配置虚拟机的网络,与桥接方式相比,这样配置可以让虚拟机 ...
qemu-img.exe 工具简介
1. 下载地址 https://cloudbase.it/qemu-img-windows/ 2. 解压缩然后扔到 system32目录下或者是修改环境变量-- 我很懒,我决定扔到system3 ...
sqlserver 比较两个表的列
一.问题给了两个各有四五十个列的表,找出他们相同的列和不同的列二.查询两个表的列,存在临时表 --#a ,#b都是临时表,当前连接断开后自动删除--RANK() OVER (ORDER BY sy ...
SSM框架 mapper.xml中 value的空值判断问题
先看解决方案,其他的都是问题的出处解决方案:if中使用 _parameter,#{value}不变 <if test="_parameter!='' and _parameter!= ...
【转】Altium Designer 3D封装下载及导入教程
首先先晒几个图:是不是很逼真啊.. ---------------------------------------教程---------------------------------------- ...
luogu1941 [NOIp2014]飞扬的小鸟 (dp)
设f[i][j]为到达(i,j)这个位置的最小操作数就有$f[i][j]=min\{f[i-1][j+Y[i-1]],f[i-1][j-X[i-1]*k]+k\}$ 然后考虑优化一下转移: 对于一系 ...

Python实现网页截图（PyQT5）

方案说明

1：完成窗口设置，利用PyQT5 QWebEngineView加载网页地址，待网页加载完成后，调用check_pag；

2：收集页面高度，并计算分次截屏的次数和余量高度；实例化图片合并工具，设置定时器，超时信号发出后，执行exe_command；

3：exe_command用来控制截图次数，并在每次截图完成后控制网页向下滑屏幕的高度；所有的页面都已截取时，完成图片合并。

4：screen_shot在每次截图完成后将图片保存，并将图片对象由图片合并根据保存到列表中。

5：截图合并工具，在每次截图完成后将图片对象保存，完成余量截图的重绘和截图的合并。

截图功能完整代码

Python实现网页截图（PyQT5）的更多相关文章

随机推荐

热门专题