Python 3.6: scraping the weather with urllib and emailing it to a girl on a schedule

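# I say good morning to my girl every day, so I set up a scheduled task that scrapes the weather each morning and sends a weather greeting email automatically.
#
# Modules involved:
#(1) Scheduled task: the Windows Task Scheduler
#    Setup tutorial: http://blog.csdn.net/wwy11/article/details/51100432
#(2) Weather scraping: uses China Weather (中国天气网), http://www.weather.com.cn/weather/101190101.shtml, where 101190101 is the city ID, looked up dynamically
#    The crawler code is in the previous post: http://www.cnblogs.com/chenyuebai/p/6728532.html
#(3) Sending the email: the code is also in the previous post
#(4) Wrap-up: the laptop shuts down automatically; that code is also in the previous post  # os.system('shutdown -s -t 1')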
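#
# crawler_tools_01 itself is in the previous post and is not listed here. As a rough idea of what a method like select_items_from_url might do, here is a minimal sketch using urllib.request and re. The names fetch_html and select_items, the User-Agent header, and the UTF-8 decoding are assumptions, not the original code.

```python
import re
import urllib.request


def fetch_html(url, timeout=10):
    """Download a page and decode it as UTF-8 (assumed encoding)."""
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def select_items(html, pattern):
    """Return all regex matches; re.S lets '.' span line breaks in the HTML."""
    return re.findall(pattern, html, re.S)


def select_items_from_url(url, pattern):
    """Hypothetical stand-in for CRAWLER.select_items_from_url."""
    return select_items(fetch_html(url), pattern)


# Offline demonstration on a small hand-written snippet (not real site output).
sample_html = '<li class="on">\n<h1>27 (today)</h1>\n<i>12C</i>\n</li>'
sample_items = select_items(sample_html, r'<h1>(.*?)</h1>.*?<i>(.*?)</i>')
print(sample_items)
```

# With more than one capture group, re.findall returns a list of tuples, which is why the script below converts items_today_tmp[0] from a tuple to a list.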

#20170427: added a retry on failure; improved the email body
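#
# The retry simply repeats the fetch once when the first attempt returns nothing. The same idea as a small standalone helper, as a sketch only: call_with_retry and flaky_fetch are made-up names and are not part of crawler_tools_01.

```python
import time


def call_with_retry(func, *args, retries=1, delay=0.0):
    """Call func(*args); while the result is empty, try again up to `retries` more times."""
    result = func(*args)
    attempts_left = retries
    while not result and attempts_left > 0:
        time.sleep(delay)
        result = func(*args)
        attempts_left -= 1
    return result


# Demonstration with a fake fetcher that returns nothing on its first call.
calls = []


def flaky_fetch(url):
    calls.append(url)
    return [] if len(calls) == 1 else [("today", "sunny")]


items = call_with_retry(flaky_fetch, "http://example.invalid/weather", retries=2)
print(items, "after", len(calls), "calls")
```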

################################################################# 
#author: 陈月白 
#_blogs: http://www.cnblogs.com/chenyuebai/ 
################################################################# 

# -*- coding: utf-8 -*-

import sys
import time
import os
import traceback
import crawler_tools_01

# curPath = os.path.abspath(os.path.dirname(__file__))
# sys.path.append(curPath)

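# City name -> weather.com.cn city ID (such as the 101190101 in the URL above).
# The IDs are blank in this listing; a blank ID is meant to fall back to the default.
city_code_dic = {
    "南京": "",
    "北京": ""
}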

class MORNING(crawler_tools_01.CRAWLER):
    # Look up the city ID and return the forecast URL
    def get_wertherUrl_by_cityName(self, cityName):
        # .get() avoids a KeyError for cities that are not in the table
        cityId = city_code_dic.get(cityName, "")
        if cityId == "":
            print("get cityId for %s failed, use default: 101010100" % cityName)
            cityId = "101010100"
        weatherUrl = "http://www.weather.com.cn/weather/" + cityId + ".shtml"
        # print(weatherUrl)
        return weatherUrl

    # Fetch today's weather
    def get_today_weather_by_weatherUrl(self, weatherUrl):
        flag_today = '<li class="sky skyid lv2 on">.*?<h1>(.*?)</h1>.*?</big>.*?title=(.*?)class.*?<span>(.*?)</span>.*?<i>(.*?)</i>.*?span title=(.*?)class=.*?<i>(.*?)</i>'
        items_today_tmp = self.select_items_from_url(weatherUrl, flag_today)
        # Retry once if nothing could be extracted from the page
        if not items_today_tmp:
            items_today_tmp = self.select_items_from_url(weatherUrl, flag_today)
            print("items_today_tmp =", items_today_tmp)
        # Convert the first match from a tuple to a list
        try:
            items_today = list(items_today_tmp[0])
            print("items_today =", items_today)
            return items_today
        except (IndexError, TypeError):
            # No match at all (empty result or None): return the raw result
            traceback.print_exc()
            print("CATCH AN ERROR AT:items_today_tmp transTo items_today")
            return items_today_tmp

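    # Build the email body from the six extracted fields
    def make_mail_body(self, items_today):
        try:
            body_text = "A beautiful day starts with my greeting~~~\n  \nToday's weather:\n%s:  %s    Temperature: %s to %s    %s  %s\n  \n \nDress for the temperature, and remember an umbrella if it is cloudy or rainy   \n                                    from Mr.ch" % (items_today[0], items_today[1], items_today[2], items_today[3], items_today[4], items_today[5])
            return body_text
        except (IndexError, TypeError):
            # Fewer fields than expected: fall back to printing whatever was extracted
            traceback.print_exc()
            body_text = "A beautiful day starts with my greeting~~~\n  \nToday's weather: %s\n  \n   \nDress for the temperature, and remember an umbrella if it is cloudy or rainy   \n      \n                                   from Mr.ch" % (items_today,)
            return body_text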

def main():
    ZMJ = MORNING()
    weatherUrl = ZMJ.get_wertherUrl_by_cityName("南京")
    print("01 weatherUrl =", weatherUrl)
    # Fetch today's weather
    items_today = ZMJ.get_today_weather_by_weatherUrl(weatherUrl)
    # Build the email body
    body_text = ZMJ.make_mail_body(items_today)
    # Send the email
    date = time.strftime('%Y-%m-%d')
    #ZMJ.send_email(["50*******@qq.com"], "Loving weather forecast_%s" % date, body_text)
    ZMJ.send_email(["50*******@qq.com", "46********@qq.com"], "Loving weather forecast_%s" % date, body_text)
    # Shut the laptop down once the email has gone out
    ZMJ.shutdown(10)

if __name__ == "__main__":
    main()
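
send_email also comes from the previous post and is not shown here. A rough sketch of how such a helper could be built on smtplib, assuming an SMTP-over-SSL server; build_message, the argument list, and the example.com addresses are hypothetical and only for illustration:

```python
import smtplib
from email.header import Header
from email.mime.text import MIMEText


def build_message(sender, receivers, subject, body_text):
    """Build a UTF-8 plain-text email for one or more receivers."""
    message = MIMEText(body_text, "plain", "utf-8")
    message["From"] = sender
    message["To"] = ", ".join(receivers)
    message["Subject"] = Header(subject, "utf-8")
    return message


def send_email(smtp_host, smtp_port, sender, password, receivers, subject, body_text):
    """Log in over SSL and send the message (the server details are assumptions)."""
    message = build_message(sender, receivers, subject, body_text)
    with smtplib.SMTP_SSL(smtp_host, smtp_port) as server:
        server.login(sender, password)
        server.sendmail(sender, receivers, message.as_string())


# Building the message needs no network access, so it can be checked offline.
demo = build_message(
    "me@example.com",
    ["a@example.com", "b@example.com"],
    "Weather_2017-04-27",
    "Sunny, 12 to 20",
)
print(demo["To"])
```

Keeping message construction separate from sending makes it possible to inspect the body before the scheduled task ever touches the mail server.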

