性能分析和调优工具简介

总会遇到一个时候你会想提高程序执行效率，想看看哪部分耗时长成为瓶颈，想知道程序运行时内存和CPU使用情况。这时候你会需要一些方法对程序进行性能分析和调优。

Context Manager

可以上下文管理器自己实现一个计时器, 参见之前的介绍 timeit 文章里做的那样，通过定义类的 __enter__ 和 __exit__ 方法来实现对管理的函数计时, 类似如:

# timer.py

import time

class Timer(object):

    def __init__(self, verbose=False):

        self.verbose = verbose

    def __enter__(self):

        self.start = time.time()

        return self

    def __exit__(self, *args):

        self.end = time.time()

        self.secs = self.end - self.start

        self.msecs = self.secs * 1000            # 毫秒

        if self.verbose:

            print 'elapsed time: %f ms' % self.msecs

使用方式如下:

from timer import Timer

with Timer() as t:

    foo()

print "=> foo() spends %s s" % t.secs

Decorator

然而我认为装饰器的方式更加优雅

import time

from functools import wraps

def timer(function):

    @wraps(function)

    def function_timer(*args, **kwargs):

        t0 = time.time()

        result = function(*args, **kwargs)

        t1 = time.time()

        print ("Total time running %s: %s seconds" %

                (function.func_name, str(t1-t0))

                )

        return result

    return function_timer

使用就很简单了:

@timer

def my_sum(n):

    return sum([i for i in range(n)])

if __name__ == "__main__":

    my_sum(10000000)

运行结果:

➜  python profile.py

Total time running my_sum: 0.817697048187 seconds

系统自带的time命令

使用示例如下:

➜ time python profile.py

Total time running my_sum: 0.854454040527 seconds

python profile.py  0.79s user 0.18s system 98% cpu 0.977 total

上面的结果说明: 执行脚本消耗0.79sCPU时间， 0.18秒执行内核函数消耗的时间，总共0.977s时间。

其中， total时间 - (user时间 + system时间) = 消耗在输入输出和系统执行其它任务消耗的时间

python timeit 模块

可以用来做benchmark, 可以方便的重复一个程序执行的次数，来查看程序可以运行多块。具体参考之前写的文章。

cProfile

直接看带注释的使用示例吧。

#coding=utf8

def sum_num(max_num):

    total = 0

    for i in range(max_num):

        total += i

    return total

def test():

    total = 0

    for i in range(40000):

        total += i

    t1 = sum_num(100000)

    t2 = sum_num(200000)

    t3 = sum_num(300000)

    t4 = sum_num(400000)

    t5 = sum_num(500000)

    test2()

    return total

def test2():

    total = 0

    for i in range(40000):

        total += i

    t6 = sum_num(600000)

    t7 = sum_num(700000)

    return total

if __name__ == "__main__":

    import cProfile

    # # 直接把分析结果打印到控制台

    # cProfile.run("test()")

    # # 把分析结果保存到文件中

    # cProfile.run("test()", filename="result.out")

    # 增加排序方式

    cProfile.run("test()", filename="result.out", sort="cumulative")

cProfile将分析的结果保存到result.out文件中，但是以二进制形式存储的，想直接查看的话用提供的 pstats 来查看。

import pstats

# 创建Stats对象

p = pstats.Stats("result.out")

# strip_dirs(): 去掉无关的路径信息

# sort_stats(): 排序，支持的方式和上述的一致

# print_stats(): 打印分析结果，可以指定打印前几行

# 和直接运行cProfile.run("test()")的结果是一样的

p.strip_dirs().sort_stats(-1).print_stats()

# 按照函数名排序，只打印前3行函数的信息, 参数还可为小数,表示前百分之几的函数信息

p.strip_dirs().sort_stats("name").print_stats(3)

# 按照运行时间和函数名进行排序

p.strip_dirs().sort_stats("cumulative", "name").print_stats(0.5)

# 如果想知道有哪些函数调用了sum_num

p.print_callers(0.5, "sum_num")

# 查看test()函数中调用了哪些函数

p.print_callees("test")

截取一个查看test()调用了哪些函数的输出示例:

➜  python python profile.py

   Random listing order was used

   List reduced from 6 to 2 due to restriction <'test'>

Function              called...

                          ncalls  tottime  cumtime

profile.py:24(test2)  ->       2    0.061    0.077  profile.py:3(sum_num)

                               1    0.000    0.000  {range}

profile.py:10(test)   ->       5    0.073    0.094  profile.py:3(sum_num)

                               1    0.002    0.079  profile.py:24(test2)

                               1    0.001    0.001  {range}

profile.Profile

cProfile还提供了可以自定义的类，可以更精细的分析, 具体看文档。

格式如： class profile.Profile(timer=None, timeunit=0.0, subcalls=True, builtins=True)

下面这个例子来自官方文档:

import cProfile, pstats, StringIO

pr = cProfile.Profile()

pr.enable()

# ... do something ...

pr.disable()

s = StringIO.StringIO()

sortby = 'cumulative'

ps = pstats.Stats(pr, stream=s).sort_stats(sortby)

ps.print_stats()

print s.getvalue()

line_profiler

line_{profiler是一个对函数进行逐行性能分析的工具，可以参见github项目说明，地址}: https://github.com/rkern/line_profiler

示例

#coding=utf8

def sum_num(max_num):

    total = 0

    for i in range(max_num):

        total += i

    return total

@profile                     # 添加@profile 来标注分析哪个函数

def test():

    total = 0

    for i in range(40000):

        total += i

    t1 = sum_num(10000000)

    t2 = sum_num(200000)

    t3 = sum_num(300000)

    t4 = sum_num(400000)

    t5 = sum_num(500000)

    test2()

    return total

def test2():

    total = 0

    for i in range(40000):

        total += i

    t6 = sum_num(600000)

    t7 = sum_num(700000)

    return total

test()

通过 kernprof 命令来注入分析，运行结果如下：

➜ kernprof -l -v profile.py

Wrote profile results to profile.py.lprof

Timer unit: 1e-06 s

Total time: 3.80125 s

File: profile.py

Function: test at line 10

Line #      Hits         Time  Per Hit   % Time  Line Contents

==============================================================

    10                                           @profile

    11                                           def test():

    12         1            5      5.0      0.0      total = 0

    13     40001        19511      0.5      0.5      for i in range(40000):

    14     40000        19066      0.5      0.5          total += i

    15

    16         1      2974373 2974373.0     78.2      t1 = sum_num(10000000)

    17         1        58702  58702.0      1.5      t2 = sum_num(200000)

    18         1        81170  81170.0      2.1      t3 = sum_num(300000)

    19         1       114901 114901.0      3.0      t4 = sum_num(400000)

    20         1       155261 155261.0      4.1      t5 = sum_num(500000)

    21         1       378257 378257.0     10.0      test2()

    22

    23         1            2      2.0      0.0      return total

hits（执行次数）和 time（耗时）值高的地方是有比较大优化空间的地方。

memory_profiler

类似于"line_profiler"对基于行分析程序内存使用情况的模块。github 地址：https://github.com/fabianp/memory_profiler 。ps:安装 psutil, 会分析的更快。

同样是上面"line_profiler"中的代码，运行 python -m memory_profiler profile.py 命令生成结果如下:

➜ python -m memory_profiler profile.py

Filename: profile.py

Line #    Mem usage    Increment   Line Contents

================================================

    10   24.473 MiB    0.000 MiB   @profile

    11                             def test():

    12   24.473 MiB    0.000 MiB       total = 0

    13   25.719 MiB    1.246 MiB       for i in range(40000):

    14   25.719 MiB    0.000 MiB           total += i

    15

    16  335.594 MiB  309.875 MiB       t1 = sum_num(10000000)

    17  337.121 MiB    1.527 MiB       t2 = sum_num(200000)

    18  339.410 MiB    2.289 MiB       t3 = sum_num(300000)

    19  342.465 MiB    3.055 MiB       t4 = sum_num(400000)

    20  346.281 MiB    3.816 MiB       t5 = sum_num(500000)

    21  356.203 MiB    9.922 MiB       test2()

    22

    23  356.203 MiB    0.000 MiB       return total

TODO objgraph

参考资料:

Python 性能分析工具简介的更多相关文章

Python性能分析工具Profile
Python性能分析工具Profile 代码优化的前提是需要了解性能瓶颈在什么地方,程序运行的主要时间是消耗在哪里,对于比较复杂的代码可以借助一些工具来定位,python 内置了丰富的性能分析工具,如 ...
Python性能分析工具
import cProfile import pstats from flask import Flask,jsonify, request @app.route("/test", ...
cProfile——Python性能分析工具
Python自带了几个性能分析的模块:profile.cProfile和hotshot,使用方法基本都差不多,无非模块是纯Python还是用C写的.本文介绍cProfile. 例子 import t ...
Python性能分析
Python性能分析 https://www.cnblogs.com/lrysjtu/p/5651816.html https://www.cnblogs.com/cbscan/articles/33 ...
系统级性能分析工具perf的介绍与使用
测试环境:Ubuntu16.04(在VMWare虚拟机使用perf top存在无法显示问题) Kernel:3.13.0-32 系统级性能优化通常包括两个阶段:性能剖析(performance pro ...
Linux性能分析工具与图形化方法
欢迎大家前往腾讯云+社区,获取更多腾讯海量技术实践干货哦~. 作者:赵坤|腾讯魔王工作室后台开发工程师在项目开发中,经常会遇到程序启动时间过长.CPU使用率过高等问题,这个时候需要依靠性能分析工具来 ...
系统级性能分析工具perf的介绍与使用[转]
测试环境:Ubuntu16.04(在VMWare虚拟机使用perf top存在无法显示问题) Kernel:3.13.0-32 系统级性能优化通常包括两个阶段:性能剖析(performance pro ...
Python性能分析与优化PDF高清完整版免费下载|百度云盘
百度云盘|Python性能分析与优化PDF高清完整版免费下载提取码:ubjt 内容简介全面掌握Python代码性能分析和优化方法,消除性能瓶颈,迅速改善程序性能! 对于Python程序员来说,仅仅 ...
如何进行python性能分析？
在分析python代码性能瓶颈,但又不想修改源代码的时候,ipython shell以及第三方库提供了很多扩展工具,可以不用在代码里面加上统计性能的装饰器,也能很方便直观的分析代码性能.下面以我自己实 ...

随机推荐

WCF SecurityElement
1.SslStreamSecurityElement <sslStreamSecurity requireClientCertificate="true"/> Requ ...
C# 多线程之线程控制
方案一: 调用线程控制方法.启动:Thread.Start();停止:Thread.Abort();暂停:Thread.Suspend();继续:Thread.Resume(); private vo ...
自动布局库--Masonry使用
参考资料(戳这里): > Masonry官网 > Masonry介绍与使用实践(快速上手Autolayout) > iOS 开发实践之 Auto Layout > Ma ...
swift第一课快速体验playground
最近听说苹果要大力推行swift语言,所以我必须要赶快好好学一学,今天做第一个就遇到问题. 在Xcode7.2欢迎界面,选中创建第一个,我们一般都是默认创建第二个. 创建完后,出现问题了,提示如下: ...
基于HttpClient的新版正方教务系统模拟登录及信息获取API
简介通过HttpClient获取网页数据源,通过Jsoup解析数据.先模拟登录,再获取信息.模拟浏览器正常操作,封装请求头信息获取SESSIONID.模拟登录成功后切勿断开会话,依赖登录请求得到的C ...
无法定位程序输入点到_ftol2于动态链接库msvcrt.dll的错误的解决
作者:朱金灿来源:http://blog.csdn.net/clever101 今天同事在Windows XP系统上运行程序遇到这样一个错误: 我试了一下,在Win7上运行则没有这个错误.只是程序运 ...
C#使用Aspose.Words操作word文档
最近接到个需求,由于客服这边要导出大量有一定规则的word文件,里面的内容希望系统自动填充,例如这里我使用Aspose.Words.dll这个类库, 1.首先,我们需要创建模板文件,毕竟有规则的东西 ...
如何处理SAP HANA Web-Based Development Workbench的403 Forbidden错误
打开SAP云平台上的SAP HANA Web-Based Development Workbench超链接: 遇到错误信息:403 - Forbidden - The server refused t ...
TFS看板的迭代规划
故事点故事点更多体现的是用户情景或者bug的规模,采用斐波拉契数列(1,2,3,5,8,13)这样的数字表示,包含如下内容: 相对工作量复杂度风险和不确定性相对工作量下面演示一个Case来说 ...
Web前端学习流程

Python 性能分析工具简介

Table of Contents

性能分析和调优工具简介

Context Manager

Decorator

系统自带的time命令

python timeit 模块

cProfile

profile.Profile

line_profiler

示例

memory_profiler

TODO objgraph

参考资料:

Python 性能分析工具简介的更多相关文章

随机推荐

热门专题

Python 性能分析工具简介

Table of Contents

性能分析和调优工具简介

Context Manager

Decorator

系统自带的time命令

python timeit 模块

cProfile

profile.Profile

lineprofiler

示例

memoryprofiler

TODO objgraph

参考资料:

Python 性能分析工具简介的更多相关文章

随机推荐

热门专题

line_profiler

memory_profiler