pandas-10 pd.pivot_table()透视表功能

和excel一样，pandas也有一个透视表的功能，具体demo如下：

import numpy as np

import pandas as pd

from pandas import Series, DataFrame

#显示所有列

pd.set_option('display.max_columns', None)

#显示所有行

pd.set_option('display.max_rows', None)

#设置value的显示长度为100，默认为50

pd.set_option('max_colwidth',100)

df = pd.read_excel('./sales-funnel.xlsx')

print(df.head())

'''

   Account                          Name            Rep       Manager  \

0   714466               Trantow-Barrows   Craig Booker  Debra Henley

1   714466               Trantow-Barrows   Craig Booker  Debra Henley

2   714466               Trantow-Barrows   Craig Booker  Debra Henley

3   737550  Fritsch, Russel and Anderson   Craig Booker  Debra Henley

4   146832                  Kiehn-Spinka  Daniel Hilton  Debra Henley   

       Product  Quantity  Price     Status

0          CPU         1  30000  presented

1     Software         1  10000  presented

2  Maintenance         2   5000    pending

3          CPU         1  35000   declined

4          CPU         2  65000        won

'''

print(pd.pivot_table(df, index=['Name']))

'''

                              Account   Price  Quantity

Name

Barton LLC                     740150   35000  1.000000

Fritsch, Russel and Anderson   737550   35000  1.000000

Herman LLC                     141962   65000  2.000000

Jerde-Hilpert                  412290    5000  2.000000

Kassulke, Ondricka and Metz    307599    7000  3.000000

Keeling LLC                    688981  100000  5.000000

Kiehn-Spinka                   146832   65000  2.000000

Koepp Ltd                      729833   35000  2.000000

Kulas Inc                      218895   25000  1.500000

Purdy-Kunde                    163416   30000  1.000000

Stokes LLC                     239344    7500  1.000000

Trantow-Barrows                714466   15000  1.333333

对名字进行了去重，将每个人的销售记录取进行统计，上例是求了均值。

这是由aggfunc参数来决定的。

'''

print(pd.pivot_table(df, index=['Name'], aggfunc='sum'))

'''

                              Account   Price  Quantity

Name

Barton LLC                     740150   35000         1

Fritsch, Russel and Anderson   737550   35000         1

Herman LLC                     141962   65000         2

Jerde-Hilpert                  412290    5000         2

Kassulke, Ondricka and Metz    307599    7000         3

Keeling LLC                    688981  100000         5

Kiehn-Spinka                   146832   65000         2

Koepp Ltd                     1459666   70000         4

Kulas Inc                      437790   50000         3

Purdy-Kunde                    163416   30000         1

Stokes LLC                     478688   15000         2

Trantow-Barrows               2143398   45000         4

'''

print(pd.pivot_table(df, index=['Name', 'Rep', 'Manager']))

'''

                                                          Account    ...     Quantity

Name                         Rep           Manager                   ...

Barton LLC                   John Smith    Debra Henley    740150    ...     1.000000

Fritsch, Russel and Anderson Craig Booker  Debra Henley    737550    ...     1.000000

Herman LLC                   Cedric Moss   Fred Anderson   141962    ...     2.000000

Jerde-Hilpert                John Smith    Debra Henley    412290    ...     2.000000

Kassulke, Ondricka and Metz  Wendy Yule    Fred Anderson   307599    ...     3.000000

Keeling LLC                  Wendy Yule    Fred Anderson   688981    ...     5.000000

Kiehn-Spinka                 Daniel Hilton Debra Henley    146832    ...     2.000000

Koepp Ltd                    Wendy Yule    Fred Anderson   729833    ...     2.000000

Kulas Inc                    Daniel Hilton Debra Henley    218895    ...     1.500000

Purdy-Kunde                  Cedric Moss   Fred Anderson   163416    ...     1.000000

Stokes LLC                   Cedric Moss   Fred Anderson   239344    ...     1.000000

Trantow-Barrows              Craig Booker  Debra Henley    714466    ...     1.333333

'''

print(pd.pivot_table(df, index=['Manager', 'Rep']))

# manager 和 rep 之间 存在 一对多的 关系

'''

                              Account         Price  Quantity

Manager       Rep

Debra Henley  Craig Booker   720237.0  20000.000000  1.250000

              Daniel Hilton  194874.0  38333.333333  1.666667

              John Smith     576220.0  20000.000000  1.500000

Fred Anderson Cedric Moss    196016.5  27500.000000  1.250000

              Wendy Yule     614061.5  44250.000000  3.000000

'''

print(pd.pivot_table(df, index=['Manager', 'Rep'], values=['Price', 'Quantity']))

'''

                                    Price  Quantity

Manager       Rep

Debra Henley  Craig Booker   20000.000000  1.250000

              Daniel Hilton  38333.333333  1.666667

              John Smith     20000.000000  1.500000

Fred Anderson Cedric Moss    27500.000000  1.250000

              Wendy Yule     44250.000000  3.000000

'''

print(pd.pivot_table(df, index=['Manager', 'Rep'], values=['Price', 'Quantity'], columns=['Product']))

'''

                               Price               ...    Quantity

Product                          CPU Maintenance   ...     Monitor Software

Manager       Rep                                  ...

Debra Henley  Craig Booker   32500.0      5000.0   ...         NaN      1.0

              Daniel Hilton  52500.0         NaN   ...         NaN      1.0

              John Smith     35000.0      5000.0   ...         NaN      NaN

Fred Anderson Cedric Moss    47500.0      5000.0   ...         NaN      1.0

              Wendy Yule     82500.0      7000.0   ...         2.0      NaN

由以上输出可以看出，当column指定为product之后，price和quantity进行了细分，将每个product的详情列出。

另外还可以设置一个fill_value的参数，可以将nan填充为某个值。

'''

'''

总结：

    使用透视表之前，需要对原始数据有一个大概的了解，这样生成的透视表才能够有意义。

'''

pandas-10 pd.pivot_table()透视表功能的更多相关文章

pandas之groupby分组与pivot_table透视表
zhuanzi: https://blog.csdn.net/qq_33689414/article/details/78973267 pandas之groupby分组与pivot_table透视表 ...
Excel学习 -- 数据透视表功能
Excel -- 数据透视表基础数据透视表(Pivot Table)是一种交互式的表,可以进行某些计算,如求和与计数等.所进行的计算与数据跟数据透视表中的排列有关. 之所以称为数据透视表,是因 ...
python pandas使用数据透视表
1) 官网啰嗦这一堆, pandas.pivot_table函数中包含四个主要的变量,以及一些可选择使用的参数.四个主要的变量分别是数据源data,行索引index,列columns,和数值value ...
【Excle数据透视表】如何移动数据透视表的位置
数据透视表创建完成了,现在需要将它移动到D5位置,如何移动呢? 解决办法通过"移动数据透视表"功能实现数据透视表的位置移动步骤1 单击数据透视表任意单元格→数据透视表工具→分析 ...
Python中pandas透视表pivot_table功能详解（非常简单易懂）
一文看懂pandas的透视表pivot_table 一.概述 1.1 什么是透视表? 透视表是一种可以对数据动态排布并且分类汇总的表格格式.或许大多数人都在Excel使用过数据透视表,也体会到它的强大 ...
Pandas透视表（pivot_table）详解
介绍也许大多数人都有在Excel中使用数据透视表的经历,其实Pandas也提供了一个类似的功能,名为pivot_table.虽然pivot_table非常有用,但是我发现为了格式化输出我所需要的内容 ...
pandas实现excel中的数据透视表和Vlookup函数功能
在孩子王实习中做的一个小工作,方便整理数据. 目前这几行代码是实现了一个数据透视表和匹配的功能,但是将做好的结果写入了不同的excel中, 如何实现将结果连续保存到同一个Excel的同一个工作表中?还 ...
小白学 Python 数据分析（12）：Pandas （十一）数据透视表（pivot_table）
人生苦短,我用 Python 前文传送门: 小白学 Python 数据分析(1):数据分析基础小白学 Python 数据分析(2):Pandas (一)概述小白学 Python 数据分析(3):P ...
pandas常用操作详解(复制别人的)——数据透视表操作：pivot_table()
原文链接:https://www.cnblogs.com/Yanjy-OnlyOne/p/11195621.html 一文看懂pandas的透视表pivot_table 一.概述 1.1 什么是透视表 ...

随机推荐

TP 控制器和模型里面order 写法不同
控制器: Db::table('think_user')->where('status=1')->order('id desc')->limit(5)->select(); ...
NodeJS包管理器之Yarn
一.安装yarn 首选必须先安装好NodeJS,之后可以去yarn官网下载mis安装程序安装.由于NodeJS自带了一个包管理器npm,所以安装yarn更简单的方法是直接使用npm install - ...
Incorrect string value: '\xF0\x9F\x98\xAD",...' for column 'commentContent' at row 1
我分为下面三个步骤来解决: (1):修改mysql数据库的编码为uft8mb4 (2):修改数据表的编码为utf8mb4 (3):修改连接数据库的连接代码更改数据库编码:ALTER DATABASE ...
Spring使用多个 <context:property-placeholder/>
Spring中报"Could not resolve placeholder"的解决方案(引入多个properties文件) 解决方案: (1) 在Spring 3.0中,可以写: ...
vue-cli webpack打包开启Gzip 报错—— Cannot find module 'compression-webpack-plugin
异常描述: 复用以前框架,打包的时候报异常提示: Cannot find module 'compression-webpack-plugin" 然后安装插件: npm install -- ...
Algorithm测试
Algorithm测试 #include <algorithm> #include <vector> #include <iostream> using names ...
Vue基础知识学习笔记
一.环境搭建1.安装nodejs ((https://nodejs.org/en/)2.安装脚手架 npm install --global vue-cli /cnmp install --globa ...
Operation之变换操作符
buffer buffer方法的作用是缓冲组合, 第一个参数是缓冲时间, 第二个参数是缓冲个数, 第三个参数是线程该方法简单来说就是缓存Observable中发出的新元素, 当元素达到某个数量, 或 ...
openlive writer 汉化方法
使用方法: 替换你安装openlive writer 位置C:\Users\Administrator\AppData\Local\OpenLiveWriter的 app目录下的 OpenLiveW ...
selenium===使用docker搭建selenium分布式测试环境
准备: #请在此之前先了解,selenium grid :参考:selenium-grid ,下载地址,win-本地部署过程 >>>环境准备: Linux操作系统 >>& ...

pandas-10 pd.pivot_table()透视表功能

pandas-10 pd.pivot_table()透视表功能

pandas-10 pd.pivot_table()透视表功能的更多相关文章

随机推荐

热门专题