pandas练习（二）------ 数据过滤与排序

数据过滤与排序------探索2012欧洲杯数据

步骤1 - 导入pandas库

import pandas as pd

步骤2 - 数据集

path2 = "./data/Euro2012.csv"      # Euro2012.csv

步骤3 - 将数据集命名为euro12

euro12 = pd.read_csv(path2)

euro12.tail()

输出：

步骤4 选取 `Goals` 这一列

euro12.Goals  # euro12['Goals']

输出：

步骤5 有多少球队参与了2012欧洲杯？

euro12.shape[0]

输出：

步骤6 该数据集中一共有多少列(columns)?

euro12.info()

输出：

<class 'pandas.core.frame.DataFrame'>

RangeIndex: 16 entries, 0 to 15

Data columns (total 35 columns):

Team                          16 non-null object

Goals                         16 non-null int64

Shots on target               16 non-null int64

Shots off target              16 non-null int64

Shooting Accuracy             16 non-null object

% Goals-to-shots              16 non-null object

Total shots (inc. Blocked)    16 non-null int64

Hit Woodwork                  16 non-null int64

Penalty goals                 16 non-null int64

Penalties not scored          16 non-null int64

Headed goals                  16 non-null int64

Passes                        16 non-null int64

Passes completed              16 non-null int64

Passing Accuracy              16 non-null object

Touches                       16 non-null int64

Crosses                       16 non-null int64

Dribbles                      16 non-null int64

Corners Taken                 16 non-null int64

Tackles                       16 non-null int64

Clearances                    16 non-null int64

Interceptions                 16 non-null int64

Clearances off line           15 non-null float64

Clean Sheets                  16 non-null int64

Blocks                        16 non-null int64

Goals conceded                16 non-null int64

Saves made                    16 non-null int64

Saves-to-shots ratio          16 non-null object

Fouls Won                     16 non-null int64

Fouls Conceded                16 non-null int64

Offsides                      16 non-null int64

Yellow Cards                  16 non-null int64

Red Cards                     16 non-null int64

Subs on                       16 non-null int64

Subs off                      16 non-null int64

Players Used                  16 non-null int64

dtypes: float64(1), int64(29), object(5)

memory usage: 4.5+ KB

步骤7 将数据集中的列Team, Yellow Cards和Red Cards单独存为一个名叫discipline的数据框

discipline = euro12[['Team', 'Yellow Cards', 'Red Cards']]

discipline

输出：

步骤8 对数据框discipline按照先Red Cards再Yellow Cards进行排序

discipline.sort_values(['Red Cards', 'Yellow Cards'], ascending = False)

输出：

步骤9 计算每个球队拿到的黄牌数的平均值

round(discipline['Yellow Cards'].mean())

输出：

7.0

步骤10 找到进球数Goals超过6的球队数据

euro12[euro12.Goals > 6]

输出：

步骤11 选取以字母G开头或以e结尾的球队数据

# euro12[euro12.Team.str.startswith('G')]

euro12[euro12.Team.str.endswith('e')]  # 以字母e结束的球队

输出：

步骤12 选取前7列

euro12.iloc[: , 0:7]

输出：

步骤13 选取除了最后3列之外的全部列

euro12.iloc[: , :-3]

输出：

步骤14 找到英格兰(England)、意大利(Italy)和俄罗斯(Russia)的命中率(Shooting Accuracy)

euro12.loc[euro12.Team.isin(['England', 'Italy', 'Russia']), ['Team','Shooting Accuracy']]

输出：

参考链接：

1、http://pandas.pydata.org/pandas-docs/stable/cookbook.html#cookbook

2、https://www.analyticsvidhya.com/blog/2016/01/12-pandas-techniques-python-data-manipulation/

3、https://github.com/guipsamora/pandas_exercises

pandas练习（二）------ 数据过滤与排序的更多相关文章

Vue 基本列表 && 数据过滤与排序
1 <!DOCTYPE html> 2 <html> 3 <head> 4 <meta charset="UTF-8" /> 5 & ...
pandas之DateFrame 数据过滤+遍历行+读写csv-txt-excel
# XLS转CSV df = pd.read_excel(r'列表.xls') df2 = pd.DataFrame()df2 = df2.append(list(df['列名']), ignore_ ...
Oracle学习(二)：过滤和排序
1.知识点:能够对比以下的录屏进行阅读 SQL> --字符串大写和小写敏感 SQL> --查询名叫KING的员工信息 SQL> select * 2 from emp 3 where ...
python 数据清洗之数据合并、转换、过滤、排序
前面我们用pandas做了一些基本的操作,接下来进一步了解数据的操作, 数据清洗一直是数据分析中极为重要的一个环节. 数据合并在pandas中可以通过merge对数据进行合并操作. import n ...
[数据清洗]- Pandas 清洗“脏”数据（二）
概要了解数据分析数据问题清洗数据整合代码了解数据在处理任何数据之前,我们的第一任务是理解数据以及数据是干什么用的.我们尝试去理解数据的列/行.记录.数据格式.语义错误.缺失的条目以及错误的 ...
mysql必知必会(四、检索数据，五、排序检索数据，六、过滤数据，七、数据过滤)
四.select语句 1.检索单个列 select prod_name from products; 2.检索多个列 select prod_name, prod_price from product ...
[数据清洗]-使用 Pandas 清洗“脏”数据
概要准备工作检查数据处理缺失数据添加默认值删除不完整的行删除不完整的列规范化数据类型必要的转换重命名列名保存结果更多资源 Pandas 是 Python 中很流行的类库,使用它可 ...
[数据清洗]- Pandas 清洗“脏”数据（三）
预览数据这次我们使用 Artworks.csv ,我们选取 100 行数据来完成本次内容.具体步骤: 导入 Pandas 读取 csv 数据到 DataFrame(要确保数据已经下载到指定路径) D ...
Oracle01——基本查询、过滤和排序、单行函数、多行函数和多表查询
作者: kent鹏转载请注明出处: http://www.cnblogs.com/xieyupeng/p/7272236.html Oracle的集群 Oracle的体系结构 SQL> --当 ...

随机推荐

State Server实现多机器多站点 Session 共享全手记
网络环境有2台windows 2008 (192.168.1.71,192.168.1.72) 需要部署成 WebFarm,提高容错性. 网站部署在2台机器上的2个站点,如何才能做到Session的共 ...
windows10 自带笔记本键盘禁止和开启
管理员打开cmd,输入sc config i8042prt start= disabled 然后重启就好了,注意 =后面有个空格. 恢复:sc config i8042prt start= deman ...
python获取windows所有com口
import serial import serial.tools.list_ports port_list = list(serial.tools.list_ports.comports()) po ...
微信小程序---示例DEMO
转:CSDN的文章: https://blog.csdn.net/rolan1993/article/details/73467867 不错的DEMO: https://github.com/skyv ...
Shell脚本导入外部脚本内容
vim subscript.sh #!/bin/bash tool="ApacheSpark" vim main.sh #!/bin/bash source /home/wx/su ...
Java.Util.List（List接口）
equals方法 equals(Object o) 方法用来比较指定的对象与列表是否相等,当且仅当指定的对象也是一个列表.两个列表有相同的大小,并且两个列表中的所有相应的元素对相等时才返回 true. ...
php无限极分类递归与普通
1. 递归 public function getInfo(){$data=$this->select();$arr=$this->noLimit($data,$f_id=0,$level ...
Python开发一个多并发的FTP SERVER
允许同时支持多用户在线用户认证用户空间配额权限限制可上传下载,上传下载中显示进度条用户可远程切换目录,查看服务端文件列表等可断点续传
拓扑_dfs——找最小环
今天在题库发现了一个wa了很久还没调过的题,这个题呢是2015年noip的day1t2,莫名感觉难度上升(其实水的一匹). 这道题输出是3,其实就是一个图中让你找最小环,尽管我不会找环,但是要是我的话 ...
Redis主从同步及哨兵原理
1.复制过程复制过程大致分为6个过程: 流程图如下: 1)保存主节点信息执行slaveof后从节点只保存主节点的地址信息便直接返回,这时建立复制流程还没有开始,在从节点执行info replica ...

pandas练习（二）------ 数据过滤与排序

数据过滤与排序------探索2012欧洲杯数据

步骤1 - 导入pandas库

步骤2 - 数据集

步骤3 - 将数据集命名为euro12

步骤4 选取 Goals 这一列

步骤5 有多少球队参与了2012欧洲杯？

步骤6 该数据集中一共有多少列(columns)?

步骤7 将数据集中的列Team, Yellow Cards和Red Cards单独存为一个名叫discipline的数据框

步骤8 对数据框discipline按照先Red Cards再Yellow Cards进行排序

步骤9 计算每个球队拿到的黄牌数的平均值

步骤10 找到进球数Goals超过6的球队数据

步骤11 选取以字母G开头或以e结尾的球队数据

步骤12 选取前7列

步骤13 选取除了最后3列之外的全部列

步骤14 找到英格兰(England)、意大利(Italy)和俄罗斯(Russia)的命中率(Shooting Accuracy)

参考链接：

pandas练习（二）------ 数据过滤与排序的更多相关文章

随机推荐

热门专题

步骤4 选取 `Goals` 这一列