An analysis of the pandas.DataFrame object
The pandas.DataFrame object type
df = pd.DataFrame([[1,"2",3,4],[5,"6",7,8]],columns=["a","b","c","d"])
Method analysis
1. add() method: works like the addition operator (the elements being added must hold data of the same type)
| add(self, other, axis='columns', level=None, fill_value=None)
| Addition of dataframe and other, element-wise (binary operator `add`).
|
| Equivalent to ``dataframe + other``, but with support to substitute a fill_value for
| missing data in one of the inputs.
|
| Parameters
| ----------
| other : Series, DataFrame, or constant
| axis : {0, 1, 'index', 'columns'}
| For Series input, axis to match Series index on
| level : int or name
| Broadcast across a level, matching Index values on the
| passed MultiIndex level
| fill_value : None or float value, default None
| Fill existing missing (NaN) values, and any new element needed for
| successful DataFrame alignment, with this value before computation.
| If data in both corresponding DataFrame locations is missing
| the result will be missing
The pandas.DataFrame.add method
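A minimal sketch of what `fill_value` changes, compared with the plain `+` operator. The frames `df1` and `df2` are hypothetical and exist only for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical frames; df2 has one missing value in column "b".
df1 = pd.DataFrame({"a": [1, 2], "b": [3, 4]})
df2 = pd.DataFrame({"a": [10, 20], "b": [30, np.nan]})

# Plain addition: the missing value propagates into the result.
plain = df1 + df2

# add() with fill_value: the missing value is replaced by 0 before adding.
filled = df1.add(df2, fill_value=0)

print(plain)
print(filled)
```

If a value is missing in both frames at the same position, the result stays missing, as the docstring above states.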
2. aggregate() method: can be abbreviated as agg()
aggregate(self, func, axis=0, *args, **kwargs)
| Aggregate using one or more operations over the specified axis.
|
| .. versionadded:: 0.20.0
|
| Parameters
| ----------
| func : function, string, dictionary, or list of string/functions
| Function to use for aggregating the data. If a function, must either
| work when passed a DataFrame or when passed to DataFrame.apply. For
| a DataFrame, can pass a dict, if the keys are DataFrame column names.
|
| Accepted combinations are:
|
| - string function name.
| - function.
| - list of functions.
| - dict of column names -> functions (or list of functions).
The pandas.DataFrame.aggregate method
example:
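# coding=utf-8
import pandas as pd
import numpy as np
# A Series mixing int and str values, so its dtype is object
ds = pd.Series([11, "2", 13, 14])
print(ds, "\n")
# Column "b" holds strings, so its "sum" is string concatenation ("2" + "6")
df = pd.DataFrame([[1, "2", 3, 4], [5, "6", 7, 8]], columns=["a", "b", "c", "d"])
print(df, "\n")
# Apply several aggregations to every column, then to column "a" only
print(df.agg(['sum', 'min']))
print(df.agg({"a": ['sum', 'min']}))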
output:
0    11
1     2
2    13
3    14
dtype: object

   a  b  c  d
0  1  2  3  4
1  5  6  7  8

     a   b   c   d
sum  6  26  10  12
min  1   2   3   4
     a
sum  6
min  1
Commonly used aggregation functions (`mean`, `median`, `prod`, `sum`, `std`, `var`)
mad(self, axis=None, skipna=None, level=None)
Return the mean absolute deviation of the values for the requested axis
max(self, axis=None, skipna=None, level=None, numeric_only=None, **kwargs)
This method returns the maximum of the values in the object. If you want the *index* of the maximum, use ``idxmax``. This is the equivalent of the ``numpy.ndarray`` method ``argmax``.
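A short sketch of the difference between `max` and `idxmax`, using a hypothetical frame with string row labels:

```python
import pandas as pd

# Hypothetical data with labelled rows.
df = pd.DataFrame({"a": [1, 5, 3], "c": [9, 7, 8]}, index=["x", "y", "z"])

col_max = df.max()        # largest value in each column
col_idxmax = df.idxmax()  # row label at which each column reaches its maximum

print(col_max)
print(col_idxmax)
```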
mean(self, axis=None, skipna=None, level=None, numeric_only=None, **kwargs)
Return the mean of the values for the requested axis
median(self, axis=None, skipna=None, level=None, numeric_only=None, **kwargs)
Return the median of the values for the requested axis
min(self, axis=None, skipna=None, level=None, numeric_only=None, **kwargs)
This method returns the minimum of the values in the object.
memory_usage(self, index=True, deep=False)
Return the memory usage of each column in bytes.
merge(self, right, how='inner', on=None, left_on=None, right_on=None, left_index=False, right_index=False, sort=False, suffixes=('_x', '_y'), copy=True, indicator=False, validate=None)
Merge DataFrame objects by performing a database-style join operation by columns or indexes.
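A minimal sketch of a database-style join with `merge`. The frames and the `key` column are hypothetical.

```python
import pandas as pd

# Hypothetical tables sharing a "key" column.
left = pd.DataFrame({"key": ["k1", "k2", "k3"], "x": [1, 2, 3]})
right = pd.DataFrame({"key": ["k2", "k3", "k4"], "y": [20, 30, 40]})

# Default how='inner': only keys present in both frames are kept.
inner = left.merge(right, on="key")

# how='outer' keeps every key; indicator=True adds a "_merge" column
# recording where each row came from.
outer = left.merge(right, on="key", how="outer", indicator=True)

print(inner)
print(outer)
```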
align(self, other, join='outer', axis=None, level=None, copy=True, fill_value=None, method=None, limit=None, fill_axis=0, broadcast_axis=None):
Align two objects on their axes with the specified join method for each axis Index
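As a rough illustration, `align` returns both objects reindexed to a common index. The frames below are hypothetical.

```python
import pandas as pd

# Hypothetical frames whose indexes only partly overlap.
df1 = pd.DataFrame({"v": [1, 2]}, index=[0, 1])
df2 = pd.DataFrame({"v": [10, 20]}, index=[1, 2])

# Outer join on the row axis: both results carry the union of the indexes,
# with NaN where a frame had no row.
a1, a2 = df1.align(df2, join="outer", axis=0)

# Inner join on the row axis: both results keep only the shared labels.
i1, i2 = df1.align(df2, join="inner", axis=0)

print(a1)
print(a2)
```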
all(self, axis=None, bool_only=None, skipna=None, level=None, **kwargs):
Return whether all elements are True over series or dataframe axis.
any(self, axis=None, bool_only=None, skipna=None, level=None, **kwargs):
Return whether any element is True over requested axis.
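A small sketch of `all` and `any` on a hypothetical boolean frame:

```python
import pandas as pd

# Hypothetical boolean data.
df = pd.DataFrame({"p": [True, True], "q": [True, False], "r": [False, False]})

all_cols = df.all()        # per column: is every value True?
any_cols = df.any()        # per column: is at least one value True?
any_rows = df.any(axis=1)  # per row: is at least one value True?

print(all_cols)
print(any_cols)
print(any_rows)
```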
apply(self, func, axis=0, broadcast=None, raw=False, reduce=None, result_type=None, args=(), **kwds):
Apply a function along an axis of the DataFrame.
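A minimal sketch of how the `axis` argument changes what `apply` passes to the function. The frame is hypothetical.

```python
import pandas as pd

# Hypothetical numeric frame.
df = pd.DataFrame({"a": [1, 5], "c": [3, 7], "d": [4, 8]})

# axis=0 (default): the function receives each column as a Series.
col_range = df.apply(lambda col: col.max() - col.min())

# axis=1: the function receives each row as a Series.
row_sum = df.apply(lambda row: row.sum(), axis=1)

print(col_range)
print(row_sum)
```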
applymap(self, func):
Apply a function to a DataFrame elementwise. This method applies a function that accepts and returns a scalar to every element of a DataFrame.
append(self, other, ignore_index=False, verify_integrity=False, sort=None):
Append rows of `other` to the end of this frame, returning a new object. Columns not in this frame are added as new columns.
assign(self, **kwargs):
Assign new columns to a DataFrame, returning a new object (a copy) with the new columns added to the original ones. Existing columns that are re-assigned will be overwritten.
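A short sketch showing that `assign` returns a copy and overwrites re-assigned columns. The frame and the new column name `e` are hypothetical.

```python
import pandas as pd

# Hypothetical frame.
df = pd.DataFrame({"a": [1, 5], "c": [3, 7]})

# Add a new column "e" and overwrite the existing column "a".
df2 = df.assign(e=df["a"] + df["c"], a=df["a"] * 10)

print(df)   # the original frame is unchanged
print(df2)
```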
insert(self, loc, column, value, allow_duplicates=False)
Insert column into DataFrame at specified location.
combine(self, other, func, fill_value=None, overwrite=True):
Add two DataFrame objects and do not propagate NaN values, so if for a (column, time) one frame is missing a value, it will default to the other frame's value (which might be NaN as well)
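As a sketch of how `combine` uses its `func` argument: the function receives the matching column from each frame as a pair of Series and returns the column to keep. The frames and the helper `take_smaller` are hypothetical.

```python
import pandas as pd

# Hypothetical frames with the same columns.
df1 = pd.DataFrame({"A": [0, 0], "B": [4, 4]})
df2 = pd.DataFrame({"A": [1, 1], "B": [3, 3]})

def take_smaller(s1, s2):
    """Keep whichever column has the smaller sum (illustrative helper)."""
    return s1 if s1.sum() < s2.sum() else s2

combined = df1.combine(df2, take_smaller)
print(combined)
```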
count(self, axis=0, level=None, numeric_only=False):
Count non-NA cells for each column or row.
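A minimal sketch of `count` along each axis, using a hypothetical frame with missing values:

```python
import numpy as np
import pandas as pd

# Hypothetical frame with missing values.
df = pd.DataFrame({"a": [1.0, np.nan, 3.0], "b": [np.nan, np.nan, 6.0]})

per_column = df.count()     # non-NA cells in each column
per_row = df.count(axis=1)  # non-NA cells in each row

print(per_column)
print(per_row)
```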
cov(self, min_periods=None):
Compute pairwise covariance of columns, excluding NA/null values.
drop(self, labels=None, axis=0, index=None, columns=None, level=None, inplace=False, errors='raise'):
Drop specified labels from rows or columns.
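A short sketch of dropping columns and rows by label. It reuses the frame from the start of this article; the label `"zzz"` is a hypothetical, non-existent column.

```python
import pandas as pd

df = pd.DataFrame([[1, "2", 3, 4], [5, "6", 7, 8]], columns=["a", "b", "c", "d"])

no_b = df.drop(columns=["b"])  # drop a column by label
no_row0 = df.drop(index=[0])   # drop a row by label

# errors='ignore' suppresses the KeyError raised for a missing label.
unchanged = df.drop(columns=["zzz"], errors="ignore")

print(no_b)
print(no_row0)
```

Because `inplace` defaults to `False`, each call returns a new frame and leaves `df` as it was.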
drop_duplicates(self, subset=None, keep='first', inplace=False):
Return DataFrame with duplicate rows removed, optionally only considering certain columns
dropna(self, axis=0, how='any', thresh=None, subset=None, inplace=False)
Remove missing values.
duplicated(self, subset=None, keep='first')
Return boolean Series denoting duplicate rows, optionally only considering certain columns
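A minimal sketch of `duplicated` and `drop_duplicates` working together. The frame is hypothetical.

```python
import pandas as pd

# Hypothetical frame; rows 0 and 1 are identical.
df = pd.DataFrame({"k": ["x", "x", "y", "x"], "v": [1, 1, 2, 3]})

mask = df.duplicated()         # True for each repeat of an earlier row
dedup = df.drop_duplicates()   # keep the first occurrence of each row

# Consider only column "k" and keep the last occurrence of each key.
last_per_key = df.drop_duplicates(subset=["k"], keep="last")

print(mask)
print(dedup)
print(last_per_key)
```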
eq(self, other, axis='columns', level=None)
Wrapper for flexible comparison methods eq
eval(self, expr, inplace=False, **kwargs)
Evaluate a string describing operations on DataFrame columns.
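A rough sketch of `eval` with a plain expression and with an assignment expression. The frame and the column name `e` are hypothetical.

```python
import pandas as pd

# Hypothetical numeric frame.
df = pd.DataFrame({"a": [1, 5], "c": [3, 7]})

# A plain expression returns the computed Series.
total = df.eval("a + c")

# An assignment expression returns a new frame with the extra column,
# because inplace defaults to False.
with_e = df.eval("e = a * c")

print(total)
print(with_e)
```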
fillna(self, value=None, method=None, axis=None, inplace=False, limit=None, downcast=None, **kwargs)
Fill NA/NaN values using the specified method
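A minimal sketch contrasting `dropna` and `fillna` on a hypothetical frame with missing values:

```python
import numpy as np
import pandas as pd

# Hypothetical frame; row 1 is missing in both columns.
df = pd.DataFrame({"a": [1.0, np.nan, 3.0], "b": [np.nan, np.nan, 6.0]})

any_dropped = df.dropna()           # how='any': drop rows with any missing value
all_dropped = df.dropna(how="all")  # drop only rows where every value is missing

filled = df.fillna(0)                      # one replacement value everywhere
per_column = df.fillna({"a": -1, "b": 0})  # a separate value per column

print(any_dropped)
print(all_dropped)
print(per_column)
```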
ge(self, other, axis='columns', level=None)
Wrapper for flexible comparison methods ge
gt(self, other, axis='columns', level=None)
Wrapper for flexible comparison methods gt
le(self, other, axis='columns', level=None)
Wrapper for flexible comparison methods le
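The flexible comparison methods (`eq`, `ge`, `gt`, `le`, and `lt`) behave like the corresponding operators but also accept an `axis` argument. A short sketch with a hypothetical frame:

```python
import pandas as pd

# Hypothetical numeric frame.
df = pd.DataFrame({"a": [1, 5], "c": [3, 7]})

# Comparing with a scalar: equivalent to df > 3.
gt3 = df.gt(3)

# Comparing with a Series: axis='columns' matches the Series index
# against the column labels.
eq_cols = df.eq(pd.Series({"a": 1, "c": 7}), axis="columns")

print(gt3)
print(eq_cols)
```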
lt(self, other, axis='columns', level=None)
Wrapper for flexible comparison methods lt
get_value(self, index, col, takeable=False)
Quickly retrieve single value at passed column and index
info(self, verbose=None, buf=None, max_cols=None, memory_usage=None, null_counts=None)
Print a concise summary of a DataFrame.
isin(self, values)
Return boolean DataFrame showing whether each element in the DataFrame is contained in values.
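A minimal sketch of `isin` with a list and with a dict keyed by column name. The frame is hypothetical.

```python
import pandas as pd

# Hypothetical numeric frame.
df = pd.DataFrame({"a": [1, 5], "c": [3, 7]})

# A list is checked against every column.
in_list = df.isin([1, 7])

# A dict restricts the check to the named columns; other columns are all False.
in_dict = df.isin({"a": [5]})

print(in_list)
print(in_dict)
```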
isna(self)
Detect missing values. Return a boolean same-sized object indicating if the values are NA.
isnull(self)
Detect missing values. Return a boolean same-sized object indicating if the values are NA.
iteritems(self)
Iterator over (column name, Series) pairs.
iterrows(self)
Iterate over DataFrame rows as (index, Series) pairs.
itertuples(self, index=True, name='Pandas')
Iterate over DataFrame rows as namedtuples, with index value as first element of the tuple.
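A short sketch of the iteration methods on a hypothetical frame. It uses `items()` for column iteration, since `iteritems` is no longer available in recent pandas releases.

```python
import pandas as pd

# Hypothetical frame with labelled rows.
df = pd.DataFrame({"a": [1, 5], "b": ["2", "6"]}, index=["r0", "r1"])

# Column-wise: (column name, Series) pairs.
col_names = [name for name, col in df.items()]

# Row-wise: (index, Series) pairs.
rows = [(idx, row["a"]) for idx, row in df.iterrows()]

# Row-wise as namedtuples; the index value is available as .Index.
tuples = [(t.Index, t.a, t.b) for t in df.itertuples()]

print(col_names)
print(rows)
print(tuples)
```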
join(self, other, on=None, how='left', lsuffix='', rsuffix='', sort=False)
Join columns with other DataFrame either on index or on a key column. Efficiently join multiple DataFrame objects by index at once by passing a list.
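A minimal sketch of an index-based `join`. The frames are hypothetical.

```python
import pandas as pd

# Hypothetical frames indexed by key.
left = pd.DataFrame({"x": [1, 2]}, index=["k1", "k2"])
right = pd.DataFrame({"y": [10, 30]}, index=["k1", "k3"])

# Default how='left': keep the calling frame's index, NaN where right has no match.
joined = left.join(right)

# how='outer': keep the union of both indexes.
outer = left.join(right, how="outer")

print(joined)
print(outer)
```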