pandas.Series

class pandas.Series(data=None, index=None, dtype=None, name=None, copy=False, fastpath=False)

One-dimensional ndarray with axis labels (including time series).

Labels need not be unique but must be any hashable type. The object supports both integer- and label-based indexing and provides a host of methods for performing operations involving the index. Statistical methods from ndarray have been overridden to automatically exclude missing data (currently represented as NaN)

Operations between Series (+, -, /, , *) align values based on their associated index values– they need not be the same length. The result index will be the sorted union of the two indexes.

Parameters :	data : array-like, dict, or scalar value Contains data stored in Series index : array-like or Index (1d) Values must be unique and hashable, same length as data. Index object (or other iterable of same length as data) Will default to np.arange(len(data)) if not provided. If both a dict and index sequence are used, the index will override the keys found in the dict. dtype : numpy.dtype or None If None, dtype will be inferred copy : boolean, default False Copy input data

Parameters :

data : array-like, dict, or scalar value

Contains data stored in Series

index : array-like or Index (1d)

Values must be unique and hashable, same length as data. Index object (or other iterable of same length as data) Will default to np.arange(len(data)) if not provided. If both a dict and index sequence are used, the index will override the keys found in the dict.

dtype : numpy.dtype or None

If None, dtype will be inferred

copy : boolean, default False

Copy input data

Series 类似数组，但是它有标签(label) 或者索引(index).

1. 从最简单的series开始看。

from pandas import Series, DataFrame

import pandas as pd

ser1 = Series([1,2,3,4])

print(ser1)

#0    1

#1    2

#2    3

#3    4

#dtype: int64

此时因为没有设置index,所以用默认

2. 加上索引

ser2 = Series(range(4),index=['a','b','c','d'])

print(ser2)

#a    0

#b    1

#c    2

#d    3

#dtype: int64

3. dictionnary 作为输入

dict1 = {'ohio':35000,'Texas':71000,'Oregon':1600,'Utah':500}

ser3 = Series(dict1)

#Oregon     1600

#Texas     71000

#Utah        500

#ohio      35000

#dtype: int64

key：默认设置为index

dict1 = {'ohio':35000,'Texas':71000,'Oregon':1600,'Utah':500}

ser3 = Series(dict1)

#Oregon     1600

#Texas     71000

#Utah        500

#ohio      35000

#dtype: int64

print(ser3)

states = ['California', 'Ohio', 'Oregon', 'Texas']

ser4 = Series(dict1,index = states)

print(ser4)

#California        NaN

#Ohio              NaN

#Oregon         1600.0

#Texas         71000.0

#dtype: float64

用了dictionary时候，也是可以特定的制定index的，当没有map到value的时候，给NaN.

print(pd.isnull(ser4))

#California     True

#Ohio           True

#Oregon        False

#Texas         False

#dtype: bool

函数isnull判断是否为null

print(pd.isnull(ser4))

#California     True

#Ohio           True

#Oregon        False

#Texas         False

#dtype: bool

函数notnull判断是否为非null

print(pd.notnull(ser4))

#California    False

#Ohio          False

#Oregon         True

#Texas          True

#dtype: bool

4. 访问元素和索引用法

print (ser2['a']) #

#print (ser2['a','c']) error

print (ser2[['a','c']])

#a    0

#c    2

#dtype: int64

print(ser2.values) #[0 1 2 3]

print(ser2.index) #Index(['a', 'b', 'c', 'd'], dtype='object')

5. 运算， pandas的series保留Numpy的数组操作

print(ser2[ser2>2])

#d    3

#dtype: int64

print(ser2*2)

#a    0

#b    2

#c    4

#d    6

#dtype: int64

print(np.exp(ser2))

#a     1.000000

#b     2.718282

#c     7.389056

#d    20.085537

#dtype: float64

6. series 的自动匹配，这个有点类似sql中的full join，会基于索引键链接，没有的设置为null

print (ser3+ser4)

#California         NaN

#Ohio               NaN

#Oregon          3200.0

#Texas         142000.0

#Utah               NaN

#ohio               NaN

#dtype: float64

7. series对象和索引都有一个name属性

ser4.index.name = 'state'

ser4.name = 'population count'

print(ser4)

#state

#California        NaN

#Ohio              NaN

#Oregon         1600.0

#Texas         71000.0

#Name: population count, dtype: float64

8.预览数据

print(ser4.head(2))

print(ser4.tail(2))

#state

#California   NaN

#Ohio         NaN

#Name: population count, dtype: float64

#state

#Oregon     1600.0

#Texas     71000.0

#Name: population count, dtype: float64

Python Pandas -- Series的更多相关文章

python. pandas(series,dataframe,index) method test
python. pandas(series,dataframe,index,reindex,csv file read and write) method test import pandas as ...
python pandas.Series&&DataFrame&& set_index&reset_index
参考CookBook :http://pandas.pydata.org/pandas-docs/stable/cookbook.html Pandas set_index&reset_ind ...
python pandas ---Series,DataFrame 创建方法,操作运算操作(赋值,sort,get,del,pop,insert,+,-,*,/)
pandas 是基于 Numpy 构建的含有更高级数据结构和工具的数据分析包 pandas 也是围绕着 Series 和 DataFrame 两个核心数据结构展开的, 导入如下: from panda ...
Python pandas 0.19.1 Intro to Data Structures 数据结构介绍文档翻译
官方文档链接http://pandas.pydata.org/pandas-docs/stable/dsintro.html 数据结构介绍我们将以一个快速的.非全面的pandas的基础数据结构概述来 ...
Python pandas学习总结
本来打算学习pandas模块,并写一个博客记录一下自己的学习,但是不知道怎么了,最近好像有点急功近利,就想把别人的东西复制过来,当心沉下来,自己自觉地将原本写满的pandas学习笔记删除了,这次打算写 ...
pandas.Series
1.系列(Series)是能够保存任何类型的数据(整数,字符串,浮点数,Python对象等)的一维标记数组.轴标签统称为索引. Pandas系列可以使用以下构造函数创建 - pandas.Series ...
Python pandas快速入门
Python pandas快速入门2017年03月14日 17:17:52 青盏阅读数:14292 标签: python numpy 数据分析更多个人分类: machine learning 来 ...
Python pandas & numpy 笔记
记性不好,多记录些常用的东西,真·持续更新中::先列出一些常用的网址: 参考了的莫烦python pandas DOC numpy DOC matplotlib 常用习惯上我们如此导入: impo ...
【跟着stackoverflow学Pandas】 - Adding new column to existing DataFrame in Python pandas - Pandas 添加列
最近做一个系列博客,跟着stackoverflow学Pandas. 以 pandas作为关键词,在stackoverflow中进行搜索,随后安照 votes 数目进行排序: https://stack ...

随机推荐

svn跨多个版本比较
由于一些原因某个路径下的 svn revision 不是连续的,在比对时需要跨多个版本进行比较,使用下面步骤 Show_log -> 按住 ctrl 键选择不同版本 -> 右键 -> ...
数据结构_Summary
问题描述可怜的 Bibi 丢了好几台手机以后,看谁都像是小偷,他已经在小本本上记下了他认为的各个地点的小偷数量.现在我们将 Bibi 的家附近的地形抽象成一棵有根树. 每个地点都是树上的一个节点,节 ...
undefined reference to `clock_gettime
下面这个错误通常是因为链接选项里漏了-lrt,但有时发现即使加了-lrt仍出现这个问题,使用nm命令一直,会发现-lrt最终指向的文件没有包含任何symbol,这个时候,可以找相应的静态库版本libr ...
Linux sogou input method
afda@afda-Y720-15IKB:~$ wget "http://pinyin.sogou.com/linux/download.php?f=linux&bit=64&quo ...
cross validation
k-folder cross-validation:k个子集,每个子集均做一次测试集,其余的作为训练集.交叉验证重复k次,每次选择一个子集作为测试集,并将k次的平均交叉验证识别正确率作为结果.优点:所 ...
没固定公网 IP 的公司内网实现动态域名解析（阿里云万网解析）
情景说明前段时间应公司需求,需要将内网的服务映射到公网.由于公司使用的是类似家庭宽带的线路,没有固定的公网 IP 地址,所以决定使用域名来完成. 当时有几种方案: 1.花生壳:但是目前需要乱七八糟的 ...
php写的非常简单的文件浏览器
php写的非常简单的一个文件浏览器,仅供参考. <?php /** * php文件浏览程序函数 showDir() * * $dirName 输入目录路径,默认php文件一级目录,不需输入: * ...
iOS开发之蓝牙使用-建立连接的
1.大佬笔记 CSDN 2.代码 github

Python Pandas -- Series

pandas.Series

Python Pandas -- Series的更多相关文章

随机推荐

热门专题