Pairwise Plot

https://datawhalechina.github.io/pms50/#/chapter9/chapter9

Import the required libraries

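import numpy as np              # NumPy
import pandas as pd             # pandas
import matplotlib as mpl        # matplotlib
import matplotlib.pyplot as plt
import seaborn as sns           # seaborn
# Show figures inline in a Jupyter notebook. The comment sits on its own line
# because a trailing comment can be parsed as arguments to the magic.
%matplotlib inline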

Set the figure properties

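large = 22; med = 16; small = 12
params = {'axes.titlesize': large,    # font size of subplot titles
          'legend.fontsize': med,     # font size of the legend
          'figure.figsize': (16, 10), # figure size
          'axes.labelsize': med,      # font size of axis labels
          'xtick.labelsize': med,     # font size of x-axis tick labels
          'ytick.labelsize': med,     # font size of y-axis tick labels
          'figure.titlesize': large}  # font size of the figure title
#plt.rcParams.update(params)           # update the default properties
plt.style.use('seaborn-whitegrid')    # overall style; matplotlib 3.6+ names it 'seaborn-v0_8-whitegrid'
sns.set_style("white")                # overall background style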
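Because the `plt.rcParams.update(params)` line above is commented out, the `params` dictionary has no effect as written. The following is a minimal sketch of one way to apply it temporarily, assuming `plt.rc_context` is acceptable in place of a global update. That is an illustrative choice, not the original author's method.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt

large, med = 22, 16
params = {
    "axes.titlesize": large,
    "legend.fontsize": med,
    "figure.figsize": (16, 10),
    "axes.labelsize": med,
    "xtick.labelsize": med,
    "ytick.labelsize": med,
    "figure.titlesize": large,
}

before = plt.rcParams["axes.titlesize"]      # value in effect before the block

with plt.rc_context(params):                 # settings apply only inside this block
    inside = plt.rcParams["axes.titlesize"]
    fig, ax = plt.subplots()                 # picks up figure.figsize from params
    figsize_inside = tuple(fig.get_size_inches())
    plt.close(fig)

after = plt.rcParams["axes.titlesize"]       # restored once the block exits
```

Uncommenting `plt.rcParams.update(params)` instead would change the defaults for the rest of the session.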

Code

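# step 1: load the data
df = sns.load_dataset('iris')
# step 2: draw the pairwise plot
# canvas (note: pairplot creates its own figure, so this call does not size the grid)
plt.figure(figsize = (12, 10),    # figure size (12, 10)
           dpi = 80)              # resolution 80
# pairwise plot
sns.pairplot(df,                                     # data to plot
             kind = 'scatter',                       # kind of plot: scatter
             hue = 'species',                        # category column; each category gets a different color
             plot_kws = dict(s = 50,                 # marker size
                             edgecolor = 'white',    # marker edge color
                             linewidth = 2.5))       # marker edge width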

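# step 1: load the data
df = sns.load_dataset('iris')
# step 2: draw the pairwise plot
# canvas (note: pairplot creates its own figure, so this call does not size the grid)
plt.figure(figsize = (12, 10),    # figure size (12, 10)
           dpi = 80)              # resolution 80
# pairwise plot (scatter plots with fitted lines)
sns.pairplot(df,                                     # data to plot
             kind = 'reg',                           # kind of plot: reg
             hue = 'species')                        # category column; each category gets a different color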
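With `kind = 'reg'` and a `hue` column, each category gets its own fitted line in every off-diagonal panel. Seaborn's regression plots fit a linear model by default, so for a single panel the line should be close to an ordinary least-squares fit per group. The sketch below reproduces that idea with `np.polyfit` on made-up data. The column names and slopes are assumptions chosen for illustration, and this is not seaborn's internal code.

```python
import numpy as np
import pandas as pd

# Hypothetical data: three groups, each with a different linear trend plus noise
rng = np.random.default_rng(1)
x = rng.uniform(0, 10, size=90)
group = np.repeat(["a", "b", "c"], 30)
true_slope = {"a": 0.5, "b": 1.0, "c": 2.0}
y = np.array([true_slope[g] * xi for g, xi in zip(group, x)])
y = y + rng.normal(scale=0.1, size=90)
demo = pd.DataFrame({"x": x, "y": y, "group": group})

# Fit one straight line per group, which is what a per-hue regression line shows
fitted = {}
for name, part in demo.groupby("group"):
    slope, intercept = np.polyfit(part["x"], part["y"], deg=1)
    fitted[name] = (slope, intercept)
```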

Summary

seaborn.pairplot

seaborn.pairplot(data, hue=None, hue_order=None, palette=None, vars=None,
                 x_vars=None, y_vars=None, kind='scatter', diag_kind='auto',
                 markers=None, height=2.5, aspect=1, dropna=True,
                 plot_kws=None, diag_kws=None, grid_kws=None, size=None)

Plot pairwise relationships in a dataset.

By default, this function will create a grid of Axes such that each variable in data will be shared in the y-axis across a single row and in the x-axis across a single column.

The diagonal Axes are treated differently, drawing a plot to show the univariate distribution of the data for the variable in that column.

It is also possible to show a subset of variables or plot different variables on the rows and columns.

This is a high-level interface for PairGrid that is intended to make it easy to draw a few common styles. You should use PairGrid directly if you need more flexibility.
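As a rough illustration of the extra flexibility, the sketch below uses `PairGrid` directly to draw a different kind of plot in the upper triangle, the lower triangle, and the diagonal. The `demo` data is random and the choice of plotting functions is arbitrary.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns

# Hypothetical random data with three numeric columns
rng = np.random.default_rng(2)
demo = pd.DataFrame(rng.normal(size=(50, 3)), columns=["a", "b", "c"])

grid = sns.PairGrid(demo)
grid.map_diag(plt.hist)        # histogram of each variable on the diagonal
grid.map_upper(plt.scatter)    # plain scatter plots above the diagonal
grid.map_lower(sns.regplot)    # scatter plots with a fitted line below the diagonal
```

`pairplot` applies one `kind` to every off-diagonal panel, while the `map_*` methods let each region use its own function.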

Parameters: data: DataFrame

Tidy (long-form) dataframe where each column is a variable and each row is an observation.

hue: string (variable name), optional

Variable in data to map plot aspects to different colors.

hue_order: list of strings

Order for the levels of the hue variable in the palette

palette: dict or seaborn color palette

Set of colors for mapping the hue variable. If a dict, keys should be values in the hue variable.

vars: list of variable names, optional

Variables within data to use, otherwise use every column with a numeric datatype.

{x, y}_vars: lists of variable names, optional

Variables within data to use separately for the rows and columns of the figure; i.e. to make a non-square plot.

kind: {'scatter', 'reg'}, optional

Kind of plot for the non-identity relationships.

diag_kind: {'auto', 'hist', 'kde'}, optional

Kind of plot for the diagonal subplots. The default depends on whether "hue" is used or not.

markers: single matplotlib marker code or list, optional

Either the marker to use for all datapoints or a list of markers with a length the same as the number of levels in the hue variable so that differently colored points will also have different scatterplot markers.

height: scalar, optional

Height (in inches) of each facet.

aspect: scalar, optional

Aspect * height gives the width (in inches) of each facet.

dropna: boolean, optional

Drop missing values from the data before plotting.

{plot, diag, grid}_kws: dicts, optional

Dictionaries of keyword arguments.

Returns: grid: PairGrid

Returns the underlying PairGrid instance for further tweaking.

seaborn.load_dataset

seaborn.load_dataset(name, cache=True, data_home=None, **kws)

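Load a dataset from the online repository (requires an internet connection).

Parameters: name: string

Name of the dataset (name.csv on https://github.com/mwaskom/seaborn-data). You can get the list of available datasets with get_dataset_names().

cache: boolean, optional

If True, cache the data locally and use the cache in subsequent calls.

data_home: string, optional

Directory in which to store the cached data. By default, ~/seaborn-data/ is used.

kws: dict, optional

Passed on to pandas.read_csv.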
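Since `load_dataset` needs network access on the first call, the examples above fail when the machine is offline and nothing is cached. The sketch below wraps the call in a fallback. The helper names `synthetic_iris_like` and `load_iris_or_fallback` are hypothetical, and the fallback frame contains random placeholder values laid out in the iris columns, not the real measurements.

```python
import numpy as np
import pandas as pd
import seaborn as sns

# Column layout of the iris dataset as loaded by seaborn
IRIS_COLUMNS = ["sepal_length", "sepal_width", "petal_length", "petal_width", "species"]

def synthetic_iris_like(n_per_species=10, seed=0):
    """Hypothetical stand-in: random numbers in the iris column layout, not real data."""
    rng = np.random.default_rng(seed)
    species = np.repeat(["setosa", "versicolor", "virginica"], n_per_species)
    values = rng.uniform(0.1, 8.0, size=(len(species), 4))
    frame = pd.DataFrame(values, columns=IRIS_COLUMNS[:4])
    frame["species"] = species
    return frame

def load_iris_or_fallback():
    """Try the cached or online copy first; fall back to placeholder data on failure."""
    try:
        return sns.load_dataset("iris", cache=True)
    except Exception:  # broad on purpose: network and cache errors vary by version
        return synthetic_iris_like()

# Usage (performs a download on first use when online):
# df = load_iris_or_fallback()
```

The fallback is only useful for checking that the plotting code runs. Any conclusions about iris require the real dataset.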

Data Visualization Examples (11): Pairwise Plot (matplotlib, pandas)