Chapter 07 - Basic statistics (Part 4: t-tests and nonparametric tests of group differences)

I. t-tests

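This part uses the UScrime dataset, which ships with the MASS package. It covers 47 US states in 1960 and concerns the effect of punishment regimes on crime rates. The variables used here are: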

Prob: probability of imprisonment;

U1: unemployment rate of urban males aged 14 to 24;

U2: unemployment rate of urban males aged 35 to 39;

So: an indicator variable for Southern states.

1. Independent t-test

t.test(y~x,data)

t.test(y1,y2)

Example 01:

> library(MASS)

> t.test(Prob~So,data=UScrime)

	Welch Two Sample t-test

data:  Prob by So

t = -3.8954, df = 24.925, p-value = 0.0006506

alternative hypothesis: true difference in means is not equal to 0

95 percent confidence interval:

 -0.03852569 -0.01187439

sample estimates:

mean in group 0 mean in group 1

     0.03851265      0.06371269

Note: you can reject the hypothesis that Southern and non-Southern states have equal probabilities of imprisonment, since p < 0.001.
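The Welch statistic and degrees of freedom in the transcript above can be reproduced by hand. The following is a minimal sketch, assuming the MASS package is installed; the names g0, g1, t_manual and df_manual are illustrative and do not come from the original text.

```r
library(MASS)  # provides the UScrime data frame

# Split Prob by the So indicator (the two groups reported as 0 and 1)
g0 <- UScrime$Prob[UScrime$So == 0]
g1 <- UScrime$Prob[UScrime$So == 1]

# Per-group variance of the sample mean (variances are NOT pooled in Welch's test)
v0 <- var(g0) / length(g0)
v1 <- var(g1) / length(g1)

# Welch statistic: difference in means divided by the unpooled standard error
t_manual <- (mean(g0) - mean(g1)) / sqrt(v0 + v1)

# Welch-Satterthwaite approximation for the degrees of freedom
df_manual <- (v0 + v1)^2 / (v0^2 / (length(g0) - 1) + v1^2 / (length(g1) - 1))

# Reference result from t.test(), which defaults to var.equal = FALSE
res <- t.test(Prob ~ So, data = UScrime)

print(c(t_manual = t_manual, t_reported = unname(res$statistic)))
print(c(df_manual = df_manual, df_reported = unname(res$parameter)))
```

Both pairs of values should agree with the t and df lines of the transcript. Passing var.equal=TRUE to t.test() would switch to the pooled-variance test instead.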

2. Dependent t-test

t.test(y1,y2,paired=TRUE)

·y1 and y2 are numeric vectors for the two dependent groups.

Example 02:

> library(MASS)

> sapply(UScrime[c("U1","U2")],function(x)(c(mean=mean(x),sd=sd(x))))

           U1       U2

mean 95.46809 33.97872

sd   18.02878  8.44545

> with(UScrime,t.test(U1,U2,paired=TRUE))

	Paired t-test

data:  U1 and U2

t = 32.4066, df = 46, p-value < 2.2e-16

alternative hypothesis: true difference in means is not equal to 0

95 percent confidence interval:

 57.67003 65.30870

sample estimates:

mean of the differences

               61.48936
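A paired t-test is a one-sample t-test on the within-pair differences. The sketch below illustrates this, assuming MASS is installed; d, t_manual, paired and one_sample are illustrative names.

```r
library(MASS)  # provides the UScrime data frame

# Within-state differences between the two unemployment rates
d <- UScrime$U1 - UScrime$U2

# One-sample t statistic for H0: mean difference = 0
t_manual <- mean(d) / (sd(d) / sqrt(length(d)))

# The paired test and the one-sample test on d are the same test
paired <- with(UScrime, t.test(U1, U2, paired = TRUE))
one_sample <- t.test(d, mu = 0)

print(c(manual = t_manual,
        paired = unname(paired$statistic),
        one_sample = unname(one_sample$statistic)))
```

The degrees of freedom are the number of pairs minus one, which is why the transcript reports df = 46 for 47 states.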

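II. Nonparametric tests of group differences

1. Comparing two groups

If the two groups are independent, use the Wilcoxon rank sum test (also known as the Mann-Whitney U test) to assess whether the observations are sampled from the same probability distribution.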

wilcox.test(y~x,data)

wilcox.test(y1,y2)

Example 03:

> with(UScrime,by(Prob,So,median))

So: 0

[1] 0.038201

--------------------------------------------------------

So: 1

[1] 0.055552

> wilcox.test(Prob~So,data=UScrime)

	Wilcoxon rank sum test

data:  Prob by So

W = 81, p-value = 8.488e-05

alternative hypothesis: true location shift is not equal to 0
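The W statistic reported by wilcox.test() is the rank sum of the first group minus its smallest possible value. A minimal sketch under the assumption that MASS is installed; x, y, r and w_manual are illustrative names.

```r
library(MASS)  # provides the UScrime data frame

# First group = first level of So (0), second group = So == 1
x <- UScrime$Prob[UScrime$So == 0]
y <- UScrime$Prob[UScrime$So == 1]

# Rank all observations together, then sum the ranks of the first group
r <- rank(c(x, y))
rank_sum_x <- sum(r[seq_along(x)])

# Subtract the minimum possible rank sum for a group of this size
w_manual <- rank_sum_x - length(x) * (length(x) + 1) / 2

res <- wilcox.test(Prob ~ So, data = UScrime)
print(c(w_manual = w_manual, w_reported = unname(res$statistic)))
```

A small W means the first group mostly holds the lower ranks, which is consistent with the lower median shown for So: 0.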

Example 04 (dependent groups, Wilcoxon signed rank test):

> sapply(UScrime[c("U1","U2")],median)

U1 U2

92 34

> with(UScrime,wilcox.test(U1,U2,paired=TRUE))

	Wilcoxon signed rank test with continuity correction

data:  U1 and U2

V = 1128, p-value = 2.464e-09

alternative hypothesis: true location shift is not equal to 0
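The V statistic of the signed rank test is the sum of the ranks of the absolute differences, counted only over positive differences. The following sketch assumes MASS is installed; d and v_manual are illustrative names.

```r
library(MASS)  # provides the UScrime data frame

# Paired differences; zero differences are dropped before ranking
d <- UScrime$U1 - UScrime$U2
d <- d[d != 0]

# Rank the absolute differences (ties get average ranks),
# then sum the ranks that belong to positive differences
v_manual <- sum(rank(abs(d))[d > 0])

# Ties in the data trigger a warning about exact p-values; it is silenced here
res <- suppressWarnings(with(UScrime, wilcox.test(U1, U2, paired = TRUE)))
print(c(v_manual = v_manual, v_reported = unname(res$statistic)))
```

In the transcript above V = 1128 = 47 * 48 / 2, the largest value possible for 47 pairs, which means U1 exceeds U2 in every state.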

2. Comparing more than two groups

Kruskal-Wallis test (for independent groups):

kruskal.test(y~A,data)

·A: a grouping variable with two or more levels; with just two levels the test is equivalent to the Mann-Whitney U test;

·y: a numeric outcome variable;
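The two-level equivalence mentioned above can be checked on the UScrime data. This is a sketch assuming MASS is installed; kw and mw are illustrative names. The match holds against the normal approximation of the rank sum test without continuity correction, so the p-value differs slightly from the exact one that wilcox.test() reports by default.

```r
library(MASS)  # provides the UScrime data frame

# Kruskal-Wallis test with a two-level grouping variable
kw <- kruskal.test(Prob ~ So, data = UScrime)

# Rank sum test using the normal approximation, no continuity correction
mw <- wilcox.test(Prob ~ So, data = UScrime, exact = FALSE, correct = FALSE)

print(c(kruskal = kw$p.value, wilcoxon = mw$p.value))
```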

Friedman test (for dependent groups, such as repeated measures or a randomized block design):

friedman.test(y~A|B,data)

·B: a blocking variable that identifies matched observations.
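As a sketch of the y~A|B formula, the example below runs friedman.test() on a small made-up data frame. The data and the name scores are hypothetical and only illustrate the expected layout: one row per observation, with each block measured once under every level of A.

```r
# Hypothetical scores: 5 subjects (blocks), each measured under 3 conditions
scores <- data.frame(
  y = c(10, 12, 15,
         8, 11, 13,
         9, 10, 14,
         7,  9, 12,
        11, 13, 16),
  A = factor(rep(c("a", "b", "c"), times = 5)),  # condition (grouping variable)
  B = factor(rep(1:5, each = 3))                 # subject (blocking variable)
)

# Observations are ranked within each block, then rank sums are compared across A
res <- friedman.test(y ~ A | B, data = scores)
print(res)
```

Here every subject ranks the conditions in the same order (a < b < c), so the rank sums are 5, 10 and 15 and the statistic reaches 10, the maximum for 5 blocks and 3 conditions, on 2 degrees of freedom.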

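The npmc() function in the npmc package performs nonparametric multiple comparisons. It expects a two-column data frame whose columns are named var (the dependent variable) and class (the grouping variable).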
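When npmc is not available, pairwise comparisons can be approximated with base R. The sketch below substitutes pairwise.wilcox.test() from the stats package for npmc(); it is a stand-in technique, not the npmc procedure. It uses the built-in state.region and state.x77 datasets, which the original text does not mention, and the names states, omnibus and pw are illustrative.

```r
# Combine the built-in region factor with the state-level measurements
states <- data.frame(state.region, state.x77)

# Omnibus test: does Illiteracy differ across the four regions?
omnibus <- kruskal.test(Illiteracy ~ state.region, data = states)

# Pairwise rank sum tests with Holm adjustment for multiple comparisons
# (ties trigger warnings about exact p-values; they are silenced here)
pw <- suppressWarnings(
  pairwise.wilcox.test(states$Illiteracy, states$state.region,
                       p.adjust.method = "holm")
)

print(omnibus)
print(pw$p.value)  # lower-triangular matrix of adjusted p-values
```

The omnibus test only says that at least one region differs; the pairwise matrix indicates which pairs of regions drive the difference.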
