Regression analysis
Source: http://wenku.baidu.com/link?url=9KrZhWmkIDHrqNHiXCGfkJVQWGFKOzaeiB7SslSdW_JnXCkVHsHsXJyvGbDva4V5A-uuOl84mg5zkTECichHX_AsN0mZalfI9BzDFOeNe-G###
❤ Simple linear regression
1. Y = β0 + β1*X + e
where:
Y - dependent variable (response)
X - independent variable (predictor/explanatory)
β0 - intercept
β1 - slope of the regression line
e - random error
2. Y' = b0 + b1*X
where: Y' - predicted value of Y
e = Y - Y'
3. Least squarea regression minizes the sum of the square of the errors and can be used to estimate b0 and b1.
4. Measuring the fit of the estimated model.
- The varibility of Y
SST (Sum of Squared Total): total variability about the mean, SST = sum((Y - mean(Y))^2);
SSE (Sum of Squared Error): variability about the regression line, SSE = sum(e^2) = sum((Y - mean(Y'))^2), SSE is unexplained varibility;
SSR (Sum of Squares due to Regression): variability that is explained, SSR = sum((Y' - mean(Y))^2), SSR is explained varibility.
Note that SST = SSE + SSR.
- Coefficient of determination
r^2: proportion of explained variability by the regression equation.
0 <= r^2 = 1 - SSE/SST = SSR/SST <= 1
- Correlation coefficient
r: strength of the relationship between X and Y.
-1 <= r <= 1
5. Assumptions in the regression model
Errors are independent, normally distributed, with the mean of zero, with a constant variance.
The assumptions can be tested by using residual analysis.
6. MSE (Mean Squared Error)
Estimation of error variance of the regression equation.
s^2 = MSE = SSE / (n - k - 1)
where:
n - number of observations in the sample
k - number of independent variables
Standard deviation of the regression: s = sqrt(MSE) is also frequently used.
❤ Test the model for significance: F-test
Used to statistically test the null hypothesis H0: there is no linear relationship between Y and X (i.e. β1 = 0).
If p value is low, then we regect H0 and conclude there is linear relationship:
F = MSR / MSE
where: MSR = SSR / k
Good regression model should have significant F value and high r^2 value.
Statistical test can be performed on the regression coefficients. H0: the βs are 0.
For a simple linear regression, the test for regression coefficient gives the same information as the ones given by F-test.
❤ ANOVA tables
The general form of the ANOVA table is helpful for understanding the interrelatedness of error terms.
❤ Multiple regression
Similar to the simple regression model, but there are more than one X in the multiple regression models.
Y' = b0 + b1*X1 + b2*X2 + ... + bn*Xn
Note that if indenpendent variables is correlate to each other, colinearity or multicolinearity will happen. This will cause problems when intepreate variables individually although the overall model estimation may still be good.
Regression analysis的更多相关文章
- [ML学习笔记] 回归分析(Regression Analysis)
[ML学习笔记] 回归分析(Regression Analysis) 回归分析:在一系列已知自变量与因变量之间相关关系的基础上,建立变量之间的回归方程,把回归方程作为算法模型,实现对新自变量得出因变量 ...
- Regression Analysis Using Excel
Regression Analysis Using Excel Setup By default, data analysis add-in is not enabled. Follow the st ...
- Functional mechanism: regression analysis under differential privacy_阅读报告
Functional mechanism: regression analysis under differential privacy 论文学习报告 组员:裴建新 赖妍菱 周子玉 2020 ...
- 7 Types of Regression Techniques you should know!
翻译来自:http://news.csdn.net/article_preview.html?preview=1&reload=1&arcid=2825492 摘要:本文解释了回归分析 ...
- STA 463 Simple Linear Regression Report
STA 463 Simple Linear Regression ReportSpring 2019 The goal of this part of the project is to perfor ...
- regression | p-value | Simple (bivariate) linear model | 线性回归 | 多重检验 | FDR | BH | R代码
P122, 这是IQR method课的第一次作业,需要统计检验,x和y是否显著的有线性关系. Assignment 1 1) Find a small bivariate dataset (pref ...
- Multiple Regression
Multiple Regression What is multiple regression? Multiple regression is regression analysis with mor ...
- Correlation and Regression
Correlation and Regression Sample Covariance The covariance between two random variables is a statis ...
- 7 Types of Regression Techniques
https://www.analyticsvidhya.com/blog/2015/08/comprehensive-guide-regression/ What is Regression Anal ...
随机推荐
- iOS网络监测方法
方法一(官方): Reachability ============================================================================== ...
- Android 手机卫士--打包生成apk维护到服务器
项目打包生成apk过程: 1.生成签名文件,并且指定所在位置 2.使用生成的签名文件,给工程打包生成一个apk 本文地址:http://www.cnblogs.com/wuyudong/p/59033 ...
- 【原】iOS:一种直接修改frame的某个属性的方法
在iOS中view的frame属性使用地太频繁了,尤其是调UI的时候.我们知道,正常情况下我们无法对frame的某个属性(x,y,width,height等)进行单独修改,比如: someView.f ...
- SVN 使用锁实现独占式签出
SVN默认并行工作,但是自动合并又做得很渣.团队工作中,如果确实有一些文件希望独占式签出可以使用SVN的特别属性. Subversion针对此问题的解决方案是提供一种机制,提醒用户在开 ...
- 面试问题4:C语言预处理包括哪些
问题描述:C语言 预处理包括哪些操作 C语言的三种预处理包括:宏定义(#define).文件包含(#include).条件编译(#if.#else.#endif). 对于宏定义的介绍: 宏定义必须写在 ...
- 详解MySQL的用户密码过期功能
这篇文章主要为大家详细介绍了MySQL的用户密码过期功能的相关资料,需要的朋友可以参考下 Payment Card Industry,即支付卡行业,PCI行业表示借记卡.信用卡.预付卡.电子钱包. ...
- 入CTF坑必不可少的地方-保持更新
0x00 前言 没有交易,没有买卖,没有排名,纯属分享:p 0x01 CTF介绍 CTF领域指南CTF介绍大全CTF赛事预告 0x02 CTF练习 BIN:reversingpwnableexploi ...
- JAVA 8 方法引用 - Method References
什么是方法引用 简单地说,就是一个Lambda表达式.在Java 8中,我们会使用Lambda表达式创建匿名方法,但是有时候,我们的Lambda表达式可能仅仅调用一个已存在的方法,而不做任何其它事,对 ...
- 【译】Java中的枚举
前言 译文链接:http://www.programcreek.com/2014/01/java-enum-examples/ Java中的枚举跟其它普通类很像,在其内部包含了一堆预先定义好的对象集合 ...
- 从刚刚「简书」平台的短暂异常,谈Nginx An error occurred报错~
09.26简书平台的短暂异常 An error occurred. Sorry, the page you are looking for is currently unavailable. Plea ...