Source: http://wenku.baidu.com/link?url=9KrZhWmkIDHrqNHiXCGfkJVQWGFKOzaeiB7SslSdW_JnXCkVHsHsXJyvGbDva4V5A-uuOl84mg5zkTECichHX_AsN0mZalfI9BzDFOeNe-G###

❤ Simple linear regression

1. Y = β0 + β1*X + e

where:

Y - dependent variable (response)

X - independent variable (predictor/explanatory)

β0 - intercept

β1 - slope of the regression line

e - random error

2. Y' = b0 + b1*X

where: Y' - predicted value of Y

e = Y - Y'

3. Least squarea regression minizes the sum of the square of the errors and can be used to estimate b0 and b1.

4. Measuring the fit of the estimated model.

- The varibility of Y

SST (Sum of Squared Total): total variability about the mean, SST = sum((Y - mean(Y))^2);

SSE (Sum of Squared Error): variability about the regression line, SSE = sum(e^2) = sum((Y - mean(Y'))^2), SSE is unexplained varibility;

SSR (Sum of Squares due to Regression): variability that is explained, SSR = sum((Y' - mean(Y))^2), SSR is explained varibility.

Note that SST = SSE + SSR.

- Coefficient of determination

r^2: proportion of explained variability by the regression equation.

0 <= r^2 = 1 - SSE/SST = SSR/SST <= 1

- Correlation coefficient

r: strength of the relationship between X and Y.

-1 <= r <= 1

5. Assumptions in the regression model

Errors are independent, normally distributed, with the mean of zero, with a constant variance.

The assumptions can be tested by using residual analysis.

6. MSE (Mean Squared Error)

Estimation of error variance of the regression equation.

s^2 = MSE = SSE / (n - k - 1)

where:

n - number of observations in the sample

k - number of independent variables

Standard deviation of the regression: s = sqrt(MSE) is also frequently used.

❤ Test the model for significance: F-test

Used to statistically test the null hypothesis H0: there is no linear relationship between Y and X (i.e. β1 = 0).

If p value is low, then we regect H0 and conclude there is linear relationship:

F = MSR / MSE

where: MSR = SSR / k

Good regression model should have significant F value and high r^2 value.

Statistical test can be performed on the regression coefficients. H0: the βs are 0.

For a simple linear regression, the test for regression coefficient gives the same information as the ones given by F-test.

❤ ANOVA tables

The general form of the ANOVA table is helpful for understanding the interrelatedness of error terms.

❤ Multiple regression

Similar to the simple regression model, but there are more than one X in the multiple regression models.

Y' = b0 + b1*X1 + b2*X2 + ... + bn*Xn

Note that if indenpendent variables is correlate to each other, colinearity or multicolinearity will happen. This will cause problems when intepreate variables individually although the overall model estimation may still be good.

Regression analysis的更多相关文章

  1. [ML学习笔记] 回归分析(Regression Analysis)

    [ML学习笔记] 回归分析(Regression Analysis) 回归分析:在一系列已知自变量与因变量之间相关关系的基础上,建立变量之间的回归方程,把回归方程作为算法模型,实现对新自变量得出因变量 ...

  2. Regression Analysis Using Excel

    Regression Analysis Using Excel Setup By default, data analysis add-in is not enabled. Follow the st ...

  3. Functional mechanism: regression analysis under differential privacy_阅读报告

    Functional mechanism: regression analysis under differential privacy 论文学习报告 组员:裴建新   赖妍菱    周子玉 2020 ...

  4. 7 Types of Regression Techniques you should know!

    翻译来自:http://news.csdn.net/article_preview.html?preview=1&reload=1&arcid=2825492 摘要:本文解释了回归分析 ...

  5. STA 463 Simple Linear Regression Report

    STA 463 Simple Linear Regression ReportSpring 2019 The goal of this part of the project is to perfor ...

  6. regression | p-value | Simple (bivariate) linear model | 线性回归 | 多重检验 | FDR | BH | R代码

    P122, 这是IQR method课的第一次作业,需要统计检验,x和y是否显著的有线性关系. Assignment 1 1) Find a small bivariate dataset (pref ...

  7. Multiple Regression

    Multiple Regression What is multiple regression? Multiple regression is regression analysis with mor ...

  8. Correlation and Regression

    Correlation and Regression Sample Covariance The covariance between two random variables is a statis ...

  9. 7 Types of Regression Techniques

    https://www.analyticsvidhya.com/blog/2015/08/comprehensive-guide-regression/ What is Regression Anal ...

随机推荐

  1. IOS 四舍五入 进一法 去尾法

    float numberToRound; int result; numberToRound = 4.51; result = (int)roundf(numberToRound); NSLog(@& ...

  2. python 获取星期几

    In [17]: now.strftime(%a),now.strftime(%w) Out[17]: ('Mon', '1') Directive Meaning %a Weekday name. ...

  3. C#实现函数默认值和C#4.0实现默认值

    static void Main(string[] args) { SayHello(); SayHello("侯志强"); Console.ReadKey(); } C#.0实现 ...

  4. git入门学习(二):新建分支/上传代码/删除分支

    一.git新建分支,上传代码到新的不同分支  我要实现的效果,即是多个内容的平行分支:这样做的主要目的是方便统一管理属于同一个内容的不同的项目,互不干扰.如图所示: 前提是我的github上已经有we ...

  5. 大家一起和snailren学java-(13)字符串

    “好久没有写这个系列了.其实也有在看,不过觉得一些很基本的都写上来没意思.现在打算的是将整本书看完后,最后整合为一篇blog,筛选出一些平时没有注意到的或者更深入的理解” 在写程序中,字符串Strin ...

  6. Java数组的12个常用方法

    以下是12个关于Java数组最常用的方法,它们是stackoverflow得票最高的问题. 声明一个数组 String[] aArray = new String[5]; String[] bArra ...

  7. Spring Batch 批处理框架

    <Spring Batch 批处理框架>基本信息作者: 刘相 出版社:电子工业出版社ISBN:9787121252419上架时间:2015-1-24出版日期:2015 年2月开本:16开页 ...

  8. javascript多态 - 类形式实现demo

    /* *多态 * 对传入的参数做判断以实现多种调用方式 */ //类形式实现 function Add(){ function zero(){ return 10; } function one(nu ...

  9. maven依赖本地非repository中的jar包-依赖jar包放在WEB-INF/lib等目录下的情况客户端编译出错的处理

    MAVEN 今天在使用maven编译打包一个web应用的时候,碰到一个问题: 项目在开发是引入了依赖jar包,放在了WEB-INF/lib目录下,并通过buildpath中将web libariary ...

  10. 为IIS站点添加限制IP

    /// <summary> /// 添加站点限制IP /// </summary> /// <param name="sitename">站点名 ...