Source: http://wenku.baidu.com/link?url=9KrZhWmkIDHrqNHiXCGfkJVQWGFKOzaeiB7SslSdW_JnXCkVHsHsXJyvGbDva4V5A-uuOl84mg5zkTECichHX_AsN0mZalfI9BzDFOeNe-G###

❤ Simple linear regression

1. Y = β0 + β1*X + e

where:

Y - dependent variable (response)

X - independent variable (predictor/explanatory)

β0 - intercept

β1 - slope of the regression line

e - random error

2. Y' = b0 + b1*X

where: Y' - predicted value of Y

e = Y - Y'

3. Least squarea regression minizes the sum of the square of the errors and can be used to estimate b0 and b1.

4. Measuring the fit of the estimated model.

- The varibility of Y

SST (Sum of Squared Total): total variability about the mean, SST = sum((Y - mean(Y))^2);

SSE (Sum of Squared Error): variability about the regression line, SSE = sum(e^2) = sum((Y - mean(Y'))^2), SSE is unexplained varibility;

SSR (Sum of Squares due to Regression): variability that is explained, SSR = sum((Y' - mean(Y))^2), SSR is explained varibility.

Note that SST = SSE + SSR.

- Coefficient of determination

r^2: proportion of explained variability by the regression equation.

0 <= r^2 = 1 - SSE/SST = SSR/SST <= 1

- Correlation coefficient

r: strength of the relationship between X and Y.

-1 <= r <= 1

5. Assumptions in the regression model

Errors are independent, normally distributed, with the mean of zero, with a constant variance.

The assumptions can be tested by using residual analysis.

6. MSE (Mean Squared Error)

Estimation of error variance of the regression equation.

s^2 = MSE = SSE / (n - k - 1)

where:

n - number of observations in the sample

k - number of independent variables

Standard deviation of the regression: s = sqrt(MSE) is also frequently used.

❤ Test the model for significance: F-test

Used to statistically test the null hypothesis H0: there is no linear relationship between Y and X (i.e. β1 = 0).

If p value is low, then we regect H0 and conclude there is linear relationship:

F = MSR / MSE

where: MSR = SSR / k

Good regression model should have significant F value and high r^2 value.

Statistical test can be performed on the regression coefficients. H0: the βs are 0.

For a simple linear regression, the test for regression coefficient gives the same information as the ones given by F-test.

❤ ANOVA tables

The general form of the ANOVA table is helpful for understanding the interrelatedness of error terms.

❤ Multiple regression

Similar to the simple regression model, but there are more than one X in the multiple regression models.

Y' = b0 + b1*X1 + b2*X2 + ... + bn*Xn

Note that if indenpendent variables is correlate to each other, colinearity or multicolinearity will happen. This will cause problems when intepreate variables individually although the overall model estimation may still be good.

Regression analysis的更多相关文章

  1. [ML学习笔记] 回归分析(Regression Analysis)

    [ML学习笔记] 回归分析(Regression Analysis) 回归分析:在一系列已知自变量与因变量之间相关关系的基础上,建立变量之间的回归方程,把回归方程作为算法模型,实现对新自变量得出因变量 ...

  2. Regression Analysis Using Excel

    Regression Analysis Using Excel Setup By default, data analysis add-in is not enabled. Follow the st ...

  3. Functional mechanism: regression analysis under differential privacy_阅读报告

    Functional mechanism: regression analysis under differential privacy 论文学习报告 组员:裴建新   赖妍菱    周子玉 2020 ...

  4. 7 Types of Regression Techniques you should know!

    翻译来自:http://news.csdn.net/article_preview.html?preview=1&reload=1&arcid=2825492 摘要:本文解释了回归分析 ...

  5. STA 463 Simple Linear Regression Report

    STA 463 Simple Linear Regression ReportSpring 2019 The goal of this part of the project is to perfor ...

  6. regression | p-value | Simple (bivariate) linear model | 线性回归 | 多重检验 | FDR | BH | R代码

    P122, 这是IQR method课的第一次作业,需要统计检验,x和y是否显著的有线性关系. Assignment 1 1) Find a small bivariate dataset (pref ...

  7. Multiple Regression

    Multiple Regression What is multiple regression? Multiple regression is regression analysis with mor ...

  8. Correlation and Regression

    Correlation and Regression Sample Covariance The covariance between two random variables is a statis ...

  9. 7 Types of Regression Techniques

    https://www.analyticsvidhya.com/blog/2015/08/comprehensive-guide-regression/ What is Regression Anal ...

随机推荐

  1. 你真的了解UIApplication吗?

    一:首先查看一下关于UIApplication的定义 NS_CLASS_AVAILABLE_IOS(2_0) @interface UIApplication : UIResponder //获得单例 ...

  2. AFNetworking讲解

    #import "ViewController.h" //#import "AFNetworking/AFNetworking.h" #import " ...

  3. GCD中的dispatch_barrier_async函数的使用(栅栏函数)

    <一>什么是dispatch_barrier_async函数 毫无疑问,dispatch_barrier_async函数的作用与barrier的意思相同,在进程管理中起到一个栅栏的作用,它 ...

  4. 【问题排查】StringIndexOutOfBoundsException

    工作中遇到 java.lang.StringIndexOutOfBoundsException ,查看网上资料,总结如下 1.异常定义: Java API指出StringIndexOutOfBound ...

  5. sql 2012 sequence 速记

    CodeSELECT id INTO #t1 FROM sys_id WHERE id < 3 SELECT next value for sequencebase FROM #t1 DROP ...

  6. Grunt安装配置教程:前端自动化工作流

    Grunt这货是啥? Grunt 是一个基于任务的 JavaScript 项目命令行构建工具. 最近很火的前端自动化小工具,基于任务的命令行构建工具 http://gruntjs.com Grunt能 ...

  7. android java substring说明

    substring(参数)是java中截取字符串的一个方法有两种传参方式一种是public String substring(int beginIndex)返回一个新的字符串,它是此字符串的一个子字符 ...

  8. PCIe 32GT/s 含义

    如下: Jul 26 03:42:53 kernel: ixgbe 0000:01:00.1: PCI Express bandwidth of 32GT/s available Jul 26 03: ...

  9. jquery——九宫格大转盘抽奖

    一.用到的图片 二.代码如下,重点是js部分 <!DOCTYPE html> <html> <head> <meta http-equiv="Con ...

  10. 时光煮雨 Unity3d 序列目标点的移动①

    系列目录 [Unity3D基础]让物体动起来①--基于UGUI的鼠标点击移动 [Unity3D基础]让物体动起来②--UGUI鼠标点击逐帧移动 时光煮雨 Unity3D让物体动起来③—UGUI DoT ...