Basics of Probability

  • Probability density function (pdf). Let X be a continuous random variable. Then a probability distribution or probability density function (pdf) of X is a function f(x) such that, for any two numbers a and b with $a \le b$,

      $P(a \le X \le b) = \int_a^b f(x)\,dx.$

    That is, the probability that X takes on a value in the interval [a, b] is the area above the interval and under the graph of the density function. The graph of f(x) is often referred to as the density curve.

    • The pdf is a function that describes the relative likelihood for the random variable to take on a given value. Intuitively, one can think of f(x) dx as the probability of X falling within the infinitesimal interval [x, x+dx]; f(x) itself is a density, not a probability, and may exceed 1. My understanding: a probability, by contrast, is regarded as an absolute likelihood.
    • $f(x) \ge 0$ for all x, and $\int_{-\infty}^{\infty} f(x)\,dx = 1$;
    • For a continuous random variable X, the probability of any single possible value c is 0: $P(X = c) = \int_c^c f(x)\,dx = 0$.
    • Intuitively, a continuous variable can take infinitely many values, so the probability of each single value is vanishingly small and equals 0 in the limit. For a continuous random variable it is therefore more meaningful to look at the probability of an interval than at the probability of a specific point.
    • A continuous random variable usually represents events related to measurements.
  • In mathematics, a moment is, loosely speaking, a quantitative measure of the shape of a set of points.
    • The first raw moment is the mean of the distribution, $\mu = E[X]$.
    • The second central moment is the variance, $\sigma^2 = E[(X-\mu)^2]$. The normalized n-th central moment, or standardized moment, is the n-th central moment divided by $\sigma^n$; the normalized n-th central moment of X is $E[(X-\mu)^n] / \sigma^n$.
    • The third standardized moment is the skewness, a measure of the asymmetry of the distribution.
    • The fourth standardized moment is called "kurtosis", a measure of whether the distribution is tall and skinny or short and squat compared to the normal distribution of the same variance.
    • High-order moments are moments beyond the 4th order.
  • Likelihood measures how plausible a parameter value is given the observed data. It is weaker than probability in that it need not sum or integrate to 1 over the parameters. In statistics, probability is a function of the data given the parameters, while likelihood is a function of the parameters given the observed data.
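The definitions in this section can be checked numerically. The sketch below is only an illustration under stated assumptions: it picks an exponential density with rate 1 (not a distribution discussed in these notes), truncates the integration range at 50, and uses a simple midpoint rule. It computes an interval probability as the area under the pdf, then the mean, the variance, and the skewness and kurtosis as standardized moments (central moment divided by $\sigma^n$). The helper names `integrate`, `prob`, `raw_moment`, and `central_moment` are made up for this example.

```python
import math

def integrate(g, lo, hi, n=100_000):
    """Midpoint-rule approximation of the integral of g over [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(g(lo + (i + 0.5) * h) for i in range(n))

def pdf(x):
    """Exponential density with rate 1 (an assumed example, not from the notes)."""
    return math.exp(-x) if x >= 0 else 0.0

UPPER = 50.0  # assumed truncation point; the tail beyond it is negligible

def prob(a, b):
    """P(a <= X <= b) as the area under the pdf between a and b."""
    return integrate(pdf, a, b)

def raw_moment(n):
    """n-th raw moment E[X^n]."""
    return integrate(lambda x: x ** n * pdf(x), 0.0, UPPER)

def central_moment(n, mean):
    """n-th central moment E[(X - mean)^n]."""
    return integrate(lambda x: (x - mean) ** n * pdf(x), 0.0, UPPER)

total = integrate(pdf, 0.0, UPPER)           # should be close to 1
p_1_2 = prob(1.0, 2.0)                       # exact value is e^-1 - e^-2
mean = raw_moment(1)                         # first raw moment
variance = central_moment(2, mean)           # second central moment
sigma = math.sqrt(variance)
skewness = central_moment(3, mean) / sigma ** 3  # third standardized moment
kurtosis = central_moment(4, mean) / sigma ** 4  # fourth standardized moment

print(total, p_1_2, mean, variance, skewness, kurtosis)
```

For this particular density the theoretical values are mean 1, variance 1, skewness 2, and kurtosis 9, so the printed numbers should land close to those.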

Uniform Distribution

  • The uniform distribution is summarized as follows:

    • notation: U(a, b), where a and b are the minimum and maximum values of the distribution, a < b.
    • p.d.f: $f(x) = \frac{1}{b-a}$ for $a \le x \le b$, and $f(x) = 0$ otherwise
    • mean: $\frac{1}{2}(a+b)$
    • variance: $\frac{1}{12}(b-a)^2$
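As a quick sanity check of the summary above, the following sketch draws samples from U(a, b) and compares the sample mean and variance with (a+b)/2 and (b-a)^2/12. The endpoints a = 2 and b = 5, the sample size, and the seed are arbitrary choices for illustration, and `uniform_pdf` and `sample_mean_and_variance` are hypothetical helper names.

```python
import random

def uniform_pdf(x, a, b):
    """Density of U(a, b): constant 1/(b-a) on [a, b], zero elsewhere."""
    return 1.0 / (b - a) if a <= x <= b else 0.0

def sample_mean_and_variance(a, b, n=100_000, seed=0):
    """Draw n samples from U(a, b) and return their mean and (population) variance."""
    rng = random.Random(seed)
    xs = [rng.uniform(a, b) for _ in range(n)]
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return mean, var

a, b = 2.0, 5.0                      # assumed example endpoints
theory_mean = 0.5 * (a + b)          # (a + b) / 2
theory_var = (b - a) ** 2 / 12.0     # (b - a)^2 / 12
emp_mean, emp_var = sample_mean_and_variance(a, b)

print(theory_mean, emp_mean)
print(theory_var, emp_var)
```

The empirical values only approximate the theoretical ones; the gap shrinks roughly like one over the square root of the sample size.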

Normal Distribution

  • The normal (Gaussian) distribution is summarized as follows:

    • notation: $N(\mu, \sigma^2)$, where $\mu$ is the mean of the distribution and $\sigma$ is the standard deviation. If $\mu = 0$ and $\sigma = 1$, the distribution is called the standard normal distribution.
    • p.d.f: $f(x) = \frac{1}{\sigma\sqrt{2\pi}} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$
    • mean: $\mu$
    • variance: $\sigma^2$
    • P(a<X<b): the integral of the p.d.f between arbitrary a and b has no closed form in elementary functions. Hence, X is usually converted to a standard normal variable (a.k.a. standardization), whose c.d.f can be read directly from a table.
  • Normal distributions are often used in the natural and social sciences for real-valued random variables whose distributions are not known.
  • Standardization: if X is a normal random variable with mean $\mu$ and standard deviation $\sigma$, then $Z = (X - \mu)/\sigma$ is a standard normal random variable.
  • Central Limit Theorem
    • The Gaussian distribution is important because of the central limit theorem.
    • A crude statement of the central limit theorem: things that are the result of the addition of lots of small effects tend to become Gaussian, provided that no one term dominates the sum.
    • A more exact statement:
      • Let Y1, Y2, ... be an infinite sequence of independent random variables, each with the same probability distribution (which may be any distribution, not necessarily Gaussian).
      • Suppose that the mean $\mu$ and variance $\sigma^2$ of this distribution are both finite, with $\sigma^2 > 0$.
      • For any numbers a and b with $a \le b$: $\lim_{n \to \infty} P\left(a \le \frac{Y_1 + \cdots + Y_n - n\mu}{\sigma\sqrt{n}} \le b\right) = \frac{1}{\sqrt{2\pi}} \int_a^b e^{-z^2/2}\,dz$
    • It tells us that under a wide range of circumstances the probability distribution that describes the sum of random variables tends to a Gaussian distribution as the number of terms in the sum goes to infinity.
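Standardization and the central limit theorem can both be tried out in a few lines. The sketch below is a rough illustration, not a proof: instead of a lookup table it evaluates the standard normal c.d.f with `math.erf`, and it approximates the CLT limit by summing 30 independent U(0, 1) draws per trial. The choice of U(0, 1), the number of terms, the number of trials, and the seed are all assumptions made for this example, and the function names are hypothetical.

```python
import math
import random

def std_normal_cdf(z):
    """C.d.f of the standard normal distribution, written via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def normal_interval_prob(a, b, mu, sigma):
    """P(a < X < b) for X ~ N(mu, sigma^2), obtained by standardizing a and b."""
    return std_normal_cdf((b - mu) / sigma) - std_normal_cdf((a - mu) / sigma)

def clt_interval_freq(a, b, n_terms=30, n_trials=20_000, seed=0):
    """Empirical frequency of a <= Z_n <= b, where Z_n is the standardized
    sum of n_terms independent U(0, 1) variables."""
    rng = random.Random(seed)
    mu, sigma = 0.5, math.sqrt(1.0 / 12.0)  # mean and std of U(0, 1)
    hits = 0
    for _ in range(n_trials):
        s = sum(rng.random() for _ in range(n_terms))
        z = (s - n_terms * mu) / (sigma * math.sqrt(n_terms))
        if a <= z <= b:
            hits += 1
    return hits / n_trials

# Same interval expressed for N(0, 1) and for N(10, 2^2): one standard
# deviation either side of the mean, so the two probabilities coincide.
p_exact = normal_interval_prob(-1.0, 1.0, 0.0, 1.0)
p_shifted = normal_interval_prob(8.0, 12.0, 10.0, 2.0)

# CLT check: sums of uniform variables should give nearly the same probability.
p_clt = clt_interval_freq(-1.0, 1.0)

print(p_exact, p_shifted, p_clt)
```

The simulated frequency carries sampling noise, so it should only be expected to agree with the Gaussian value to about two decimal places at this number of trials.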

Multivariate Distributions

  • We can generalize the definition of random variables to vectors. A random vector $X = (X_1, \ldots, X_n)$ is a vector whose components are univariate random variables. If $X_1, \ldots, X_n$ are all discrete, then X is a discrete random vector. If $X_1, \ldots, X_n$ are all continuous, X is called a continuous random vector.
  • The distribution of a random vector is characterized by the joint c.d.f, which is defined as: $F(x_1, \ldots, x_n) = P(X_1 \le x_1, \ldots, X_n \le x_n)$.
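To make the joint c.d.f concrete, here is a minimal sketch that estimates $F(x_1, x_2) = P(X_1 \le x_1, X_2 \le x_2)$ from samples. It assumes a two-component random vector whose components are independent U(0, 1) variables, so that the exact value is simply $x_1 x_2$; that setup, the evaluation point, and the helper name `empirical_joint_cdf` are illustrative choices, not something taken from the notes.

```python
import random

def empirical_joint_cdf(samples, point):
    """Fraction of sample vectors whose every component is <= the
    matching coordinate of point."""
    hits = sum(1 for s in samples if all(si <= pi for si, pi in zip(s, point)))
    return hits / len(samples)

rng = random.Random(0)
# Assumed example: two independent U(0, 1) components.
samples = [(rng.random(), rng.random()) for _ in range(50_000)]

point = (0.3, 0.6)
f_emp = empirical_joint_cdf(samples, point)
f_exact = point[0] * point[1]  # valid only because the components are independent

print(f_emp, f_exact)
```

For dependent components the product formula no longer holds, but the empirical estimate is computed in exactly the same way.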
