Study note for Continuous Probability Distributions

Basics of Probability

Probability density function (pdf). Let X be a continuous random variable. Then a probability distribution or probability density function (pdf) of X is a function f(x) such that, for any two numbers a and b with $a\le b$,

$Pr(a\le X\le b)=\int_a^b f(x) dx$

That is, the probability that X takes on a value in the interval [a, b] is the area above the interval and under the graph of the density function. The graph of f(x) is often referred to as the density curve.
- The pdf is a function that describes the relative likelihood for the random variable to take on a given value. Intuitively, one can think of f(x)dx as being the probability of a random variable X falling within the infinitesimal interval [x, x+dx]; f(x) itself is a density, not a probability, and may exceed 1. My understanding: is a probability, by contrast, regarded as an absolute likelihood?
- $f(x)\ge 0$ for all x;
- $\int_{-\infty}^{\infty}f(x)dx = \mbox{area under the entire graph of }f(x)=1$
- For a continuous random variable X, the probability of any single possible value is 0: $Pr(X=c)=\int_c^c f(x) dx=\lim_{\epsilon\rightarrow 0}\int_{c-\epsilon}^{c+\epsilon} f(x) dx=0$
- Intuitively, since a continuous variable can take infinitely many possible values, the probability of each single value is vanishingly small and equals 0 in the limit. For a continuous random variable, it is therefore more meaningful to look at the probability of falling in a certain interval than the probability at a specific point.
- A continuous random variable usually represents events related to measurements.
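The properties above can be checked numerically. The sketch below is only an illustration: the density f(x) = 2x on [0, 1] is a hypothetical example chosen because its integrals are easy to verify by hand, and `pdf`, `integrate`, and `prob_between` are names made up for this note.

```python
def pdf(x):
    """Hypothetical density: f(x) = 2x on [0, 1] and 0 elsewhere."""
    return 2.0 * x if 0.0 <= x <= 1.0 else 0.0


def integrate(f, a, b, steps=10_000):
    """Approximate the integral of f over [a, b] with the trapezoidal rule."""
    if a == b:
        return 0.0  # an interval of zero width has zero area
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for i in range(1, steps):
        total += f(a + i * h)
    return total * h


def prob_between(a, b):
    """Pr(a <= X <= b) as the area under the density curve."""
    return integrate(pdf, a, b)


# The density is zero outside [0, 1], so the total area reduces to this range.
total_area = integrate(pdf, 0.0, 1.0)
print("total area:", total_area)
print("Pr(0.2 <= X <= 0.5):", prob_between(0.2, 0.5))

# Shrinking the interval around c = 0.5 drives the probability towards 0.
for eps in (1e-1, 1e-3, 1e-6):
    print(eps, prob_between(0.5 - eps, 0.5 + eps))
```

For this particular density the interval probability is b² - a², so the numerical value for [0.2, 0.5] can be compared against 0.21 by hand.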
In mathematics, a moment is, loosely speaking, a quantitative measure of the shape of a set of points.
- The first raw moment is the mean of the distribution, $\mu=E(X)$.
- The second central moment is the variance, $\sigma^2=E((X-\mu)^2)$. The normalized n-th central moment, or standardized moment, is the n-th central moment divided by $\sigma^n$: $E((X-\mu)^n)/\sigma^n$.
- The third standardized moment is the skewness, a measure of the asymmetry of the distribution.
- The fourth standardized moment is called "kurtosis", a measure of how heavy the tails are (loosely, whether the distribution is tall and skinny or short and squat) compared to the normal distribution of the same variance.
- Higher-order moments are moments beyond the 4th order.
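As a small illustration of these definitions, the sketch below computes raw, central, and standardized moments of a finite sample. The function names and the two data sets are made up for this note, and the moments are plain sample averages (no bias correction).

```python
import math


def raw_moment(xs, n):
    """n-th raw moment: the average of x**n."""
    return sum(x ** n for x in xs) / len(xs)


def central_moment(xs, n):
    """n-th central moment: the average of (x - mean)**n."""
    mu = raw_moment(xs, 1)
    return sum((x - mu) ** n for x in xs) / len(xs)


def standardized_moment(xs, n):
    """n-th central moment divided by sigma**n."""
    sigma = math.sqrt(central_moment(xs, 2))
    return central_moment(xs, n) / sigma ** n


skewed = [1.0, 2.0, 3.0, 4.0, 10.0]       # long right tail
symmetric = [-2.0, -1.0, 0.0, 1.0, 2.0]   # symmetric about 0

print("mean:", raw_moment(skewed, 1))
print("variance:", central_moment(skewed, 2))
print("skewness:", standardized_moment(skewed, 3))
print("kurtosis:", standardized_moment(skewed, 4))
print("skewness of symmetric data:", standardized_moment(symmetric, 3))
```

The right-tailed sample gives a positive skewness, while the symmetric sample gives a skewness of zero.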
Likelihood measures how well a given parameter value explains the observed data. Unlike a probability, it need not sum or integrate to 1 over the parameter values. In statistics, probability is a function of the data given the parameters, while likelihood is a function of the parameters given the observed data.
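The distinction can be made concrete with a small sketch. It assumes a normal model with known standard deviation 1 (an assumption made only for illustration) and three made-up observations; the data are held fixed and the mean parameter is varied over a grid. The names `normal_pdf`, `log_likelihood`, and `best_mu` are hypothetical.

```python
import math


def normal_pdf(x, mu, sigma):
    """Density of a normal distribution with mean mu and std. deviation sigma."""
    coeff = 1.0 / math.sqrt(2.0 * math.pi * sigma ** 2)
    return coeff * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))


def log_likelihood(mu, data, sigma=1.0):
    """Log-likelihood of mu: the data are fixed, the parameter varies."""
    return sum(math.log(normal_pdf(x, mu, sigma)) for x in data)


data = [1.0, 2.0, 3.0]                       # made-up observations
candidates = [i / 100 for i in range(401)]   # grid of mu values in [0, 4]

# The candidate that explains the data best is the maximum-likelihood estimate.
best_mu = max(candidates, key=lambda mu: log_likelihood(mu, data))
print("most likely mu on the grid:", best_mu)
print("sample mean:", sum(data) / len(data))
```

In this setting the grid maximizer coincides with the sample mean, which is the known maximum-likelihood estimate of the mean of a normal model.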

Uniform Distribution

The uniform distribution is summarized as follows:
- notation: U(a, b), where a, b are the minimum and maximum values of a uniform distribution, a<b.
- p.d.f:
  $f(x)=\left\{\begin{array}{ll} \frac{1}{b-a} & \mbox{for } x\in [a, b] \\ 0 & \mbox{otherwise}\end{array}\right.$
- mean: $\frac{1}{2}(a+b)$
- variance: $\frac{1}{12}(b-a)^2$
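The closed-form mean and variance can be compared against a simulation. This is a minimal sketch: the endpoints a = 2 and b = 5, the sample size, and the seed are arbitrary choices, and the helper names are made up for this note.

```python
import random


def uniform_pdf(x, a, b):
    """Density of U(a, b): constant 1/(b-a) on [a, b], 0 elsewhere."""
    return 1.0 / (b - a) if a <= x <= b else 0.0


def uniform_mean(a, b):
    return 0.5 * (a + b)


def uniform_variance(a, b):
    return (b - a) ** 2 / 12.0


random.seed(0)  # fixed seed so the run is reproducible
a, b = 2.0, 5.0
samples = [random.uniform(a, b) for _ in range(100_000)]

sample_mean = sum(samples) / len(samples)
sample_var = sum((x - sample_mean) ** 2 for x in samples) / len(samples)

print("mean:     formula", uniform_mean(a, b), "simulation", sample_mean)
print("variance: formula", uniform_variance(a, b), "simulation", sample_var)
```

With a large sample the simulated values should land close to (a+b)/2 = 3.5 and (b-a)²/12 = 0.75.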

Normal Distribution

The normal (Gaussian) distribution is summarized as follows:
- notation: $X\sim \mathcal{N}(\mu, \sigma^2)$, where $\mu$ is the mean of the distribution, and $\sigma$ is the standard deviation. If $\mu=0 \mbox{ and } \sigma=1$, the distribution is called the standard normal distribution.
- p.d.f:
  $f(x)= \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left\{-\frac{(x-\mu)^2}{2\sigma^2}\right\}$
- mean: $\mu$
- variance: $\sigma^2$
- P(a<X<b): the integral for arbitrary a and b has no closed form in terms of elementary functions. Hence, X is usually converted to a standard normal variable (a.k.a. standardization), whose c.d.f. can be read directly from a table.
The normal distribution is often used in the natural and social sciences to model real-valued random variables whose distributions are not known.
Standardization: if X is a normal random variable with mean $\mu$ and standard deviation $\sigma$ , then $Z=\frac{X-\mu}{\sigma}$ is a standard normal random variable.
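In code, the table lookup can be replaced by the error function. The sketch below uses the identity $\Phi(z)=\frac{1}{2}(1+\mathrm{erf}(z/\sqrt{2}))$ for the standard normal c.d.f.; the parameters $\mu=100$, $\sigma=15$ and the helper names are made up for illustration. Python's `statistics.NormalDist` is used only as a cross-check.

```python
import math
from statistics import NormalDist


def standard_normal_cdf(z):
    """Phi(z), the c.d.f. of the standard normal, via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def normal_interval_prob(a, b, mu, sigma):
    """P(a < X < b) for X ~ N(mu, sigma^2), computed by standardizing a and b."""
    z_a = (a - mu) / sigma
    z_b = (b - mu) / sigma
    return standard_normal_cdf(z_b) - standard_normal_cdf(z_a)


mu, sigma = 100.0, 15.0  # hypothetical parameters

# One standard deviation either side of the mean.
p = normal_interval_prob(85.0, 115.0, mu, sigma)

# Cross-check against the standard library's normal distribution.
dist = NormalDist(mu, sigma)
p_check = dist.cdf(115.0) - dist.cdf(85.0)

print("P(85 < X < 115) by standardization:", p)
print("P(85 < X < 115) by NormalDist:     ", p_check)
```

Both values should be close to 0.6827, the familiar probability of falling within one standard deviation of the mean.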
Central Limit Theorem
- The Gaussian distribution is important because of the central limit theorem.
- A crude statement of the central limit theorem: things that are the result of the addition of lots of small effects tend to become Gaussian, provided that no one term dominates the sum.
- A more exact statement:
  - Let $Y_1, Y_2, \ldots$ be an infinite sequence of independent random variables, each with the same probability distribution (which need not be Gaussian).
  - Suppose that the mean $\mu$ and variance $\sigma^2$ of this distribution are both finite, with $\sigma^2>0$.
  - For any numbers a and b with a<b:
    $\lim_{n\rightarrow \infty} P\left[a<\frac{Y_1+Y_2+\ldots+Y_n-n\mu}{\sigma \sqrt{n}}<b\right]=\frac{1}{\sqrt{2\pi}} \int_a^b e^{-\frac{1}{2} y^2} dy$
- It tells us that under a wide range of circumstances the probability distribution of the (suitably standardized) sum of random variables tends to a Gaussian distribution as the number of terms in the sum $\rightarrow \infty$.
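The theorem can be illustrated with a small simulation. In this sketch each $Y_i$ is drawn from U(0, 1), which is an arbitrary choice of a non-Gaussian distribution with $\mu=1/2$ and $\sigma^2=1/12$; the values n = 30, the number of trials, the seed, and the interval (-1, 1) are likewise assumptions made for illustration.

```python
import math
import random


def standard_normal_cdf(z):
    """Phi(z) via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def standardized_sum(n, rng):
    """(Y_1 + ... + Y_n - n*mu) / (sigma * sqrt(n)) for Y_i ~ U(0, 1)."""
    mu = 0.5
    sigma = math.sqrt(1.0 / 12.0)
    total = sum(rng.random() for _ in range(n))
    return (total - n * mu) / (sigma * math.sqrt(n))


rng = random.Random(0)  # fixed seed so the run is reproducible
n, trials = 30, 20_000
a, b = -1.0, 1.0

# Fraction of standardized sums that fall strictly between a and b.
hits = sum(1 for _ in range(trials) if a < standardized_sum(n, rng) < b)
empirical = hits / trials

# Right-hand side of the theorem: the standard normal probability of (a, b).
limit = standard_normal_cdf(b) - standard_normal_cdf(a)

print("empirical probability:", empirical)
print("Gaussian limit:       ", limit)
```

Even for a moderate n the empirical fraction should already be close to the Gaussian limit, although a simulation only illustrates the theorem and does not prove it.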

Multivariate Distributions

We can generalize the definition of random variables to vectors. A random vector $\mathbf{X}=(X_1, \ldots, X_c)$ is a vector whose components $X_i$ are univariate random variables. If $X_i$ are all discrete, then $\mathbf{X}$ is a discrete random vector. If $X_i$ are all continuous, $\mathbf{X}$ is called a continuous random vector.
The distribution of a random vector is characterized by the joint c.d.f., which is defined as:
$F(\mathbf{x})=P(\mathbf{X}\le \mathbf{x})=P(X_1\le x_1, \ldots, X_c\le x_c), \quad \mathbf{x}=(x_1, \ldots, x_c)$
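A joint c.d.f. can be estimated from samples by counting how often every component falls at or below its threshold. The sketch below assumes, purely for illustration, a two-dimensional random vector whose components are independent U(0, 1) variables, in which case the exact joint c.d.f. inside the unit square is $x_1 x_2$. The function name and the evaluation point are made up.

```python
import random


def empirical_joint_cdf(samples, x):
    """Fraction of sample vectors with every component <= the matching x_i."""
    count = sum(
        1 for s in samples if all(si <= xi for si, xi in zip(s, x))
    )
    return count / len(samples)


rng = random.Random(0)  # fixed seed so the run is reproducible

# Random vectors (X_1, X_2) with independent U(0, 1) components (an assumption).
samples = [(rng.random(), rng.random()) for _ in range(50_000)]

x = (0.5, 0.8)
estimate = empirical_joint_cdf(samples, x)
exact = x[0] * x[1]  # valid only because the components are independent

print("empirical joint c.d.f. at", x, ":", estimate)
print("exact value under independence:", exact)
```

The product form holds only under the independence assumption; for dependent components the joint c.d.f. does not factor into the marginals.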

References

Paola Sebastiani, A Tutorial on Probability Theory
