Study notes for Discrete Probability Distribution

The Basics of Probability

Probability measures the amount of uncertainty of an event: a fact whose occurrence is uncertain.
Sample space refers to the set of all possible outcomes, denoted as $\mathcal{S}$ . An event is a subset of the sample space.
Some properties:
- Sum rule: $p(A\cup B)=p(A)+p(B)-p(A\cap B)$
- Union bound: $p(\cup_{i=1}^n A_i)\le \sum_{i=1}^n p(A_i)$
Conditional probability: $p(B|A)=\frac{p(B, A)}{p(A)}$ . To emphasize that p(A) is unconditional, p(A) is called "marginal probability", and p(B, A) is called "joint probability", where p(A, B)=p(B|A) p(A) is called the "multiplication rule" or "factorization rule".
Total probability theorem: $p(B)=p(B|A)p(A)+p(B|\bar{A})p(\bar{A})$
Bayes' Theorem:
$p(A|B)=\frac{p(A, B)}{p(B)}=\frac{p(B|A)p(A)}{p(B)}=\frac{p(B|A)p(A)}{p(B|A)p(A)+p(B|\bar{A})p(\bar{A})}$

Bayes' Theorem can be regarded as a rule to update a prior probability p(A) into a posterior probability p(A|B), taking into account the occurrence of the evidence (event) B.
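As a small illustration of this prior-to-posterior update, here is a minimal Python sketch. The function name `posterior` and the numbers used (a prior of 0.01, p(B|A)=0.95, p(B|not A)=0.05) are assumptions made up for the example, not values from these notes.

```python
def posterior(prior_a, p_b_given_a, p_b_given_not_a):
    """Return p(A|B) from p(A), p(B|A) and p(B|not A) using Bayes' theorem."""
    # Total probability theorem gives the denominator p(B).
    p_b = p_b_given_a * prior_a + p_b_given_not_a * (1.0 - prior_a)
    return p_b_given_a * prior_a / p_b

# Hypothetical numbers: a rare event A and fairly reliable evidence B.
post = posterior(prior_a=0.01, p_b_given_a=0.95, p_b_given_not_a=0.05)
print(post)  # roughly 0.161: the evidence raises p(A) from 1% to about 16%
```

Even with fairly reliable evidence, the posterior stays well below 1 because the prior p(A) is small.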
Conditional independence: Two events A and B, with p(A)>0 and p(B)>0, are conditionally independent given C if p(A, B|C)=p(A|C) p(B|C).
Probability mass function (p.m.f) of random variable X is a function $f: x\rightarrow f(x)=Pr[X=x]$
Joint probability mass function of X and Y is a function $f: (x, y)\rightarrow f(x,y)=Pr[X=x\cap Y=y]$
Cumulative distribution function (c.d.f) of a random variable X is a function: $f: x\rightarrow f(x)=Pr[X\le x]$
The c.d.f gives the probability that X falls in the interval $(-\infty, x]$, whereas the p.m.f gives the probability that X takes one specific value x.
Expectation: the expectation of a random variable X is: $E[X]=\sum_{x} x\,Pr[X=x]$
- linearity: E[aX+bY]=aE[X]+bE[Y]
- if X and Y are independent: E[XY]=E[X]*E[Y]
- Markov's inequality: let X be a nonnegative random variable with $E[X]<\infty$ , then for all $t>0, Pr[X\ge tE[X]]\le \frac{1}{t}$
Variance: the variance of a random variable X is: $Var[X]=\sigma_X^2=E[(X-E[X])^2]$, where $\sigma_X$ is called the standard deviation of the random variable X.
- Var[aX] = a²Var[X]
- if X and Y are independent, Var[X+Y]=Var[X]+Var[Y]
- Chebyshev's inequality: let X be a random variable with $Var[X]<\infty$ , then for all $t>0, Pr[|X-E[X]|\ge t\sigma_X]\le \frac{1}{t^2}$
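The definitions of expectation and variance, together with the two inequalities, can be checked numerically on a small p.m.f. The sketch below uses a fair six-sided die as an assumed example; the helper names `expectation`, `variance` and `tail_prob` are made up for illustration.

```python
import math

# Assumed example p.m.f.: a fair six-sided die, stored as value -> probability.
pmf = {x: 1 / 6 for x in range(1, 7)}

def expectation(pmf):
    """E[X] = sum over x of x * Pr[X = x]."""
    return sum(x * p for x, p in pmf.items())

def variance(pmf):
    """Var[X] = E[(X - E[X])^2]."""
    mu = expectation(pmf)
    return sum((x - mu) ** 2 * p for x, p in pmf.items())

def tail_prob(pmf, predicate):
    """Total probability of the values x for which predicate(x) is true."""
    return sum(p for x, p in pmf.items() if predicate(x))

mu = expectation(pmf)
sigma = math.sqrt(variance(pmf))
t = 1.2

# Markov: Pr[X >= t E[X]] <= 1/t
markov_lhs = tail_prob(pmf, lambda x: x >= t * mu)
# Chebyshev: Pr[|X - E[X]| >= t sigma] <= 1/t^2
chebyshev_lhs = tail_prob(pmf, lambda x: abs(x - mu) >= t * sigma)

print(mu, sigma ** 2)
print(markov_lhs, "<=", 1 / t)
print(chebyshev_lhs, "<=", 1 / t ** 2)
```

Both bounds hold but are loose here, which is typical: they use only the mean and the variance, not the full distribution.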

Bernoulli Distribution

A (single) Bernoulli trial is an experiment whose outcome is random and can be either of two possible outcomes, "success" and "failure", or "yes" and "no". Examples of Bernoulli trials include: flipping a coin, a single response in a political opinion poll, etc.
The Bernoulli distribution is a discrete probability distribution of a discrete random variable X, which takes value 1 with success probability p: Pr(X=1)=p, and value 0 with failure probability Pr(X=0)=q=1-p. More formally, the Bernoulli distribution is summarized as follows:
- notation: Bern(p), where 0<p<1 is the probability of success.
- support: X={0, 1}
- p.m.f: Pr[X=0]=q=1-p, Pr[X=1]=p
- mean: E[X]=p
- variance: Var[X]=p(1-p)
- It is a special case of the Binomial distribution B(n, p): the Bernoulli distribution is B(1, p).
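The summary above can be illustrated by simulation. This is a minimal sketch, assuming p=0.3 and a fixed seed purely for reproducibility; the helper name `bernoulli` is hypothetical.

```python
import random

def bernoulli(p, rng):
    """Draw one Bernoulli(p) sample: 1 with probability p, otherwise 0."""
    return 1 if rng.random() < p else 0

rng = random.Random(0)  # fixed seed, chosen arbitrarily
p = 0.3                 # assumed success probability
samples = [bernoulli(p, rng) for _ in range(100_000)]

mean = sum(samples) / len(samples)
var = sum((x - mean) ** 2 for x in samples) / len(samples)
print(mean, var)  # should be close to p = 0.3 and p(1-p) = 0.21
```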

Binomial Distribution

The Binomial distribution is the discrete probability distribution of the number of successes in a sequence of n independent Bernoulli trials with success probability p, denoted as X~B(n, p).
The Binomial distribution is often used to model the number of successes in a sample of size n drawn with replacement from a population of size N. If the sampling is carried out without replacement, the draws are not independent and so the resulting distribution is a hypergeometric distribution, not a binomial one.
The Binomial distribution is summarized as follows:
- notation: B(n, p), where n is the number of trials and p is the success probability in each trial
- support: $k\in\{0, 1, \ldots, n\}$, the number of successes
- p.m.f: $Pr[X=k]=\binom{n}{k}p^k(1-p)^{n-k}$
- mean: np
- variance: np(1-p)
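The p.m.f, mean and variance listed above can be verified directly for a small case. The sketch below assumes n=10 and p=0.3 as example values and uses `math.comb` (Python 3.8+) for the binomial coefficient.

```python
from math import comb

def binom_pmf(k, n, p):
    """Pr[X = k] for X ~ B(n, p)."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

n, p = 10, 0.3  # assumed example parameters
probs = [binom_pmf(k, n, p) for k in range(n + 1)]

total = sum(probs)                                         # should be 1
mean = sum(k * q for k, q in enumerate(probs))             # should be n*p
var = sum((k - mean) ** 2 * q for k, q in enumerate(probs))  # should be n*p*(1-p)
print(total, mean, var)
```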
If n is large enough, the skew of the distribution is not too great. In this case, a reasonable approximation to B(n, p) is given by the normal distribution $N(np, np(1-p))$. The approximation is useful because the p.m.f of the Binomial distribution becomes difficult to compute for large n.
- One rule for deciding whether the approximation is reasonable, i.e. whether n is large enough, is that both np and n(1-p) must be greater than 5. If both are greater than 15, the approximation should be good.
- A second rule is that for n>5, the normal approximation is adequate if:
  $\Big|\Big(\frac{1}{\sqrt{n}}\Big)\Big(\sqrt{\frac{1-p}{p}}-\sqrt{\frac{p}{1-p}}\Big)\Big|<0.3$
- Another commonly used rule holds that the normal approximation is appropriate only if everything within 3 standard deviations of its mean is within the range of possible values, that is if:
  $\mu\pm 3\sigma=np\pm 3\sqrt{np(1-p)}\in (0, n)$
- To improve the accuracy of the approximation, we usually apply a continuity correction to take into account that the binomial random variable is discrete while the normal random variable is continuous. In particular, the basic idea is to treat the discrete value k as the continuous interval from k-0.5 to k+0.5, so that $Pr[X\le k]$ is approximated by the normal c.d.f evaluated at k+0.5.
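The effect of treating k as the interval from k-0.5 to k+0.5 can be seen numerically. The following is a rough sketch, assuming n=100, p=0.5 and k=55 as example values; the normal c.d.f is written with `math.erf`, and the helper names are made up for illustration.

```python
from math import comb, erf, sqrt

def binom_cdf(k, n, p):
    """Exact Pr[X <= k] for X ~ B(n, p)."""
    return sum(comb(n, i) * p ** i * (1 - p) ** (n - i) for i in range(k + 1))

def normal_cdf(x, mu, sigma):
    """C.d.f. of the normal distribution N(mu, sigma^2) at x."""
    return 0.5 * (1.0 + erf((x - mu) / (sigma * sqrt(2.0))))

n, p, k = 100, 0.5, 55  # assumed example values
mu, sigma = n * p, sqrt(n * p * (1 - p))

exact = binom_cdf(k, n, p)
plain = normal_cdf(k, mu, sigma)            # no correction
corrected = normal_cdf(k + 0.5, mu, sigma)  # k treated as [k-0.5, k+0.5]
print(exact, plain, corrected)
```

For these values the corrected approximation is noticeably closer to the exact binomial probability than the uncorrected one.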
In addition, the Poisson distribution can be used to approximate the Binomial distribution when n is very large and p is small. A rule of thumb states that the Poisson distribution is a good approximation of the binomial distribution if n is at least 20 and p is smaller than or equal to 0.05, and an excellent approximation if n>=100 and np<=10: $B(n, p) \approx P(\lambda=np)$
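To get a feeling for how close the two distributions are, one can compare their p.m.f.s value by value. The sketch below assumes n=100 and p=0.05 (so $\lambda=np=5$) as example values; the helper names are hypothetical.

```python
from math import comb, exp, factorial

def binom_pmf(k, n, p):
    """Pr[X = k] for X ~ B(n, p)."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

def poisson_pmf(k, lam):
    """Pr[X = k] for X ~ P(lam)."""
    return lam ** k * exp(-lam) / factorial(k)

n, p = 100, 0.05  # assumed example values
lam = n * p

# Largest pointwise difference between the two p.m.f.s over k = 0..n.
max_diff = max(abs(binom_pmf(k, n, p) - poisson_pmf(k, lam)) for k in range(n + 1))
print(max_diff)  # a small number, well below 0.01 for these values
```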

Poisson Distribution

Poisson distribution: Let X be a discrete random variable taking values in the set of non-negative integers $\mathcal{N}=\{0, 1, 2, \ldots\}$ with probability:

$Pr(X=x)=\frac{\lambda^x}{x!} e^{-\lambda} \quad x = 0, 1, 2, \ldots$

My understanding. The Poisson distribution assigns non-uniform probabilities to the non-negative integers: the probability mass is concentrated around $\lambda$ and decays quickly for large x. My first intuition was to relate this to the observation that people asked to pick a random integer from 1 to 10 do not choose uniformly, but that analogy is misleading: the Poisson distribution models counts of independently occurring events, not subjective preferences for particular numbers.
The Poisson distribution is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time and/or space if these events occur with a known average rate and independently of the time since the last event.
The Poisson distribution is summarized as follows.
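- notation: $P(\lambda)$ , where $\lambda>0$ is a real number indicating the average number of events observed in the time interval. If events occur at rate $\lambda$ per unit time and the interval has length $T$, the parameter is $\lambda T$, which reduces to $\lambda$ for $T=1$ .
- support: $k\in\{0, 1, 2, 3, \ldots\}$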
- mean: $\lambda$
- variance: $\lambda$
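That the mean and the variance are both equal to $\lambda$ can be checked numerically from the p.m.f. This is a minimal sketch, assuming $\lambda=4$ and truncating the infinite support at k=60, where the remaining tail mass is negligible; the helper name `poisson_pmf_table` is made up for the example.

```python
from math import exp

def poisson_pmf_table(lam, kmax):
    """Return [Pr(X=0), ..., Pr(X=kmax)] via Pr(X=k) = Pr(X=k-1) * lam / k."""
    probs = [exp(-lam)]
    for k in range(1, kmax + 1):
        probs.append(probs[-1] * lam / k)
    return probs

lam = 4.0                           # assumed example rate
probs = poisson_pmf_table(lam, 60)  # truncated support, tail is negligible

total = sum(probs)
mean = sum(k * q for k, q in enumerate(probs))
var = sum((k - mean) ** 2 * q for k, q in enumerate(probs))
print(total, mean, var)  # approximately 1, lam, lam
```

The recurrence avoids computing large factorials and powers separately, which keeps the intermediate values well within floating-point range.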
Applications of Poisson distribution
- Telecommunication: telephone calls arriving in a system
- Management: customers arriving at a counter or call center
- Civil engineering: cars arriving at a traffic light

Generating Poisson random variables

algorithm poisson_random_number:

    init:
        Let $L\leftarrow e^{-\lambda}$, $k\leftarrow 0$, and $p\leftarrow 1$.

    do:
        $k\leftarrow k+1$
        Generate uniform random number u in [0, 1], and let $p\leftarrow p\times u$.
    while p>L.

    return k-1.
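The pseudocode above translates almost line by line into Python. The following is a sketch rather than a production sampler: the `rng` parameter, the seed and the choice $\lambda=3$ are assumptions for the example, and for large $\lambda$ the value $e^{-\lambda}$ underflows, so this form is only suitable for moderate rates.

```python
import math
import random

def poisson_random_number(lam, rng=random):
    """Draw one Poisson(lam) sample by multiplying uniforms until the product drops to e^{-lam}."""
    L = math.exp(-lam)
    k = 0
    p = 1.0
    while True:
        k += 1
        p *= rng.random()  # uniform random number in [0, 1)
        if p <= L:         # the do-while loop continues while p > L
            break
    return k - 1

rng = random.Random(1)  # fixed seed, chosen arbitrarily
lam = 3.0               # assumed example rate
samples = [poisson_random_number(lam, rng) for _ in range(50_000)]

mean = sum(samples) / len(samples)
var = sum((x - mean) ** 2 for x in samples) / len(samples)
print(mean, var)  # both should be close to lam
```

The expected number of uniform draws per sample grows linearly with $\lambda$, which is another reason to prefer other methods when the rate is large.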

References

Paola Sebastiani, A tutorial on probability theory
Mehryar Mohri, Introduction to Machine Learning - Basic Probability Notations.
