Statistics : Data Distribution
1、Normal distribution

In probability theory, the normal (or Gaussian or Gauss or Laplace–Gauss) distribution is a very common continuous probability distribution. Normal distributions are important in statistics and are often used in the natural and social sciences to represent real-valued random variables whose distributions are not known. A random variable with a Gaussian distribution is said to be normally distributed and is called a normal deviate.
The normal distribution is useful because of the central limit theorem. In its
most general form, under some conditions (which include finite variance), it
states that averages of samples of independently drawn random variables
converge in distribution to the normal, that is, they become approximately
normally distributed when the number of observations is sufficiently large.
Physical quantities that are expected to be the sum of many independent
processes (such as measurement errors) often have distributions that are nearly
normal. Moreover, many results and methods (such as propagation of uncertainty
and least squares parameter fitting) can be derived analytically in explicit
form when the relevant variables are normally distributed.
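The central limit theorem described above can be checked with a small simulation. The following is a minimal sketch, not part of the original text: it assumes Uniform(0, 1) draws as the underlying distribution, and the function name `sample_means`, the sample sizes and the seed are all illustrative choices.

```python
import random
import statistics


def sample_means(n_obs, n_samples, seed=0):
    """Return n_samples averages, each taken over n_obs Uniform(0, 1) draws."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    return [
        statistics.fmean(rng.random() for _ in range(n_obs))
        for _ in range(n_samples)
    ]


means = sample_means(n_obs=50, n_samples=10_000)

# Uniform(0, 1) has mean 1/2 and variance 1/12, so the averages should
# cluster around 0.5 with standard deviation sqrt((1/12) / 50), about 0.0408.
center = statistics.fmean(means)
spread = statistics.stdev(means)

# For a normal distribution, roughly 68.3% of values fall within one
# standard deviation of the mean; the averages should come close to that.
within_one_sd = sum(abs(m - center) <= spread for m in means) / len(means)

print(f"mean of averages:    {center:.4f}")
print(f"stdev of averages:   {spread:.4f}")
print(f"share within 1 s.d.: {within_one_sd:.3f}")
```

Although a single uniform draw is far from bell-shaped, the averages of 50 draws should already behave much like a normal variable, which is the behavior the theorem predicts.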
The normal distribution is sometimes informally called the bell curve. However,
many other distributions are bell-shaped (such as the Cauchy, Student's t-, and
logistic distributions).
link:https://en.wikipedia.org/wiki/Normal_distribution
https://www.mathsisfun.com/data/standard-normal-distribution.html
2、Poisson distribution

In probability theory and statistics, the Poisson distribution (in English
often rendered /ˈpwɑːsɒn/), named after French mathematician Siméon Denis
Poisson, is a discrete probability distribution that expresses the probability
of a given number of events occurring in a fixed interval of time or space if
these events occur with a known constant rate and independently of the time
since the last event. The Poisson distribution can also be used for the number
of events in other specified intervals such as distance, area or volume.
For instance, an individual keeping track of the amount of mail they receive
each day may notice that they receive an average of 4 letters per day. If
receiving any particular piece of mail does not affect the arrival times of
future pieces of mail, i.e., if pieces of mail from a wide range of sources
arrive independently of one another, then a reasonable assumption is that the
number of pieces of mail received in a day obeys a Poisson distribution. Other
examples that may follow a Poisson distribution include the number of phone
calls received by a call center per hour and the number of decay events per
second from a radioactive source.
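To make the mail example concrete, the sketch below evaluates the Poisson probability mass function P(X = k) = e^(-λ) λ^k / k! with λ = 4 letters per day. The helper name `poisson_pmf` and the cutoff at 10 letters are illustrative assumptions, not part of the original text.

```python
import math


def poisson_pmf(k, rate):
    """P(X = k) for a Poisson variable whose mean number of events is `rate`."""
    return math.exp(-rate) * rate**k / math.factorial(k)


rate = 4  # average letters per day, taken from the mail example

# Probabilities of receiving 0, 1, ..., 10 letters on a given day.
probs = [poisson_pmf(k, rate) for k in range(11)]
for k, p in enumerate(probs):
    print(f"P(X = {k:2d}) = {p:.4f}")

p_no_mail = probs[0]              # equals e^(-4)
p_more_than_ten = 1 - sum(probs)  # remaining tail probability
print(f"P(X > 10) = {p_more_than_ten:.4f}")
```

With an integer rate of 4, the two most likely outcomes are 3 and 4 letters, each with probability of about 0.195.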
link:https://en.wikipedia.org/wiki/Poisson_distribution
https://www.umass.edu/wsp/resources/poisson/
3、Chi-squared distribution

In probability theory and statistics, the chi-square distribution (also
chi-squared or χ²-distribution) with k degrees of freedom is the distribution
of a sum of the squares of k independent standard normal random variables. The
chi-square distribution is a special case of the gamma distribution and is one
of the most widely used probability distributions in inferential statistics,
notably in hypothesis testing or in construction of confidence intervals. When
it is being distinguished from the more general noncentral chi-square distribution,
this distribution is sometimes called the central chi-square distribution.
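The definition above (a sum of squares of k independent standard normal variables) translates directly into a simulation. This is a minimal sketch with illustrative choices: k = 5, 20,000 draws, a fixed seed, and the hypothetical helper name `chi_square_draw`. It relies on the standard facts that a chi-square variable with k degrees of freedom has mean k and variance 2k.

```python
import random
import statistics


def chi_square_draw(k, rng):
    """One chi-square(k) draw: the sum of k squared standard normal draws."""
    return sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(k))


rng = random.Random(0)  # fixed seed so the run is repeatable
k = 5                   # degrees of freedom (illustrative choice)
draws = [chi_square_draw(k, rng) for _ in range(20_000)]

sample_mean = statistics.fmean(draws)
sample_var = statistics.variance(draws)

# Theory: mean = k and variance = 2k for a chi-square with k degrees of freedom.
print(f"sample mean     {sample_mean:.3f}  (theory {k})")
print(f"sample variance {sample_var:.3f}  (theory {2 * k})")
```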
The chi-square distribution is used in the common chi-square tests for goodness
of fit of an observed distribution to a theoretical one, the independence of
two criteria of classification of qualitative data, and in confidence interval
estimation for a population standard deviation of a normal distribution from a
sample standard deviation. Many other statistical tests also use this
distribution, such as Friedman's analysis of variance by ranks.
link:https://en.wikipedia.org/wiki/Chi-squared_distribution
http://mathworld.wolfram.com/Chi-SquaredDistribution.html
https://www.itl.nist.gov/div898/handbook/eda/section3/eda3666.html
4、Beta distribution

In probability theory and statistics, the beta distribution is a family of
continuous probability distributions defined on the interval [0, 1],
parametrized by two positive shape parameters, denoted by α and β, that appear
as exponents of the random variable and control the shape of the distribution.
It is a special case of the Dirichlet distribution.
The beta distribution has been applied to model the behavior of random
variables limited to intervals of finite length in a wide variety of
disciplines.
In Bayesian inference, the beta distribution is the conjugate prior probability
distribution for the Bernoulli, binomial, negative binomial and geometric
distributions. For example, the beta distribution can be used in Bayesian
analysis to describe initial knowledge concerning probability of success such
as the probability that a space vehicle will successfully complete a specified
mission. The beta distribution is a suitable model for the random behavior of
percentages and proportions.
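Conjugacy means that a Beta(α, β) prior on a success probability, combined with observed Bernoulli or binomial outcomes, yields a Beta(α + successes, β + failures) posterior. The sketch below illustrates this update. The uniform Beta(1, 1) prior and the mission record of 9 successes and 1 failure are hypothetical numbers chosen for illustration, as are the helper names `beta_update` and `beta_mean`.

```python
def beta_update(alpha, beta, successes, failures):
    """Posterior Beta parameters after observing Bernoulli/binomial outcomes."""
    return alpha + successes, beta + failures


def beta_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution."""
    return alpha / (alpha + beta)


# Hypothetical setup: a uniform Beta(1, 1) prior on the probability of
# mission success, followed by 9 successful missions and 1 failure.
prior = (1.0, 1.0)
posterior = beta_update(*prior, successes=9, failures=1)

print("prior mean:          ", beta_mean(*prior))
print("posterior parameters:", posterior)
print("posterior mean:      ", round(beta_mean(*posterior), 4))
```

The posterior mean (10/12, about 0.83) sits between the prior mean of 0.5 and the observed success rate of 0.9, and moves toward the observed rate as more data accumulate.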
The usual formulation of the beta distribution is also known as the beta
distribution of the first kind, whereas beta distribution of the second kind is
an alternative name for the beta prime distribution.
link:https://en.wikipedia.org/wiki/Beta_distribution