Links

PRESIDENTIAL COLUMN: Bayes for Beginners: Probability and Likelihood

C. Randy Gallistel, August 31, 2015; TAGS: C. RANDY GALLISTEL COLUMNS | DATA | EXPERIMENTAL PSYCHOLOGY | METHODOLOGY | STATISTICAL ANALYSIS

Distinguishing Likelihood(可能性) From Probability(概率)

The distinction between probability and likelihood is fundamentally important:

  • Probability attaches to possible outcomes(可能的产出);

    the possible outcomes to which probabilities attach are MECE(Mutually Exclusive and Collectively Exhaustive);
  • Likelihood attaches to hypotheses(假设).

    the hypotheses to which likelihoods attach are often neither(MECE);

Explaining this distinction is the purpose of this first column.

Possible results are MECE

Possible outcomes are MECE(Mutually Exclusive and Collectively Exhaustive).

Suppose we ask a subject to predict the outcome of each of 10 tosses of a coin.

There are only 11 possible outcomes(0 to 10 correct predictions).

The actual result will always be one and only one of the possible outcomes.

Thus, the probabilities that attach to the possible outcomes MUST sum to 1.

Hypotheses are often neither(MECE)

Hypotheses, unlike outcomes, are neither mutually exclusive nor collectively exhaustive.

Suppose that the first subject we test predicts 7 of the 10 outcomes correctly.

  • I might hypothesize that the subject just guessed,
  • and you might hypothesize that the subject may be somewhat clairvoyant,

    by which you mean that the subject may be expected to correctly predict the results,

    at slightly greater than chance rates over the long run.

    These are different hypotheses, but they are not mutually exclusive,

    because you hedged when you said "may be.",

    You thereby allowed your hypothesis include mine.
  • In technical terminology, my hypothesis is nested within yours.

    Someone else might hypothesize that the subject is strongly clairvoyant and that the observed result underestimates the probability that her next prediction will be correct.

    Another person could hypothesize something else altogether.

    There is no limit to the hypotheses one might entertain.

The set of hypotheses to which we attach likelihoods is limited by our capacity to dream them up. In practice, we can rarely be confident that we have imagined all the possible hypotheses. Our concern is to estimate the extent to which the experimental results affect the relative likelihood of the hypotheses we and others currently entertain. Because we generally do not entertain the full set of alternative hypotheses and because some are nested within others, the likelihoods that we attach to our hypotheses do not have any meaning in and of themselves; only the relative likelihoods — that is, the ratios of two likelihoods — have meaning.

"Forwards" and "Backwards"

The difference between probability and likelihood becomes clear,

when one uses the probability distribution function in general-purpose programming languages.

  • In the present case, the function we want is the \(binomial\ distribution\ function\).

    It is called \(BINOM.DIST\) in the most common spreadsheet software and \(binopdf\) in the language I use. It has three input arguments: the number of successes, the number of tries, and the probability of a success. When one uses it to compute probabilities, one assumes that the latter two arguments (number of tries and the probability of success) are given. They are the parameters of the distribution. One varies the first argument (the different possible numbers of successes) in order to find the probabilities that attach to those different possible results (top panel of Figure 1). Regardless of the given parameter values, the probabilities always sum to 1.
  • By contrast, in computing a likelihood function, one is given the number of successes (7 in our example) and the number of tries (10). In other words, the given results are now treated as parameters of the function one is using. Instead of varying the possible results, one varies the probability of success (the third argument, not the first argument) in order to get the binomial likelihood function (bottom panel of Figure 1). One is running the function backwards, so to speak, which is why likelihood is sometimes called reverse probability.

The information that the binomial likelihood function conveys is extremely intuitive. It says that given that we have observed 7 successes in 10 tries, the probability parameter of the binomial distribution from which we are drawing (the distribution of successful predictions from this subject) is very unlikely to be 0.1; it is much more likely to be 0.7, but a value of 0.5 is by no means unlikely. The ratio of the likelihood at p = .7, which is .27, to the likelihood at p = .5, which is .12, is only 2.28. In other words, given these experimental results (7 successes in 10 tries), the hypothesis that the subject's long-term success rate is 0.7 is only a little more than twice as likely as the hypothesis that the subject's long-term success rate is 0.5.

In summary, the likelihood function is a Bayesian basic.

To understand likelihood, you must be clear about the differences between probability and likelihood:

Probabilities attach to results; likelihoods attach to hypotheses.

In data analysis, the "hypotheses" are most often a possible value or a range of possible values for the mean of a distribution, as in our example.

  • The results to which probabilities attach are MECE(Mutually Exclusive and Collectively Exhaustive);
  • the hypotheses to which likelihoods attach are often neither;

    the range in one hypothesis may include the point in another, as in our example.

    To decide which of two hypotheses is more likely given an experimental result,

    we consider the ratios of their likelihoods. This ratio, the relative likelihood ratio, is called the "Bayes Factor".

SciTech-Mathmatics-Probability+Statistics: Distinguishing(区分) Probability(attaches to Outcomes MECE) from Likelihood(attaches Hypothesis are often neither MECE)的更多相关文章

  1. Probability&Statistics 概率论与数理统计(1)

    基本概念 样本空间: 随机试验E的所有可能结果组成的集合, 为E的样本空间, 记为S 随机事件: E的样本空间S的子集为E的随机事件, 简称事件, 由一个样本点组成的单点集, 称为基本事件 对立事件/ ...

  2. Study note for Continuous Probability Distributions

    Basics of Probability Probability density function (pdf). Let X be a continuous random variable. The ...

  3. An Introduction to Measure Theory and Probability

    目录 Chapter 1 Measure spaces Chapter 2 Integration Chapter 3 Spaces of integrable functions Chapter 4 ...

  4. Probability Concepts

    Probability Concepts Unconditional probability and Conditional Probability Unconditional Probability ...

  5. 【概率论】1-1:概率定义(Definition of Probability)

    title: [概率论]1-1:概率定义(Definition of Probability) categories: Mathematic Probability keywords: Sample ...

  6. Normal Probability Plots|outlier

    6.4 Assessing Normality; Normal Probability Plots The normal probability plot is a graphical techniq ...

  7. [Math Review] Statistics Basics: Main Concepts in Hypothesis Testing

    Case Study The case study Physicians' Reactions sought to determine whether physicians spend less ti ...

  8. PHP7函数大全(4553个函数)

    转载来自: http://www.infocool.net/kb/PHP/201607/168683.html a 函数 说明 abs 绝对值 acos 反余弦 acosh 反双曲余弦 addcsla ...

  9. How do I learn machine learning?

    https://www.quora.com/How-do-I-learn-machine-learning-1?redirected_qid=6578644   How Can I Learn X? ...

  10. [C2P3] Andrew Ng - Machine Learning

    ##Advice for Applying Machine Learning Applying machine learning in practice is not always straightf ...

随机推荐

  1. 使用sealos快速搭建kubernetes集群!!!

    什么是sealos? Sealos 是一款基于 Kubernetes 的轻量级操作系统,专为云原生环境设计,主要用于快速部署和管理 Kubernetes 集群.它采用"容器化内核" ...

  2. Django开发过程中遇到的问题和解决方案

    1.django向数据库中添加中文时报错 解决方案:创建数据库的时候设置编码格式 2.django的信号使用无法触发信号里的内容 解决方案:在django 1.7后,使用信号时候需要在应用配置类中的r ...

  3. 本地编译WPF框架源码

    最近,在 排查WPF框架触摸失效和书写 Stroke 绘制的问题,常常需要查看WPF 的源码,由于项目组用到的框架大部分都是 .netFramwork 的,只能通过VS的F12按键反编译或者Dnspy ...

  4. 关于dpnet项目

    关于dpnet项目 dpnet是我开源的一个轻量异步框架,主要用于利用多核优势执行异步任务,处理异步IO. 起初并没有独立的dpnet项目,所有功能集成在另一个项目dplua中. 提到异步,实现方案必 ...

  5. Golang相关环境变量

    GOROOT: GO语言的安装路径,linux系统下一般是/usr/local/go GOPATH: 程序员自己的go源码路径,比如开发一个Helloworld的项目,那么它的代码文件夹就应该放在GO ...

  6. 「Log」2023.8.24 小记

    序幕 \(\texttt{7:20}\):才到校,昨天调题整半夜去了,没想到这么晚来的人也少. 按惯例整理博客. 补题,补串串. \(\color{blueviolet}{P2444\ [POI200 ...

  7. RDP远程桌面连接服务

    漏洞原理 Windows远程桌面内核驱动处理MS_T120通道时(管理数据传输时)没有对数据的数据包进行验证限制,没有将信道的指针删除(之后会回来访问恶意的数据包),攻击者无需认证即可向RDP服务(3 ...

  8. WinForms中实现Adobe PDF Reader实现旋转PDF功能

    实现效果: 问题点:Adobe PDF Reader中并没有可以直接旋转的方法 LoadFile 加载文件,文件URL地址 GotoFirstPage 到第一页 GotoLastPage 到最后一页 ...

  9. Mac os的防火墙导致开的热点手机连不上

    在工位上用Mac给手机开热点用,结果今天手机一直连不上Mac开的热点,最后把Mac的防火墙关了就能让手机连上了,连上了再把防火墙打开也不影响连接.

  10. 如何最大化客户生命周期价值?APMDR 模型在袋鼠云的落地实践

    相信大家都认可一个观点:不论是 To B 还是 To C,用户是企业的核心资源,是互联网产品中最重要的价值之一.因此,深入挖掘用户价值成为现在大部分企业运营的关键. 之前我们为大家介绍过如何利用 RF ...