SciTech-Mathmatics-Probability+Statistics-Population-Sample: Sample Proportion vs. Sample Mean: The Difference

Sample Proportion vs. Sample Mean: The Difference

BY ZACH BOBBITTPOSTED ON MAY 5, 2021

Two terms that are often used in statistics are Sample Proportion and Sample Mean.

Here's the difference between the two terms:

  • Sample Proportion: The proportion of observations in a sample with a certain characteristic.

    Often denoted $\large p̂ $, It is calculated as follows:

    \(\large \begin{array}{lrl} \\
    & p̂= & \frac{x}{n} \\
    where: & & \\
    & x: & \text{ The } number \text{ of } observations \text{ in the } sample, \\
    & & \text{ with } a\ certain\ characteristic. \\
    & n: & \text{ The } total\ number \text{ of } observations \text{ in the } sample \\
    \end{array}\)

  • Sample mean : The average value in a sample.

    Often denoted $\large x $, it is calculated as follows:

    \(\large \begin{array}{lrl} \\
    & x= & \frac{\sum{x_i} }{n} \\
    where: & & \\
    & \sum: & \text{ A symbol that means } sum \\
    & x_i: & \text{ The value of the } i \text{th observation in the sample } \\
    & n: & \text{ The sample size } \\
    \end{array}\)

When to Use Each(Proportion Vs. Mean)

The sample proportion and sample mean are used for different reasons:

  • Sample proportion : Used to understand the proportion of observations in a sample that have a certain characteristic.

    For example, we could use the sample proportion in each of the following scenarios:

    • Politics : Researchers might survey 500 individuals in a certain city to understand what proportion of residents support a certain candidate in an upcoming election.
    • Biology : Biologists may collect data on 100 sea turtles,

      to understand what proportion of them have experienced damage from pollution.
    • Sports : A journalist may survey 1,000 college basketball players,

      to understand what proportion of them shoot left-handed.
  • Sample mean : Used to understand the average value in a sample.

    For example, we could use the sample mean in each of the following scenarios:

    • Demographics : Economists may collect data on 5,000 households in a certain city to estimate the average annual household income.
    • Botany : A botanist may take measurements on 50 plants from the same species to estimate the average height of the plant in inches.
    • Nutrition : A nutritionist may survey 100 people at a hospital to estimate the average number of calories that residents eat per day.

Depending on the question of interest, it might make more sense to use the sample proportion or the sample mean to answer the question.

Use Sample's Statistics to Estimate Population Parameters

Both the sample proportion and the sample mean are used to estimate population parameters.

Sample Proportion as an Estimate

We use the sample proportion to estimate a population proportion.

For example, we might be interested in understanding,

what proportion of residents in a certain city support a new law.

  • Since it would be too costly and time-consuming to survey all 40,000,000 residents in the city, we instead survey 500 and calculate the proportion of residents in the sample who support the new law.

    • We then use this sample proportion as our best estimate of the proportion of residents in the entire city who suppose the new law.
    • However, since $\large \bm{ it's \ unlikely } $ that our sample proportion \(\large \bm{ exactly\ matches }\) the population proportion, we often use a \(\large \bm{ confidence\ interval }\) for a proportiona range of values that we believe $\large \bm{ contains } $ the true population proportion with a certain level of confidence.

Sample Mean as an Estimate

We use the sample mean to estimate a population mean.

For example, we might be interested in understanding,

the average height of a certain species of plants.

  • Since it would be too costly and time-consuming to measure the height of all 10,000 plants in a certain region, we instead measure the height of 150 plants and use the sample mean as our best estimate of the population mean.
  • However, since $\large \bm{ it's \ unlikely } $ that our sample mean \(\large \bm{ exactly\ matches }\) the population mean, we often use a \(\large \bm{ confidence\ interval }\) for a meana range of values that we believe $\large \bm{ contains } $ the true population mean with a certain level of confidence.

Confidence Interval for a Proportion

A confidence interval for a proportion is a range of values that is likely to **contain a population proportion with a certain level of confidence.

This tutorial explains the following:

  • The motivation for creating a confidence interval for a proportion.
  • The formula to create a confidence interval for a proportion.
  • An example of how to calculate a confidence interval for a proportion.
  • How to interpret a confidence interval for a proportion.

Confidence Interval for a Proportion: Motivation

The reason to create a confidence interval for a proportion is to capture our uncertainty when estimating a population proportion.

For example, suppose we want to estimate the proportion of people in a certain county that are in favor of a certain law.

  • Since there are thousands of residents in the county, it would be too costly and time-consuming to go around and ask each resident about their stance on the law. Instead, we might select a simple random sample of residents and ask each one whether or not they support the law:



    Population proportion estimation example

Since we select a random sample of residents, $\large \bm{ there\ is\ no\ guarantee } $ that the proportion of residents in the sample who are in favor of the law $\large \bm{ will\ exactly\ match } $ the proportion of residents in the entire county who are in favor of the law.

So,$\large \bm{ to\ capture\ this\ uncertainty } $ we can $\large \bm{ create\ a\ confidence\ interval } $ that contains a range of values that $\large \bm{ are\ likely\ to } $ contain the true proportion of residents who are in favor of the law in the entire county.

Confidence Interval for a Proportion: Formula

We use the following formula to calculate a confidence interval for a population proportion:

$\large \bm{ Confidence\ Interval }= p +/- z* \sqrt{ \frac{p(1-p)}{n} } $

where:

\(\large p\): sample proportion

\(\large z\): the chosen z-value

\(\large n\): sample size

The z-value that you will use is dependent on the confidence level that you choose.

The following table shows the z-value that corresponds to popular confidence level choices:

Confidence Level z-value
0.90 1.645
0.95 1.96
0.99 2.58

Notice that higher confidence levels correspond to larger z-values,

which leads to wider confidence intervals.

This means that, for example, a 95% confidence interval will be wider than a 90% confidence interval for the same set of data.

Related: What is Considered a Good Confidence Interval?

Confidence Interval for a Proportion: Example

Suppose we want to estimate the proportion of residents in a county that are in favor of a certain law. We select a random sample of 100 residents and ask them about their stance on the law. Here are the results:

$\large \text{ Sample size } n $ = 100

$\large \text{ Proportion in favor of law } p $ = 0.56

Here is how to find various confidence intervals for the population proportion:

Then $\large \sqrt{ \frac{0.56(1-0.56)}{100} } = 0.0496 $

Confidence Level z-value Confidence Interval
0.90 1.645 $ [0.478, 0.642] \leftarrow 0.56 +/- 1.645*( \sqrt{ \frac{0.56(1-0.56)}{100} } ) $
0.95 1.96 $ [0.463, 0.657] \leftarrow 0.56 +/- 1.96 *( \sqrt{ \frac{0.56(1-0.56)}{100} } ) $
0.99 2.58 $ [0.432, 0.688] \leftarrow 0.56 +/- 2.58 *( \sqrt{ \frac{0.56(1-0.56)}{100} } ) $

Note: You can also find these confidence intervals by using the Confidence Interval for Proportion Calculator.

Confidence Interval for a Proportion: Interpretation

The way we would interpret a confidence interval is as follows:

There is a 95% chance that the confidence interval of [0.463, 0.657] contains the true population proportion of residents who are in favor of this certain law.

Another way of saying the same thing is that there is only a 5% chance that the true population proportion lies outside of the 95% confidence interval.

That is, there's only a 5% chance that the true proportion of residents in the county that support the law is less than 46.3% or greater than 65.7%.

Additional Resources

Confidence Interval for Proportion Calculator

Confidence Interval for Mean Calculator

SciTech-Mathmatics-Probability+Statistics-Population-Sampling of Region of Population : Proportion + Mean + Confidence Interval的更多相关文章

  1. [Math Review] Statistics Basic: Sampling Distribution

    Inferential Statistics Generalizing from a sample to a population that involves determining how far ...

  2. Probability&Statistics 概率论与数理统计(1)

    基本概念 样本空间: 随机试验E的所有可能结果组成的集合, 为E的样本空间, 记为S 随机事件: E的样本空间S的子集为E的随机事件, 简称事件, 由一个样本点组成的单点集, 称为基本事件 对立事件/ ...

  3. 加州大学伯克利分校Stat2.3x Inference 统计推断学习笔记: Section 1 Estimating unknown parameters

    Stat2.3x Inference(统计推断)课程由加州大学伯克利分校(University of California, Berkeley)于2014年在edX平台讲授. PDF笔记下载(Acad ...

  4. mysql----Nested SELECT Quiz

     Nested SELECT quiz bbc name region area population gdp Afghanistan South Asia 652225 26000000   Alb ...

  5. R语言:常用统计检验

    统计检验是将抽样结果和抽样分布相对照而作出判断的工作.主要分5个步骤: 建立假设 求抽样分布 选择显著性水平和否定域 计算检验统计量 判定 -- 百度百科 假设检验(hypothesis test)亦 ...

  6. BAYESIAN STATISTICS AND CLINICAL TRIAL CONCLUSIONS: WHY THE OPTIMSE STUDY SHOULD BE CONSIDERED POSITIVE(转)

    Statistical approaches to randomised controlled trial analysis The statistical approach used in the ...

  7. Sampling and Estimation

    Sampling and Estimation Sampling Error Sampling error is the difference between a sample statistic(t ...

  8. [Math Review] Statistics Basic: Estimation

    Two Types of Estimation One of the major applications of statistics is estimating population paramet ...

  9. 【转载】Recommendations with Thompson Sampling (Part II)

    [原文链接:http://engineering.richrelevance.com/recommendations-thompson-sampling/.] [本文链接:http://www.cnb ...

  10. Study notes for Discrete Probability Distribution

    The Basics of Probability Probability measures the amount of uncertainty of an event: a fact whose o ...

随机推荐

  1. Python3循环结构(二) while循环

    Python3 while循环 当循环次数无界时通常会使用while循环. 1.使用while循环输出九九乘法表 i=1 while i < 10: j = 1 while j < i + ...

  2. CSS 魔法与布局技巧

    CSS 布局与视觉效果常用实践指南 在我一篇随笔中其实有说到十大布局,里面有提到 flex 布局.grid 布局.响应式布局,不过没有提到容器查询这个,现在说下这三个布局然后穿插下容器查询把. 1️⃣ ...

  3. HarmonyOS NEXT开发实战教程-记账app

    今天分享的实战教程是一款记账app,最近分享的项目都是纯页面,没有服务端,没有数据接口,因为鸿蒙开发主要就是写页面,都是前端嘛.如果有友友想要完整的项目可以找幽蓝君定制,想学服务端开发的话幽蓝君也可以 ...

  4. vue3 基础-data-methods-computed-watch

    本篇来简单了解 vue 的数据, 方法, 计算属性和监听器等相关内容. data ( ) vue 里面的 data ( ) 函数返回一些能供模板 template 直接使用的数据, 以变量的方式进行 ...

  5. SQL 强化练习 (二)

    继续 sql 搞起来, 面向过程来弄, 重点是分析的思路, 涉及的的 left join, inner join, group by +_ having, case when ... 等场景, 也是比 ...

  6. 【.NET必读】RabbitMQ 4.0+重大变更!C#开发者必须掌握的6大升级要点

    RabbitMQ 作为一款广受欢迎的消息队列中间件,近年来从 3.x 版本升级到 4.0+,带来了显著的功能增强和架构调整.与此同时,其官方 C# 客户端也从 6.x 版本跃升至 7.0,引入了全新的 ...

  7. 基于注解@Aspect实现Spring AOP

    摘要:基于注解@Aspect实现Spring AOP切面编程. 目录 基于注解@Aspect实现Spring AOP 小结 Reference 基于注解@Aspect实现Spring AOP   Sp ...

  8. uniapp中使用mqtt.js的踩坑记录

    最近在uniapp的vue3.0版本中使用mqtt.js库时遇到了一些坑,经过亲身踩坑,现在把实际能够实现在uniapp的app端能够使用mqtt.js的方法步骤记录如下: 一.安装 首先安装mqtt ...

  9. 开源直播课丨高效稳定易用的数据集成框架——ChunJun类加载原理与实现

    一.直播介绍 前几期,我们为大家分享了ChunJun的数据还原.Hive事务表及传输模块的一些内容,本期我们为大家分享ChunJun类加载原理与实现. 本次直播我们将从Java 类加载器解决类冲突基本 ...

  10. ET框架对MongoDB的使用

    一:本地测试: 1:加载DB组件 2:调整用户ID :  C2G_LoginGateHandler中创建玩家时id调整.(每次重启服务端创建小人ID是一样的,插入数据库会覆盖掉上传插入的数据) 3:在 ...