5 min read • January 2, 2023

Josh Argo

Brianna Bukowski

Jed Quiaoit

An **unbiased** **estimator** is one whose estimates are, on average, equal to the true population parameter. This means that if you repeatedly draw samples from the population and compute an estimate from each one, the average of those estimates will be equal to the true population parameter, even though any single estimate may miss it. ⚖️

For example, if you wanted to estimate the mean height of all the students in your school, you could take a random sample of students and measure their heights. Any single sample mean (the sample statistic) will probably differ a little from the mean height of the entire school (the population parameter). If the sample means from many such samples average out to the true mean height, then your estimator is unbiased. On the other hand, if the sample mean consistently underestimates or overestimates the true mean height of the school, then your estimator is biased.
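To see the difference between an unbiased and a biased estimator, here is a minimal simulation sketch of the height example. The population of 2,000 heights, the sample size of 30, and the flawed "keep the tallest half" sampling method are all assumptions made up for illustration.

```python
import random
import statistics

random.seed(1)

# Hypothetical population: heights (cm) of 2,000 students (made-up data).
population = [random.gauss(168, 9) for _ in range(2000)]
true_mean = statistics.fmean(population)

def random_sample(n=30):
    # Every student is equally likely to be chosen.
    return random.sample(population, n)

def tall_leaning_sample(n=30):
    # Flawed method (assumed for illustration): draw 2n students at random,
    # then keep only the n tallest.
    return sorted(random.sample(population, 2 * n))[n:]

def average_estimate(draw_sample, repetitions=2000):
    # Take many samples, compute the mean of each, then average those means.
    estimates = [statistics.fmean(draw_sample()) for _ in range(repetitions)]
    return statistics.fmean(estimates)

unbiased_avg = average_estimate(random_sample)
biased_avg = average_estimate(tall_leaning_sample)

print(f"true population mean:            {true_mean:.2f}")
print(f"average of random-sample means:  {unbiased_avg:.2f}")
print(f"average of tall-leaning means:   {biased_avg:.2f}")
```

With random sampling, the average of the sample means should land very close to the true mean. With the tall-leaning method, the estimates sit above the true mean no matter how many samples you take, which is what bias looks like.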

An **estimator** is **unbiased** if the mean of its sampling distribution is equal to the population parameter. For example, if the mean of all our possible sample means (x̅) is equal to the population mean (𝝁), or if the average of our sample proportions (p̂) is equal to our population proportion (p), then our estimator is unbiased! 😌
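The same idea can be checked for proportions. The sketch below assumes a true population proportion of 0.35 and samples of size 50 (both values are invented for illustration), then averages many sample proportions.

```python
import random
import statistics

random.seed(2)

p = 0.35            # assumed true population proportion
n = 50              # assumed sample size
repetitions = 4000  # number of repeated samples

def sample_proportion():
    # Each individual is a "success" with probability p.
    successes = sum(1 for _ in range(n) if random.random() < p)
    return successes / n

p_hats = [sample_proportion() for _ in range(repetitions)]
mean_p_hat = statistics.fmean(p_hats)

print(f"population proportion p:        {p}")
print(f"average of sample proportions:  {mean_p_hat:.4f}")
print(f"smallest and largest p-hat:     {min(p_hats):.2f}, {max(p_hats):.2f}")
```

Individual sample proportions land on both sides of 0.35, but their average should be very close to it.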

It is impossible to have no variability, due to the nature of random sampling. This is because the sample you are using to make the inference is only a small subset of the entire population, and so it is subject to sampling error. Therefore, there will always be some level of uncertainty or variability in the estimate when you use an estimator to make inferences about a population parameter.

However, a larger sample size will reduce the variability of a sampling distribution! 🗽
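A quick simulation can show the effect of sample size on variability. This sketch uses a made-up population of 10,000 values and compares the spread (standard deviation) of the sample means for samples of size 10, 40, and 160.

```python
import random
import statistics

random.seed(3)

# Hypothetical population of 10,000 values (made-up data).
population = [random.gauss(50, 20) for _ in range(10_000)]

def spread_of_sample_means(n, repetitions=1000):
    # Standard deviation of the sample means across many samples of size n.
    means = [statistics.fmean(random.sample(population, n))
             for _ in range(repetitions)]
    return statistics.stdev(means)

spreads = {n: spread_of_sample_means(n) for n in (10, 40, 160)}

for n, spread in spreads.items():
    print(f"n = {n:>3}: spread of sample means = {spread:.2f}")
```

Each time the sample size is multiplied by 4, the spread of the sample means should drop to roughly half, which matches the usual rule that variability shrinks in proportion to the square root of the sample size. The variability gets smaller but never reaches zero.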

Skewness describes how asymmetric a distribution is. A distribution is symmetric if it is roughly the same on both sides of the center, like a bell curve. A distribution is skewed if it is not symmetric, with a longer tail stretching out on one side. For example, if a distribution is skewed to the left, its long tail extends to the left: most of the values are clustered on the right side of the distribution, with fewer values trailing off on the left side. 🔔

A sample that is equally spread out around its mean is not necessarily unbiased, and a skewed sample is not necessarily biased, since the shape of the sample may simply reflect the shape of the population. Bias comes mainly from other factors, such as the sampling method or the way that the sample was collected.

A good illustration for bias and variability is a bullseye. Bias measures how accurate the archer is (how close the shots are, on average, to the bullseye), while variability measures how consistent he/she is (how tightly the shots cluster together). See the illustrations below for different circumstances regarding bias and variability: 🎯

In this analogy, the bullseye represents the true population parameter, and the archer's shots represent the estimates produced by the estimator.

If the archer is accurate but not very consistent, their shots will be centered on the bullseye but scattered widely around it. This would correspond to a situation where the estimator has low bias but high variability.

On the other hand, if the archer is very consistent but not accurate, their shots will all be close to each other but may be far from the bullseye. This would correspond to a situation where the estimator has high bias but low variability.
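The bullseye picture can also be sketched in code. In this simplified, one-dimensional version, the bullseye sits at 0 and each hypothetical archer has an aim offset (bias) and a scatter (variability); all four archers and their numbers are assumptions chosen for illustration.

```python
import random
import statistics

random.seed(4)

BULLSEYE = 0.0  # stands in for the true population parameter

def shoot(offset, scatter, shots=2000):
    # Each shot lands around (BULLSEYE + offset) with the given scatter.
    return [random.gauss(BULLSEYE + offset, scatter) for _ in range(shots)]

# Four hypothetical archers: (aim offset, scatter).
archers = {
    "low bias, low variability": shoot(0, 1),
    "low bias, high variability": shoot(0, 5),
    "high bias, low variability": shoot(4, 1),
    "high bias, high variability": shoot(4, 5),
}

summary = {}
for label, shots in archers.items():
    bias = statistics.fmean(shots) - BULLSEYE  # average miss
    variability = statistics.stdev(shots)      # spread of the shots
    summary[label] = (bias, variability)
    print(f"{label:<28} bias = {bias:5.2f}, variability = {variability:4.2f}")
```

Bias is measured as how far the average shot is from the bullseye, and variability as how spread out the shots are. The two are separate: an archer can have either one without the other.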

Suppose that you are asked to estimate the mean income of all the households in your town. You decide to use a sample of 100 households, selected using a random sampling method. After collecting the data, you calculate the sample mean income to be $50,000.

© 2023 Fiveable Inc. All rights reserved.