Skip to main content

About this free course

Download this course

Share this free course

Data analysis: hypothesis testing
Data analysis: hypothesis testing

Start this free course now. Just create an account and sign in. Enrol and complete the course for a free statement of participation or digital badge if available.

4.1 Z-test: known population standard deviation

The z-test is appropriate when you have access to the population standard deviation. This often occurs when you have comprehensive industry reports or extensive historical data. Key characteristics of the z-test include:

  • It uses the known population standard deviation in calculations.
  • The test statistic follows a standard normal distribution (bell-shaped curve).
  • It generally provides more precise results when population parameters are available.

Using the z-statistic formula, this can be explained:

cap z equals x bar minus mu divided by left parenthesis sigma divided by square root of n right parenthesis

x macron  = sample mean

mu  = population mean under the null hypothesis

sigma  = population standard deviation

n = sample size

This formula reflects the z-test’s characteristics by directly incorporating the known population standard deviation (σ) and transforming the difference between the sample mean and population mean into a standardised score that follows the standard normal distribution.

It may seem odd to you that in Unit 1, you learned how to calculate the z-score, which is shown in this formula.

cap z postfix times equals x minus mu divided by sigma

Where:

x macron  = raw score

mu  = population mean

sigma  = population standard deviation

You may question how this relates to the z-statistic we have been discussing. While these concepts are related, they serve different purposes in statistical analysis.

The key difference lies in their focus:

  • The z-score focuses on a particular data point within a population. It tells us how many standard deviations an individual value is from the population mean. Imagine you are analysing the performance of a new marketing campaign. A z-score to determine how a specific customer’s spending compares to the average customer spending across all campaigns.
  • The z-statistic, on the other hand, focuses on the sample mean in relation to the population parameters. It tells us how many standard errors the sample mean is from the hypothesised population mean. This is crucial for hypothesis testing and making inferences about populations based on sample data. Using the same marketing campaign example, a z-statistic to test whether the average spending in your new campaign (based on a sample) is significantly different from the average spending (population mean).