Course content Course content

Exploring data: Graphs and numerical summaries

Start this free course now. Just create an account and sign in. Enrol and complete the course for a free statement of participation or digital badge if available.

More free courses

5.6 Quartiles and the interquartile range

The first alternative measure of dispersion we shall discuss is the interquartile range: this is the difference between summary measures known as the lower and upper quartiles. The quartiles are simple in concept: if the median is regarded as the middle data point, so that it splits the data in half, the quartiles similarly split the data into quarters. This is, of course, an over-simplification. With an even number of data points, the median is defined to be the average of the middle two: defining quartiles is a little more complicated.

It would be convenient to express our wordy definition of the median in a concise symbolic form, and this is easy to do. Any data sample of size n may be written as a list of numbers

x ₁,x ₂,x ₃, ... x _n

In order to calculate the sample median it is necessary to sort the data so that they are written in order of increasing size. The sorted list can then be written as

x ₍₁₎,x ₍₂₎,x ₍₃₎, ... x _(n)

where x ₍₁₎ is the smallest value in the original list (the minimum) and x _(n) is the largest (the maximum). In general, the notationx _(p) is used to mean the pth value when the data are arranged in order of increasing size. Each successive item in the ordered list is greater than or equal to the previous item. For instance, the list of six data items

7, 1, 3, 6, 3, 7

may be ordered as

1, 3, 3, 6, 7, 7

So, for these data, x ₍₁₎=1, x ₍₂₎=x ₍₃₎=3, x ₍₄₎=6, x ₍₅₎=x ₍₆₎=7.

In any such ordered list, the sample median m may be defined to be the number

m = x _{(½(n + 1))}

as long as the subscript on the right-hand side is appropriately interpreted.

If the sample size n is odd, then the number ½(n+1) is an integer, and there is no problem of definition. For instance, if n=27 then ½(n+1)=14, and the sample median is m = x ₍₁₄₎ side of it.

If the sample size n is even then the number ½(n+1) is not an integer but has a fractional part equal to ½. For instance, if n = 6 (as in the example above) then the sample median is

m = x _(½(n+1)) = x _(3½).

Such numbers are sometimes called ‘half-integer’

If the number x _(3½) is interpreted as ‘the number halfway between x ₍₃₎ and x ₍₄₎’ then you can see that the wordy definition survives intact. This obvious interpretation of numbers such as x _3½ can be extended to numbers such as x _(2¼) and x _(4¾): x _(2¼) is the number one-quarter of the way from x ₍₂₎ to x ₍₃₎, and x _(4¾) is the number three-quarters of the way from x ₍₄₎ to x ₍₅₎. Interpreting fractional subscripts in this way when they occur, the lower quartile (roughly, one-quarter of the way into the data set) and the upper quartile (approximately three-quarters of the way through the data set) may be defined as follows.

Sample quartiles

If a data set x ₁, x ₂,… , x _n is reordered as x ₍₁₎, x ₍₂₎, …, x _(n) , where

x ₍₁₎ ≤ x ₍₂₎ ≤ ... ≤ x(n)

then the lower sample quartile q_L is defined by

q_L = x _(¼(n+1))

and the upper sample quartile q_U is defined by

q_U = x _(¾(n+1))

Unfortunately, there is no universally accepted definition for sample quartiles, nor, indeed, a universally accepted nomenclature. The lower and upper sample quartiles are sometimes called the first and third sample quartiles. The median is the second sample quartile. Other definitions are possible, and you may even be familiar with some of them. For instance, some practitioners use

q_L = x _(¼n+½), q_U = x _(¾n+½)

Others use

q_L = x _(¼n+¾), q_U = x _(¾n+¼)

Still others insist that the lower and upper quartiles be defined in such a way that they are identified uniquely with actual sample items. However, almost all definitions of the sample median reduce to the same thing.

Previous 5.5 Measures of dispersion

Next 5.6.1 Quartiles for the SIRDS data

Take your learning further

Making the decision to study can be a big step, which is why you’ll want a trusted University. We’ve pioneered distance learning for over 50 years, bringing university to you wherever you are so you can fit study around your life. Take a look at all Open University courses.

If you’re new to university-level study, read our guide on Where to take your learning next, or find out more about the types of qualifications we offer including entry level Access modules, Certificates, and Short Courses.

Want to achieve your ambition? Study with us and you’ll be joining over 2 million students who’ve achieved their career and personal goals with The Open University.

Browse all Open University courses

My OpenLearn Profile

About this free course

Become an OU student

Download this course

Share this free course

5.6 Quartiles and the interquartile range

Sample quartiles