Exploring communications technology
Exploring communications technology

Start this free course now. Just create an account and sign in. Enrol and complete the course for a free statement of participation or digital badge if available.

Free course

Exploring communications technology

3.4 Perceptual encoding

Sounds of certain frequencies or certain colours are perceived better than others. Useful reductions of file size or data rate can often be achieved if this fact is exploited during encoding of the source. MP3 music files for example are typically one-tenth of the size of equivalent, uncompressed music files (such as CD files).

Humans are more sensitive to frequencies in the range 1 to 5 kHz than to those outside this range. This is shown Figure 3.4. The red line is the threshold of hearing. Sounds below the threshold are inaudible. The threshold is lowest between 1 and 5 kHz. It rises above 5 kHz and below 1 kHz. At these frequencies, the quietest audible sounds are louder than the quietest audible sounds between 1 kHz and 5 kHz.

In Figure 3.4, two single-frequency tones A and B are shown with the same amplitude, but A is audible and B is inaudible.

Described image
Figure 3.4 Hearing sensitivity threshold response curve of the human ear with two equal-amplitude frequency tones, A and B

A relatively loud sound at a particular frequency reduces our sensitivity to neighbouring frequencies. This is frequency masking. Figure 3.5 shows a loud sound A raising the perceptual hearing threshold in its vicinity. Sound B, which would otherwise be audible, is made inaudible. Under these circumstances, it would be unnecessary to encode sound B.

Figure 3.5 Frequency masking for two single-tone frequencies, A and B, with A louder than B

Another form of masking is temporal masking. This arises because our sensitivity to sounds in a narrow frequency range is reduced for a short period before and after the presence of a relatively strong sound in that frequency range. You may be surprised that sensitivity can be reduced before as well as after a relatively loud sound. This is a result of the way the auditory system and brain process audio information.

Following a loud sound, it takes the ear up to 50 ms to be able to respond again to a much quieter sound. The resulting temporal masking envelope is displayed in Figure 3.6. The shaded region represents inaudible signal amplitudes following a very strong signal at time T.

Described image
Figure 3.6 Temporal masking effect of a loud sound at T and resulting inaudible envelope

Take your learning further

Making the decision to study can be a big step, which is why you'll want a trusted University. The Open University has 50 years’ experience delivering flexible learning and 170,000 students are studying with us right now. Take a look at all Open University courses.

If you are new to University-level study, we offer two introductory routes to our qualifications. You could either choose to start with an Access module, or a module which allows you to count your previous learning towards an Open University qualification. Read our guide on Where to take your learning next for more information.

Not ready for formal University study? Then browse over 1000 free courses on OpenLearn and sign up to our newsletter to hear about new free courses as they are released.

Every year, thousands of students decide to study with The Open University. With over 120 qualifications, we’ve got the right course for you.

Request an Open University prospectus371