My OpenLearn Profile

Personalise your OpenLearn profile, save your favourite content and get recognition for your learning

Create account / Sign in

Course content Course content

An introduction to electronics

Start this free course now. Just create an account and sign in. Enrol and complete the course for a free statement of participation or digital badge if available.

More free courses

4.3 Recording and analysing speech

Generally, sound waves are much more complicated than a wave made by humming a single note or the sound made by an electric toothbrush. For example, Figure 23 shows the waveforms generated by someone saying ‘yes’ followed by ‘no’. Let this be called Recording A.

Figure 23 Recording A: ‘yes’ followed by ‘no’

Show description|Hide description

This is a screenshot from Interactive 1, showing the wave pattern for the spoken word ‘yes’ followed by ‘no’. ‘Yes’, on the left, has a group of spikes that look like a ball, followed by a ‘tail’ of small, bunched-together lines (probably the high-frequency hiss at the end of ‘yes’). On the right, the vertical lines for ‘no’ make a ball with no tail. Their amplitude is greater, possibly because ‘no’ is more emphatic than ‘yes’ and we say it louder.

Figure 23 Recording A: ‘yes’ followed by ‘no’

SAQ 6

Each of the bursts of sound shown in Figure 24 is another recording of a person saying either ‘yes’ or ‘no’. Let this be called Recording B. Comparing Recording B with Recording A, which of the bursts of sound in Recording B is ‘yes’ and which is ‘no’?

Figure 24 Recording B

Show description|Hide description

This is another screenshot from Interactive 1. The burst of sound on the left looks like a ball. The burst of sound on the right looks like a smaller ball with a tail.

Figure 24 Recording B

Answer

The left sound burst in Recording A has a long tail, presumably caused by the long ‘s’ sound at the end of ‘yes’. The right sound burst does not have this tail. In Recording B, the right sound burst has a tail but the left sound burst does not. From this, it can be guessed (correctly) that the left sound burst in Recording B is ‘no’ and the right sound burst is ‘yes’.

The fact that different words result in different wave patterns underlies the technology of speech recognition. This technology has evolved to a high performance level over the last half century, but it has overcome some formidable problems. For example, are one person’s speech patterns the same as another’s?

Return to Interactive 1 (which you should still have open in a separate tab) and spend five or ten minutes experimenting by making your own sounds. If you hum a low note, are you able to calculate its frequency? Can you distinguish the patterns when you say ‘yes’ and ‘no’? Are your ‘yes’ and ‘no’ wave patterns similar to those shown in Figure 23?

When you have finished, close the interactive.

Previous 4.2 Recording sounds

Next 4.4 Signals and sine waves

Take your learning further

Making the decision to study can be a big step, which is why you’ll want a trusted University. We’ve pioneered distance learning for over 50 years, bringing university to you wherever you are so you can fit study around your life. Take a look at all Open University courses.

If you’re new to university-level study, read our guide on Where to take your learning next, or find out more about the types of qualifications we offer including entry level Access modules, Certificates, and Short Courses.

Want to achieve your ambition? Study with us and you’ll be joining over 2 million students who’ve achieved their career and personal goals with The Open University.

Browse all Open University courses

My OpenLearn Profile

About this free course

Become an OU student

Download this course

Share this free course

4.3 Recording and analysing speech

SAQ 6

Answer