
An introduction to computers and computer systems

5 Representing text in binary

Most modern systems for encoding text derive in part from ASCII (American Standard Code for Information Interchange, pronounced ‘askee’), which was developed in 1963. In the original ASCII system, upper-case and lower-case letters, numbers, punctuation and other symbols and control codes (such as a carriage return, backspace and tab) were encoded in 7 bits. As computers based on multiples of 8 bits (or a byte) became more common, the encoding system became an 8-bit system, and so could be expanded to include more symbols.
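To see what these codes look like in practice, here is a short illustrative sketch in Python (an addition to this text, not part of the ASCII standard itself) that prints the numeric code of a few characters as 8-bit binary patterns. The leftmost bit is 0 for all of these characters because the original codes used only 7 bits.

# Print a few characters with their numeric codes and 8-bit binary patterns.
for ch in ['A', 'a', '7', ' ']:
    code = ord(ch)                        # numeric code of the character
    print(repr(ch), code, format(code, '08b'))

Running this prints, for example, 'A' 65 01000001 and 'a' 97 01100001.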

When binary numbers were assigned to each character in the original ASCII system, careful thought was given to choosing sequences of values for the characters of the alphabet and numerals that would make it easy for a computer processor to perform common operations on them. (These encodings were preserved in the 8-bit system by simply padding out the leftmost bit with a 0.)

To illustrate, let’s look at the 8-bit encoding of some of the lower-case and upper-case letters of the English alphabet, shown in Table 1.

Binary      Character    Binary      Character
0100 0001   A            0110 0001   a
0100 0010   B            0110 0010   b
0100 0011   C            0110 0011   c
0100 0100   D            0110 0100   d
0100 0101   E            0110 0101   e
0100 0110   F            0110 0110   f
0100 0111   G            0110 0111   g
0100 1000   H            0110 1000   h
0100 1001   I            0110 1001   i
0100 1010   J            0110 1010   j
Table 1

(In the original table, the third binary digit of each code, which is 0 for the upper-case letters and 1 for the lower-case letters, is highlighted in blue.)

Notice that the ASCII values for corresponding upper-case and lower-case characters always differ in just one bit: the third bit from the left, which is 0 for upper case and 1 for lower case. This means that converting from upper case to lower case (a very common manipulation of text) is simply a matter of ‘flipping’ that one bit.
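To make the idea concrete, here is another short illustrative Python sketch (again an addition to the course text). The bit that differs has the value 32, or 0010 0000 in binary, so flipping it with an exclusive-OR switches a letter between upper and lower case.

# Flipping the bit with value 32 (binary 0010 0000) switches
# an ASCII letter between upper case and lower case.
def flip_case(ch):
    return chr(ord(ch) ^ 0b00100000)     # XOR toggles exactly that one bit

print(flip_case('A'))   # prints 'a'
print(flip_case('j'))   # prints 'J'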

The original ASCII codes are suitable for representing North American English, but they do not allow for other languages that use Latin characters with diacritics, nor for languages that do not use a Latin alphabet at all. This became a serious limitation as the use of computers and processors spread around the world. It took a long time for an acceptable international standard to emerge, but since 2007 the dominant encoding system for characters has been Unicode Transformation Format-8 (UTF-8), which uses a variable number of bytes (from one to four) to encode characters in use across the world. To maintain backward compatibility, the original 128 ASCII codes are preserved in UTF-8 as single bytes.
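As a rough illustration (another added sketch, not part of the original text), the following Python lines show both properties: an ASCII character occupies a single byte whose value matches its old ASCII code, while other characters take two, three or four bytes.

# Encode each character in UTF-8 and show the resulting bytes in hexadecimal.
for ch in ['A', 'é', '€', '😀']:
    encoded = ch.encode('utf-8')         # UTF-8 bytes for this character
    print(ch, len(encoded), encoded.hex(' '))

Here 'A' encodes to the single byte 41 (65 in decimal, its ASCII code), 'é' to two bytes, '€' to three and the emoji to four.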

In the next section, you will look at numbers.