1.2 Producing useful tables
1.2.1 Data sets in different tabular forms
In much of your statistical work, you will begin with a data set, often presented in the form of a table, and use the information in the table to produce diagrams and/or summary statistics that help in the interpretation of the data set. However, in practice, much interpretation of data sets can be done directly from an appropriate table of data, or by re-presenting the data in a rather different tabular form. Dealing with data in tables is the subject of this section and the next. By the time you have finished, you should be able to produce tables which make certain aspects of the data in question more obvious.
Example 2.1 Lung cancer deaths in South Australia
Table 2.1 contains raw data on the incidence of, and mortality from, lung cancer in South Australia in 1981.
Table 2.1 Columns: age group; male and female population sizes; male and female cases; male and female deaths
A table like Table 2.1 may be adequate for someone who is merely taking a quick look at the data, perhaps prior to carrying out an analysis, but it is not the best way of presenting the figures to most readers. The objectives in producing a table that is actually being used to communicate information are to make the data immediately clear, and to facilitate picking out important patterns in them with the minimum of effort. To this end, there are several guidelines for producing tables which should be borne in mind.
Guidelines for tables
Labelling of rows and columns should be clear and unambiguous.
A table should contain the minimum amount of information needed to communicate its message. This may involve splitting the data into several simpler tables or pooling cells.
It may be appropriate to simplify the numbers in a table to aid speedy comprehension.
Useful summary statistics or calculation results should be added, where appropriate, to help communicate the message.
These guidelines will be followed in relation to Table 2.1 to see what changes they suggest.
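To see how the guidelines might translate into practice, the following is a minimal sketch in Python. It does not use the figures from Table 2.1: the counts in `raw` are invented purely for illustration, cover a single hypothetical group rather than males and females separately, and the helper names `pool`, `rate_per_100k` and `summarise` are assumptions introduced here. The sketch gives the columns explicit labels with units, pools two sparse age groups into one row, simplifies the population sizes to thousands, and adds rates per 100,000 as a calculated summary.

```python
# Hypothetical counts for illustration only; these are NOT the figures in Table 2.1.
# Each row is (age group, population, cases, deaths).
raw = [
    ("0-39", 400000, 4, 2),
    ("40-49", 70000, 20, 15),
    ("50-59", 65000, 80, 60),
    ("60-69", 50000, 150, 120),
    ("70+", 35000, 140, 125),
]


def pool(rows, label):
    """Pool several rows into one by summing each count column."""
    totals = tuple(sum(column) for column in zip(*(row[1:] for row in rows)))
    return (label,) + totals


def rate_per_100k(count, population):
    """Rate per 100,000 population, simplified to a whole number."""
    return round(100000 * count / population)


def summarise(rows):
    """Build a presentation table with clear labels, rounded sizes and rates."""
    table = []
    for label, population, cases, deaths in rows:
        table.append({
            "Age group (years)": label,
            "Population (thousands)": round(population / 1000),
            "Cases per 100,000": rate_per_100k(cases, population),
            "Deaths per 100,000": rate_per_100k(deaths, population),
        })
    return table


# Pool the two youngest groups, where the counts are small, into a single row.
pooled = [pool(raw[:2], "0-49")] + raw[2:]
table = summarise(pooled)

# Print the table with a header row and aligned columns.
headers = list(table[0].keys())
print("  ".join(f"{h:>22}" for h in headers))
for row in table:
    print("  ".join(f"{str(row[h]):>22}" for h in headers))
```

Whether pooling or rounding of this kind is appropriate depends on the message the table is meant to convey; the choices made here (which groups to pool, rates per 100,000, populations in thousands) are one possibility among several.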