Long page descriptions

Chapter 1   Tables, graphs and maps

1.1   Simple tables

1.1.1   Overview

This page is an overview of the section.

1.1.2   Frequency tables

A frequency table counts the number of items in a population that are in different groups.

1.1.3   Tables for other partitions

Other quantities such as money or production may be partitioned into several categories. The total of the values in the categories is meaningful in such tables.

1.1.4   Proportions and percentages

For frequency tables and other partitions, a column of percentages is often added.
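
As a minimal sketch of how such a percentage column could be computed, the following Python snippet divides each value by the total. The function name `add_percentages` and the counts are hypothetical and are invented purely for illustration.

```python
def add_percentages(counts):
    """Return (category, count, percent) rows for values that partition a total."""
    total = sum(counts.values())
    return [(name, n, round(100 * n / total, 1)) for name, n in counts.items()]


# Hypothetical frequency table (illustrative values only).
counts = {"Car": 120, "Bus": 60, "Walk": 20}
rows = add_percentages(counts)

for name, n, pct in rows:
    print(f"{name:<6}{n:>6}{pct:>7.1f}%")
```

The percentages are only meaningful because the values add up to a meaningful total.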

1.1.5   Other simple tables

An example shows a simple table whose values are not a partition of a meaningful total.

1.1.6   Variations

In simple tables, the categories may be reordered, related categories can be combined and it may be informative to restrict attention to a subset of categories.

1.1.7   Tables within tables

Larger published tables often consist of a set of smaller simple tables in different columns. Two examples are presented.

1.2   Presenting data in tables

1.2.1   Overview

This page is an overview of the section.

1.2.2   Gridlines and white space

Never use gridlines to box all values in a table. In large multi-column tables, reading across rows is easier with occasional hairlines or light shading, but otherwise consider using white space to separate associated groups of rows or columns.

1.2.3   Layout and annotation

Use white space to group related rows and columns. Rearranging rows or columns may bring values that should be compared closer. Summarise and interpret in the body of a report but do not simply repeat values.

1.2.4   Significant digits and data noise

The meaningful information is 'signal'. Information that does not help understanding of the data is 'noise'. Noise includes data noise and unnecessary embellishments to the table. Decreasing the significant digits displayed often decreases data noise.
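
One possible way to reduce the significant digits displayed is sketched below in Python. The helper `round_sig` is a hypothetical name, and two significant digits is an assumed default rather than a recommendation from the text.

```python
from math import floor, log10


def round_sig(x, digits=2):
    """Round x to the given number of significant digits."""
    if x == 0:
        return 0.0
    # Position of the leading digit, e.g. 4 for 12345.678 and -3 for 0.004567.
    exponent = floor(log10(abs(x)))
    return round(x, digits - 1 - exponent)


# Illustrative values only.
rounded_large = round_sig(12345.678)
rounded_small = round_sig(0.004567)
print(rounded_large, rounded_small)
```

Values rounded in this way keep their order of magnitude while dropping digits that are unlikely to carry signal.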

1.2.5   Meaningful variables

Showing proportions in a multi-column table instead of frequencies makes it easier to compare groups. Ratios of variables can be easier to interpret than their raw values.

1.2.6   Swapping rows and columns

It is easier to compare values down columns than across rows. Interchanging the rows and columns of a table can therefore make comparisons easier.

1.2.7   Reordering rows

Rearranging the rows (or columns) may make the information in large tables stand out better.

1.2.8   Example

An example shows a published table whose presentation can be improved in many ways.

1.3   Bar charts

1.3.1   Overview

This page is an overview of the section.

1.3.2   Bar charts

The simplest graphical display of a table of values is a bar chart. The bars can be drawn either vertically or horizontally. If the values are partitions of a total, a second axis with percentages can be added.

1.3.3   Ordering of categories

If there is no natural ordering of the categories in a bar chart, it is often informative to arrange them in decreasing order of the values. If the values are partitions of a total, cumulative percentages can be added.
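
The ordering and the cumulative percentages described above can be sketched as follows. This is only an illustration: the function name `ordered_with_cumulative` and the category values are assumptions, not part of the original material.

```python
from itertools import accumulate


def ordered_with_cumulative(values):
    """Sort categories by decreasing value and attach cumulative percentages."""
    ordered = sorted(values.items(), key=lambda item: item[1], reverse=True)
    total = sum(v for _, v in ordered)
    running = accumulate(v for _, v in ordered)
    return [
        (name, v, round(100 * cum / total, 1))
        for (name, v), cum in zip(ordered, running)
    ]


# Hypothetical values that partition a total of 100.
values = {"Scratches": 15, "Dents": 50, "Cracks": 25, "Other": 10}
pareto = ordered_with_cumulative(values)

for name, v, cum_pct in pareto:
    print(f"{name:<10}{v:>5}{cum_pct:>8.1f}%")
```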

1.3.4   Importance of zero and negative values

If possible, all bars should start at zero, so bar length is proportional to the values. If the axis does not include zero, this should be clearly indicated with zig-zags on the bars or axis. Negative values can be represented with bars hanging below the axis.

1.3.5   Chartjunk

Three-dimensional representations of bar charts should be avoided.

1.3.6   Misleading bar charts

It is misleading to replace simple bars with 2- or 3-dimensional objects whose height represents the values since the visual impression depends on the area or volume of the objects.
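
A short arithmetic sketch shows the size of the distortion, assuming the object is scaled uniformly in every dimension so that its shape is preserved. The function name `visual_ratio` is hypothetical.

```python
def visual_ratio(value_ratio, dimensions):
    """Ratio of areas (dimensions=2) or volumes (dimensions=3) when an
    object's height is scaled by value_ratio and its shape is preserved."""
    return value_ratio ** dimensions


# A value that is twice as large, drawn as a picture twice as high.
area_ratio = visual_ratio(2, 2)
volume_ratio = visual_ratio(2, 3)
print(area_ratio, volume_ratio)
```

Under this assumption, a value that is twice as large is drawn with four times the area or eight times the volume.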

1.3.7   Histograms

For frequency tables describing the distribution of continuous measurements, the bars of a bar chart have their heights equal to the frequencies if all classes have the same width. However, this is misleading if the class widths vary. Bar heights must then be defined differently, so that the area of each bar, not its height, is proportional to the frequency.
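
One common convention for classes of varying width, assumed here, is to set each bar height to the frequency divided by the class width, so that bar area is proportional to frequency. The sketch below uses a hypothetical function name and invented class boundaries.

```python
def bar_heights(class_edges, frequencies):
    """Return frequency per unit of class width for each class."""
    widths = [hi - lo for lo, hi in zip(class_edges, class_edges[1:])]
    return [f / w for f, w in zip(frequencies, widths)]


# Hypothetical classes 0-10, 10-20, 20-40 and 40-80 with their frequencies.
edges = [0, 10, 20, 40, 80]
freqs = [5, 20, 30, 20]
heights = bar_heights(edges, freqs)
print(heights)
```

In this example the last class has four times the frequency of the first but is also four times as wide, so the two bars have the same height.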

1.4   Pie charts

1.4.1   Overview

This page is an overview of the section.

1.4.2   Stacked bar charts and pie charts

The bars in a bar chart are sometimes stacked on top of each other. An alternative display is to represent the values by segments of a circle. These displays must only be used when the values are partitions of some total.

1.4.3   Comparison of bar and pie charts

Individual categories can be compared more easily in a bar chart. The combined contribution of several categories is displayed better in a pie chart.

1.4.4   Chartjunk for pie charts

Three-dimensional pie charts should be avoided.

1.4.5   Misleading stacked bar charts

Avoid splitting a picture into segments to form a stacked bar chart. The resulting picture often misrepresents the values for the different categories.

1.4.6   Colour

Colour is helpful to distinguish the categories in a pie chart. If it must be printed with shades of grey, care must be taken with the labelling of the categories.

1.5   Comparing groups

1.5.1   Overview

This page is an overview of the section.

1.5.2   Two-way tables

Data for comparing groups are often displayed in a two-way table with either the rows or columns corresponding to the different groups.

1.5.3   Proportions within groups

If the values in each group are partitions of a group total, as in a frequency table, a table or bar chart of percentages within the groups highlights the differences.
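
A minimal sketch of percentages within groups follows. It assumes the two-way table is stored as a dictionary of groups, each holding category counts. The function name and the counts are hypothetical.

```python
def within_group_percentages(table):
    """Convert each group's counts to percentages of that group's total."""
    result = {}
    for group, counts in table.items():
        total = sum(counts.values())
        result[group] = {cat: round(100 * n / total, 1) for cat, n in counts.items()}
    return result


# Hypothetical two-way table: groups of very different sizes.
table = {
    "Urban": {"Yes": 30, "No": 70},
    "Rural": {"Yes": 10, "No": 10},
}
percentages = within_group_percentages(table)
print(percentages)
```

Although the urban group has more "Yes" responses in absolute terms, the percentages show that "Yes" is relatively more common in the rural group.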

1.5.4   Clustered bar charts

The bars can be clustered together by group or by category.

1.5.5   Stacked bar charts

If there are many groups, stacking the bars often makes it easier to compare the groups.

1.5.6   Two categories

If there are only two categories in each group, there is no need to present both proportions. A simple bar chart of one proportion is sufficient and allows the scale to be expanded if the proportion is small for all groups.

1.5.7   Chartjunk

Three-dimensional versions of clustered and stacked bar charts make it harder to understand the data and can be misleading.

1.6   Relationships between measurements

1.6.1   Overview

This page is an overview of the section.

1.6.2   Scatterplots

A scatterplot displays the values of two measurements from each region or other 'individual'.

1.6.3   Information from scatterplots

Scatterplots show whether particular values of one measurement are associated with particular values of the second measurement. The strength of the relationship is important. Crosses that do not conform to the same relationship as the rest of the data are also important.

1.6.4   Distinguishing groups

The crosses on a scatterplot can be coloured to distinguish different groups, or differing symbols can be used.

1.6.5   Showing size

Scatterplots are often used when the crosses correspond to different geographical regions. If these differ in size, the crosses can be replaced by circles whose area is proportional to a measure of size such as land area, population or GDP.
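
Because the area of a circle grows with the square of its radius, the radius should be proportional to the square root of the size. The sketch below illustrates this. The function name, the `max_radius` parameter and the sizes are assumptions made for the example.

```python
from math import sqrt


def circle_radii(sizes, max_radius=20.0):
    """Return radii so that circle area is proportional to size.

    The largest region is drawn with radius max_radius.
    """
    largest = max(sizes.values())
    return {name: max_radius * sqrt(s / largest) for name, s in sizes.items()}


# Hypothetical region sizes (for example, populations in millions).
sizes = {"A": 64, "B": 16, "C": 4}
radii = circle_radii(sizes)
print(radii)
```

Region A is sixteen times the size of region C, so its circle has four times the radius and sixteen times the area.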

1.6.6   Nonlinear scales

Economic measurements from countries often contain many small values, but a few very large values (usually corresponding to rich countries). In order to distinguish the small values (often poor countries), a nonlinear scale can be used on a scatterplot.
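
A logarithmic scale is one commonly used nonlinear scale, and it is assumed here purely for illustration. On such a scale, values are positioned by their logarithms, so equal ratios occupy equal distances on the axis. The values below are invented.

```python
from math import log10


def log_positions(values):
    """Return axis positions on a base-10 logarithmic scale (values must be > 0)."""
    return [log10(v) for v in values]


# Hypothetical measurements spanning several orders of magnitude.
values = [100, 1_000, 10_000, 100_000]
positions = log_positions(values)
gaps = [b - a for a, b in zip(positions, positions[1:])]
print(positions, gaps)
```

On a linear axis the three smallest values would be crowded into the bottom tenth of the scale, whereas here each tenfold increase takes up the same distance.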

1.7   Maps

1.7.1   Overview

This page is an overview of the section.

1.7.2   Colouring regions

The geographic distribution of a measurement can be displayed using different colours on a map.

1.7.3   Choice of colours

The choice of colours to represent numerical measurements on a map is important. A continuous gradation involving three contrasting colours or a grouping into classes is usually best.

1.7.4   Adding information with circles

Shading regions on a map can only represent a single measurement. Drawing a circle on each region can represent both a 'size' measurement with the circle area and another measurement with its colour.

1.7.5   Maps with pie and bar charts

Maps can have other simple bar and pie charts superimposed on each region. Only simple information about each region should be displayed in this way.

1.7.6   Distorted population maps

In most conventional maps, the areas of the regions are proportional to their land areas. The shapes of the regions can be distorted to make their areas proportional to their populations.

1.7.7   Other distorted maps

It is sometimes informative to distort a map to make the areas of regions proportional to other measurements. These measurements must be partitions of a meaningful total so that the combined measurement for two regions would be their total.

1.8   Changes over time

1.8.1   Overview

This page is an overview of the section.

1.8.2   Time series charts

Changes to a single numerical measurement over time can be displayed in a scatterplot of the measurement against time with successive values joined by lines.

1.8.3   Time series for quantities

If the measurement is a quantity, an alternative display is a bar chart with a bar for each successive value.

1.8.4   Time series with categories

If the values at each time point form a partition of some total, as in a frequency table, a series of stacked bar charts of the values or proportions provides an effective display.

1.8.5   Chartjunk

Simple time series are sometimes drawn in 3 dimensions as a ribbon chart. This may look more artistic, but makes the information in the data harder to see.

1.8.6   Dynamic pie and bar charts

When presented on a computer, simple diagrams such as pie and bar charts can be animated to show how they change over time. However, when the individual diagrams are simple, it is often possible to display all the data more effectively in a static diagram.

1.8.7   Dynamic histograms

Frequency tables of numerical measurements such as ages are usually displayed in histograms. They too can be animated to show changes over time.

1.8.8   Dynamic scatterplots

A particularly useful type of display to animate is a scatterplot, either of crosses or circles of varying sizes. When the crosses represent countries or regions, it is often possible to pick out ones that behave differently from the rest.

1.8.9   Dynamic maps

Maps can also be animated to show changes over time. The measurement of interest can be displayed with either the shading of countries on the map or of circles that are superimposed on it.

1.9   Data presentation principles

1.9.1   Overview

This page is an overview of the section.

1.9.2   Types of publication

Information may be published in executive reports, for archival purposes or in documents for the general public. Publication is increasingly done electronically rather than on paper. Resolution, availability of colour, production cost and ease of formatting are issues that should be considered.

1.9.3   From data to information

Reports generally only summarise the most important features of the available data. Identifying the important features is subjective and is similar to extracting the signal from a noisy electronic communication.

1.9.4   Tables, graphs and text

Reports should contain a balance of tables, graphs and text. Annotations on graphs can highlight important features.

1.9.5   Combining simple graphs

Several simple graphs can sometimes be linked together in a single display. They should usually be drawn either on a common time axis or on a map.

1.9.6   Advanced graphs

Most reports only contain simple graphs such as the general-purpose ones described in this chapter. More advanced graphs are needed to effectively display some specific types of data but they are generally too complex for reports intended for the general public.

1.9.7   Innovative graphics

Some excellent graphics have been devised to display particular data sets. There is still scope for innovation.