Routledge Handbook of Interdisciplinary Research Methods

Anything can be visualized – whether it is financial trends, phylogenetic relationships, partisanship in senate voting patterns or even the concept of evolution (Figure 2.12.1). Visualizing data brings quantities, forms and relationships into view when the subject matter is minuscule or distant, abstract or intangible, transient or multiscale.

Visualizing data can be essential to making sense of data by enabling discoveries and increased understanding. Visualizations can also facilitate education and enjoyment, and have even become cultural icons. As a new ‘photojournalism’ (Stefaner 2014), visualizing data can reveal unseen issues; such as when a humble chart catalysed the creation of the Bill and Melinda Gates’ foundation – ‘that rotavirus slice in the pie chart set us on fire’ (Gates 2013).

Visualization tools make businesses intelligent, allowing us to ‘Answer questions as quickly as you can think of them’ (TableauTM 2010). This might be unsurprising. When we visualize, we wire data into our cognition via advanced graphics technologies and the highly evolved human visual system. More information is consumed ‘through vision than through all of the other senses combined’ (Ware 2012), so why consume information any other way? As Peter Hall (2008) suggests, as data and information inundate our lives, ‘diagrams, maps, and visualisation tools offer a means to filter and make sense of it’.

Anscombe’s demonstration has, however, been staged. It depends on a particular graphical representation, where the categories are separated across multiple graphs that have common scales. This staging can easily be undermined (Figure 2.12.4) by encoding data in ways our perception does not instinctively decode, or that our cognition cannot translate. If our perception and cognition fail, does a visualization actually visualize?

Design Space

Any single design is just one realization of the design constraints, with alternative designs arising when the data (Figure 2.12.4B) or media change (Figure 2.12.4F), or when non-standard plots are required (Figure 2.12.5). Divergent designs might arise from the same design constraints (Figure 2.12.6), and differing design constraints might produce convergent designs. A discussion of ‘visualization’ – whether as objects, a set of methods or a subject – could be hindered by our predisposition to viewing visualization through the design constraints that can also define function and how function is evaluated.

To shed some of these inbuilt values and perspectives, we could consider visualizations as collections of visual objects and nothing more, and consider all design possibilities, even those that have not been made material. We can use this ‘Design Space’ as a shorthand term for the infinite variation of visualizations. Some areas in this space will have a use, or multiple uses, and some will have no conceivable purpose. Design Space is envisaged as a hyper-volume of all possible visualization designs with as many dimensions as there are ways to visualize data using coordinate and mapping systems, visual encodings and formatting, scales and sizing, sampling and aggregation methods, etc.

Figure 2.12.6 Paired examples of the same data visualized in different areas of Design Space – (a.i–ii) visualizations of the evolution of On the Origin of Species by Charles Darwin; (b.i–ii) timelines of Arab Spring events; (c.i–ii) death toll in Iraq during the American occupancy; (d.i–ii) O-ring damage during space shuttle launches.
Image credits: (a.i) https://fathom.info/traces/; (a.ii) www.moma.org/interactives/exhibitions/2011/talktome/objects/145525/; (b.i) www.informationisbeautifulawards.com/showcase/113-arab-spring; (b.ii) www.thefunctionalart.com/2015/02/redesigning-circular-timeline.html; (c.i) www.scmp.com/infographics/article/1284683/iraqs-bloody-toll; (c.ii) www.youtube.com/watch?v=Ybwh4lejYO4; (d.i) and (d.ii) Reprinted by Permission, from Visual Explanations, Edward Tufte, Graphics Press.

There are issues with this definition of Design Space, but its vagueness forces us to reflect on how we define, evaluate and interpret visualizations. Different disciplines can impose highly specific views onto qualities such as ‘effectiveness’ or ‘beauty’, and how visualizations might be used and created. In what follows, we will explore topics such as function, technology, aesthetics and our approaches to studying visualization. To start, let us consider if Design Space might be charted, and what parts of this n-dimensional space ‘work’?

Lost in Design Space

Many books and blogs assist the craft of visualizing data, by suggesting how to visualize data effectively using different coordinate systems, visual encodings, patterns of emphasis and data manipulations. Each perspective, however, will at some point fail. Visualization ‘rules’ are often drawn from experimental evaluations that compare simplified, tractable compartments of Design Space. As visualization science lacks a wholly predictive theory (Kindlmann and Scheidegger 2014), the science accumulates contingent rules to understand and compare the relative suitability of designs given specific data types or tasks. Rather than providing a reliable rule-based mapping between designs and their properties, these studies instead point to the unavoidable difficulties of a predictive theory as there are instabilities in Design Space where the properties of a design depend on the data. In Design Space, contingency reigns.

For example, even simple datasets can experience conflicts between the ‘rules of thumb’ that should assist us when visualizing data, such as when different categories conflict in their demands for a truncated axis (an axis not starting at zero) or demand differing aspect ratios (the relative dimensions of the plot) (Figure 2.12.7). When visualizing data, it might be inevitable that we hide some patterns as we reveal others. Patterns can be a composite of features that might be optimally revealed in different kinds of charts and not viewable in any single graph.

Figure 2.12.7 Visualizing two data series with contrasting demands in one graph. In the top row (A), the details of the grey oscillation are revealed by stretching the graph. More detail is seen by squashing the y-axis (B), which increases the aspect ratio further, showing the different rates of increase and decline. However, the pattern in the black data becomes increasingly hidden. Each stretch, and each squash, flattens the black pattern. More could be seen of the black data in the thinnest and, relatively, tallest plot (left hand side of A) where the grey data was least visible. By zooming in the detail of trend, and fine scale oscillations around that trend, are shown for the black data (C–E), but at the expense of the grey data. In (E) we have contravened what some might call a golden rule by truncating the y-axis. The format of a graph might not always suit all the patterns it contains. Arbitrary data selections were downloaded and modified from www.sidc.be/silso/datafiles and for the Waddington data station http://data.giss.nasa.gov/gistemp/stdata/.

Degrees of freedom

Design Space might initially appear to be small for simple data sets. For instance, a scatterplot might seem the only choice when visualizing two vectors of continuous data, such as for a category in Anscombe’s Quartet (e.g. X_P Y_I in Figure 2.12.2). Yet the axes of a scatterplot can be aligned to produce a parallel-coordinates plot, then bent to simulate a chord diagram or hive plot, or the values can be summed for a stacked bar chart, which can be bowed into a pie chart and then punctured to produce a donut plot. Each jump in Design Space can modify the meaning and information content (Figures 2.12.3, 2.12.4 and 2.12.5), even when the symbols, shapes and scales are unchanged.

Different software reveal and optimize different design possibilities, determining what can be defined and manipulated programmatically, or otherwise. For instance, data manipulation and analysis might be easier in some software (R; www.r-project.org/), whereas control over the form of shapes and interactivity is easier in others (Processing; https://processing.org) and interactive web applications might be more naturally created elsewhere (P5; https://p5js.org/ and D3; https://d3js.org/). Each software offers a different view of Design Space. Spreadsheet applications can launch users towards apparently polished forms, but designing beyond the defaults requires flexibility within software, and the facilities to create new templates and functions in code, or by other means. Design Space is too vast, and its contingencies too many, to be entirely contained within defaults.

Points of view

In his book review entitled ‘Pretty vacant’, Kevin Walker (2014) critiques the apparent hollowness of some visualizations which substitute function and precision with frivolity and fun. Despite being more likely to reside in coffee table books than to inform system-critical decisions, these ‘vacant visualizations’ can face incredibly strong criticisms that have included censorship campaigns. Neither art nor science, these visual stories do not necessarily claim any grand discoveries or offer experiential epiphanies. The vigorous critique is often aimed at explorations in Design Space that go beyond the software defaults.

However, the differences in this ‘infographic’ genre are not always recognized when it is critiqued. Vacant visualizations may use ‘fun’ illustrations and pictograms that aid memorability and recall (Borkin et al. 2015) and so improve understanding in ways that pared-back graphs cannot. This does not stop purists being concerned with the use of chart junk, in what they might already consider to be junk charts. Other approaches that use ‘arbitrary encodings’ (those that must be learnt through the visualization itself (Ware 2012)) are more readily accepted due to their apparent aesthetic qualities. Some propose that arbitrariness might stimulate deliberative reasoning which could benefit comprehension (Hullman, Adar and Shah 2011). This strategy might only work when aesthetics seduce the reader sufficiently for them to invest in decoding the images, though the seduction need not lead to anything more.

The visible spectrum

Right now we are exposed to the broadest spectrum of visualization expertise, literacy, use, tools and interest that has ever existed. At one set of extremes we have the purer reflections of the promise, where inspirational bespoke interactive visuals have inspired large changes in the practices, structure and audiences of influential organizations. At the other extremes are people who do not know what visualizations are and do not use digital technologies.

This spectrum offers diverse opportunities to develop lenses to see beyond our myopias and beyond the myopic brouhahas of data-ink ratios. Probing this spectrum could help define and reconcile how Design Space might be mapped to concepts beyond functionality and

References

Anscombe, F. J. (1973). Graphs in statistical analysis. American Statistician, 27(1): 17–21.

Borkin, M., Bylinskii, Z., Kim, N., Bainbridge, C., Yeh, C., Borkin, D., Pfister, H. and Oliva, A. (2015). Beyond memorability: visualization recognition and recall. IEEE Transactions on Visualization and Computer Graphics, 22(1): 519–528.

Brody, H., Rip, M. R., Vinten-Johansen, P., Paneth, N. and Rachman, S. (2010). Map-making and myth-making in Broad Street: The London cholera epidemic, 1854. The Lancet, 356: 64–68.

Gates, B. (2013). Bill Gates: Dimbleby lecture. [online] Retrieved 6 July 2016 from: www.gatesfoundation.org/media-center/speeches/2013/01/bill-gates-dimbleby-lecture

Hall, P. (2008). Critical visualization. In P. Antonelli (Ed.) Design and the Elastic Mind (pp. 122–131). New York, NY: Museum of Modern Art, Harrison.

Hullman, J., Adar, E. and Shah, P. (2011). Benefitting InfoVis with visual difficulties. IEEE Transactions on Visualization and Computer Graphics, 17(12): 2213–2222.

Kindlmann, G. and Scheidegger, C. (2014). An algebraic process for visualization design. IEEE Transactions on Visualization and Computer Graphics, 20(12): 2181–2190.

Lupi, G. (2012). Non-linear storytelling: journalism through ‘Info-spatial’ compositions. Parsons Journal for Information Mapping, IV(4): 1–11.

Robison, W., Boisjoly, R., Hoeker, D. and Young, S. (2002). Representation and misrepresentation: Tufte and the Morton Thiokol engineers on the Challenger. Science and Engineering Ethics, 8(1): 59–81.

Stefaner, M. (2014). Worlds, not stories. [online] Retrieved 6 July 2016 from: http://well-formed-data.net/archives/1027/worlds-not-stories

Tableau (2016). Answer questions as fast as you can think of them. [Online] Retrieved 29 April 2016 from: http://get.tableau.com/trial/p3group.html?width=300&height=300&inline=true

Talbot, J., Setlur, V. and Anand, A. (2014). Four experiments on the perception of bar charts. IEEE Transactions on Visualization and Computer Graphics, 20(12): 2152–2160.

Tufte, E. R. (1990). Visual Explanations: Images and Quantities, Evidence and Narrative. Cheshire, CT: Graphics Press.

Walker, K. (2014). Pretty vacant: what we’re not seeing in graphics today. New Scientist. [Online] Retrieved 6 July 2016 from: www.newscientist.com/article/mg22429991-700-pretty-vacant-what-were-not-seeing-in-graphics-today/

Ware, C. (2012). Information Visualisation: Perception for Design (3rd ed.). Burlington, MA: Morgan Kauffman.

12
Visualizing data

The promise

Do visualizations visualize?

Design Space

Lost in Design Space

Degrees of freedom

Designed by defaults

Points of view

20/20 visualization

The visible spectrum

References

12 Visualizing data

The promise

Do visualizations visualize?

Design Space

Lost in Design Space

Degrees of freedom

Designed by defaults

Points of view

20/20 visualization

The visible spectrum

References

12
Visualizing data