Abstract
Introduction
Data graphs facilitate analysis and communication of quantitative information. The perhaps most common types of data graphs, line and bar graphs, were first published by William Playfair in 1786. Impressively, Playfair’s choice of graph design has pervaded. By 1913, line and bar graphs were being used to communicate with a large public. For example, campaigns from the US Health Department promoting falling death rates were displayed widely in street parades and on carriages (Brinton, 1914). Over time, the utility of line and bar graphs has been well supported by a body of experimental research (Spence, 2006) and integrated into the education curriculums used throughout the world to develop children’s ability to read and construct data visualizations (Börner et al., 2019; Franconeri et al., 2021). Even in the modern era where data visualizations are presented in digital form (Friendly, 2008), line and bar graphs are a prominent part of current guidelines for constructing effective data visualization (Franconeri et al., 2021).
Although the established conventions for data visualization suggest using bar graphs to display nominal category values and line graphs to display metric scale values, different types of graphs promote different kinds of interpretations (Zacks and Tversky, 1999). For example, bar graphs more effectively communicate contrast between conditions, for instance when communicating differences in a person’s mood across two times of day, even though the scales are metric (Franconeri et al., 2021). Thus, choice of graph type should also consider how well the intended message may be comprehended by the viewer.
Graph schema
Several cognitive models of graph comprehension (e.g. Lohse, 1993; Padilla et al., 2018; Pinker, 1990) suggest that people have a
Pinker (1990) proposed that graph schemas are part of a

Applying typicality research to data graphs.
Graph typicality
Notably, the general schema may be activated more quickly by specific instances that are especially typical or common – a phenomenon known as the
Two kinds of models have been proposed to account for typicality effects. In prototype models (Rosch and Mervis, 1975), typicality is determined by the extent to which a stimulus has the specific features that define the prototype. In exemplar models (Storms et al., 2000), typicality is determined by the degree of similarity of a stimulus to one or more of all the exemplars stored in memory. In both models, frequency of exposure to specific instances plays a critical role for the development of cognitive representations – either by supporting formulation of the feature set that defines the prototype or by expanding the quantity or quality of exemplars stored in memory. This suggests that, whether using prototype or exemplar models, graphs that are encountered more often should become more typical, and in turn be interpreted more quickly. Indeed results from two studies indicate that individuals can pick out specific values or the higher of two values more quickly when viewing vertical bar graphs than when viewing horizontal bar graphs (Fischer et al., 2005) or line graphs (Ratwani and Trafton, 2008). The implication is that vertical bar graphs may be seen more often and thus be deemed more typical than horizontal bar graphs or line graphs.
The present study
In this study, we investigate if and how typicality effects manifest across a variety of data graphs. Specifically, we examine typicality of three types of L-shape graphs that are both ubiquitous (Friendly and Denis, 2005) and serve as key examples in graph comprehension models (Kosslyn, 1989; Pinker, 1990; Ratwani and Trafton, 2008): vertical bar graphs, horizontal bar graphs and line graphs; and three types of data patterns that are commonly depicted in these graphs: rising, neutral and falling patterns. Our results inform both scientific and practical knowledge about data graphs. First, our results provide new knowledge about the structure of the graph schema, specifically whether they may be organized in a hierarchical structure (Pinker, 1990; Ratwani and Trafton, 2008). Second, the results will inform decisions about which graph types might best facilitate viewers’ understanding as well as the design of icons in human–machine interactions (where fast graph recognition is crucial). Third, in the long run, knowledge of typicality effects will help designers create more memorable visualizations, since unique stimuli seem to be easier to remember (Borkin et al., 2013).
Drawing on prior work, we study typicality effects using two methods (Rosch et al., 1976). In Study 1, we followed the traditional approach where participants are asked to provide subjective ratings of the typicality of specific instances of the general category (e.g. pictures of different types of birds). Here, ratings indicate relative typicality of the different graph types and data patterns. In parallel, we obtained participants’ perceptions of the relative frequency of their prior exposure to the different graph types and data patterns. In Study 2, we used a verification time task approach wherein participants are asked to judge whether or not specific instances belong to the general category. Here, faster reaction times indicate greater typicality of specific graph variants. In line with prior work (Ratwani and Trafton, 2008), we expected that participants had encountered vertical bar graphs more frequently and that these graphs would be deemed, both subjectively and behaviourally, as the most typical of the data graphs studied here.
Study 1
Method
Participants
A priori calculations indicated that 28 participants would be needed to have a statistical power of .80 for detecting a medium effect size of
Material and procedure
The study was conducted online using Unipark (Questback GmbH, 2019). Each person was presented with 9 stimuli in a randomized order which consisted of 3 graph types (vertical bar graph, horizontal bar graph and line graph) and 3 data patterns (rising, neutral, falling). The nine data graphs (created using Excel) are depicted in Figure 2. Each data graph showed 6 data points. For the rising patterns, the first number was randomly chosen from 0 – 10 (6.26), the second from 10 – 20 (16.65), the third from 20 – 30 (26.31), the fourth from 30 – 40 (39.88), the fifth from 40 – 50 (47.49) and the sixth from 50 – 60 (50.66). For the falling patterns, we used the generated numbers from the rising patterns in a reversed order. For the neutral pattern, six numbers were randomly chosen from a range between from 25 – 35 (resulting in 33, 31.95, 32.94, 30.02, 31.80 and 33.76). We chose this range to avoid creating a rising or falling pattern. The axis showing the value of the data points was always labelled with tick-marks from 0 to 60 with 10-point intervals. The pixel size of each data graph was 379 × 379. First, all graphs were presented one by one in an individually randomized order and participants rated how typical the shown graph was for a data graph using a 1 (very untypical) to 6 (very typical) response scale. Second, participants used the mouse to sort the stimuli in descending order in accordance with their estimation of how often they had seen a similar graph in their life before. The most frequently encountered graph was to be placed at the top (first place was coded as 1) and the least frequently encountered graph was to be placed at the bottom (last place was coded as 9). All stimuli images and data are available at https://osf.io/7ZM2D

Data graph stimuli of the studies.
Results
Figures 3 and 4 show the mean values for the typicality ratings and the frequency rankings.

Mean values for typicality ratings (left) and frequency rankings (right).
Typicality ratings
A 3 × 3 ANOVA with the within-subject factors graph type (vertical bar graph, horizontal bar graph, line graph) and data pattern (rising, neutral, falling) showed a significant main effect for graph type,
In regard to the
In regard to the
Frequency ranking
A 3 × 3 ANOVA with the within-subject factors graph type (vertical bar graph, horizontal bar graph, line graph) and data pattern (rising, neutral, falling) showed a significant main effect for graph type,
In regard to the
In regard to the
The interaction effect reflects that, while line graphs were ranked as more frequently encountered than horizontal bar graphs for the rising and neutral data pattern, line graphs were ranked as less frequently encountered than horizontal bar graphs for the falling data pattern. Notably, vertical bar graphs with a rising data pattern were ranked as most frequently encountered.
Follow-up analysis
For each person, we calculated the correlation between all their typicality ratings and all their frequency rankings. The mean of the correlations was –.58. Fisher r-to-Z transformation and a one-sample
Discussion
Despite using common (L-shaped) data graph formats that can be used interchangeably to display many variants of data, Study 1 documented substantial differences in rated typicality and consistent frequency ratings. Vertical bar graphs seem to dominate the representation. The correlation showed that typicality ratings were related to perceived frequency of exposure. Thus, the more often a specific kind of data graph has been observed in the past, the more typical it is perceived. Overall, perceived typicality was highest for vertical bar graphs, followed by line graphs followed by horizontal bar graphs. This order is largely in line with Ratwani and Trafton’s (2008) finding based on participants’ reaction times for reading off a value, where vertical bar graphs were read fastest, followed by horizontal bar graphs and then line graphs.
Typicality was also influenced by the data pattern depicted in the graph, albeit with a smaller effect size than the effect size of the graph type. Rising patterns were perceived as more typical than falling patterns. Neutral patterns seem to be less typical than rising or falling pattern. The finding of the high perceived frequency of rising trends is in line with previous findings from related fields. For example, research on function learning (Brehmer, 1971, 1974; Busemeyer et al., 1997: Kalish et al., 2004, 2007; McDaniel and Busemeyer, 2005) and graph perception (Ciccione et al., 2022) has shown that people have a bias for linear functions with a positive slope, that increasing linear functions are among the most easily learned functions, and that people extrapolate more easily from linear rising trends than from exponential ones.
In a second study, we used a verification time approach to check whether the conclusions obtained from self-report measures used in Study 1 might also manifest in behavioural measures.
Study 2
Method
Participants
In Study 2, 30 German speaking participants (17 women, 13 men, age
Materials and procedure
Participants were tested individually in a laboratory on a laptop computer with a 12.5-inch screen. The program was controlled by psychopy (Peirce et al., 2019). We used the same design and stimuli as in Study 1, except with a few changes. The nine data graphs were supplemented with a set of ‘shredded’ stimuli that might not be considered data graphs. These stimuli were created by taking each graph of Figure 2 and manipulating the image using image effects in the freeware IrfanView. For vertical bar graphs and line graphs, a vertical shift effect with a strength of 50 was introduced. For the horizontal bar graphs, a horizontal shift effect with a strength of 50 was used. Figure 5 shows the nine ‘shredded’ stimuli. During the study, each of 9 data graphs was presented 5 times so that RT could be averaged across multiple presentations of the same stimulus. Specifically, the stimuli were presented in an individually randomized order of 90 trials (9 data graph stimuli × 5 presentation times, and 9 shredded stimuli × 5 presentation times). On each trial, participants were asked to indicate as quickly as possible by pressing a key whether the presented stimulus was a data graph (right arrow key) or a shredded variant of a graph (left arrow key).

Shredded stimuli used in the study.
A fixation cross in the middle of the screen appeared before each stimulus for 1 second. At the beginning of the experiment, participants conducted a practice trial with six example stimuli. Three stimuli were data graphs (each data graph type with a neutral pattern) and three stimuli were shredded stimuli. The dependent variable, reaction time for correct recognition, was measured at each trial.
Results
Two criteria led to the exclusion of small amounts of reaction time data: wrong answers (0.011% of the data) and reaction times larger than 3000ms (0.001% of the data). For each of the nine data graphs, the mean value across the five (or less in case of excluded data) reaction times was calculated.
Figure 6 shows the mean reaction times for each data graph.

Mean reaction times for each data graph.
The 3 × 3 ANOVA showed a significant main effect for graph type,
In regard to the
In regard to the
The mean RT for shredded stimuli was 778 ms (SD = 463 ms) and for data graphs was 776 ms (SD = 500 ms). The difference was not significant (
Discussion
The results from Study 2 showed that graph type had a significant influence on the verification time. In Study 1, the vertical bar graphs were rated most typical and ranked with most frequent exposure. In Study 2, the vertical bar graphs were recognized significantly faster than horizontal bar graphs and line graphs, with the same ordering of reaction times seen in a previous study by Ratwani and Trafton (2008). In that study, participants had to identify data values in the graphs. Thus, it was ambiguous as to whether the RT differences reflected differences in recognition of the graph type (cf. Pinker, 1990) or aspects of the data extraction process (or both). Our finding, using a much simpler and more process pure task where participants only had to decide whether or not the stimulus is a data graph, thus provides an important replication and clarification. We add the new finding that reaction times for recognition were also influenced by the data pattern, with rising pattern graphs being recognized faster than neutral and falling pattern graphs.
General Discussion
This study investigated the possibility of typicality effects in data graphs using both a traditional typicality ratings approach and a verification task approach. Overall, the results indicate that vertical bar graphs are a more typical graph type than horizontal bar graphs and line graphs. Following from the theoretical and empirical work of Rosch (Rosch, 1975; Rosch et al., 1976) and Pinker (1990), our results suggest that a vertical bar graph is a more typical L-shape graph than are horizontal bar graphs and line graphs.
We also found that data pattern seems to play a role in typicality of graphs. Our results suggest that rising data patterns are more typical than falling patterns, while neutral patterns are perceived as less typical. Pinker explicitly discussed different patterns of descending and ascending staircases in relation to a bar graph schema and Rosch et al. (1976) showed typicality effects with dot patterns. Our results provide additional evidence that data patterns are part of an individuals’ graph schema. One explanation for the lower typicality of the neutral patterns in our stimuli could be that they might occur infrequently, in part because designers often adjust the range of vertical axis so as to highlight the differences among the data points – in line with the design advice to set the data in the focus (e.g. Tufte, 2001).
One practical implication of the study concerns the choice of a graph type. If fast processing and understanding of a graph is crucial, it is perhaps better to use a vertical bar graph. However, in some cases, there are other reasons to choose a different graph. For example, it is often recommended to use horizontal bar graphs instead of vertical bar graphs when the labels are long (e.g. Wickham and Grolemund, 2016). Other criteria can be the class of data and the intended message (focus on contrast vs change over time, cf. Franconeri et al., 2021). A further practical implication of our findings concerns the data pattern. In cases where the order of the values is not determined by other criteria, it may be useful to sort the data to create rising patterns.
Some limitations in our study suggest perspectives for future work. First, the Study 1 sample consisted only of university psychology students. It is possible that they differ from people in the general public in their graph type experience or in other potentially influential factors such as media consumption. Second, our stimuli consisted of a well-controlled but small set of data graphs. This ensured that participants would not suffer fatigue and that practice effects would be minimal. Long sessions in Study 2 might have led the participants to forget that they were dealing with data graphs and instead focus on the discriminating features that with practice would have turned out to best support categorizing the stimuli (cf. Gaschler et al., 2015). Future studies can extend the work by allocating different stimuli to short sessions on different days in a counterbalanced order. This would allow for a broader sampling of stimulus properties, including strength of rising or falling patterns, number of data points depicted, orientation of axes and labels, colour of various visual features, inclusion of error bars, and many other types of graphs in order to obtain greater specificity about the many factors that influence typicality. We look forward to investigating typicality effects in data graphs and how it can be leveraged to increase and optimize viewers’ understanding and decision-making.
