Abstract
Keywords
Human errors are attributed to around 70% of accidents in process industries (Bhavsar et al., 2017; Mannan, S., & Lees, 2012). Control room operators, who monitor plant operations and intervene during process upsets, are associated with most of these errors (Amazu et al., 2023). Although advanced digital tools, such as industrial sensors and control systems, provide operators with critical real-time data, the growing complexity of integrated processes and the need for continuous optimization increase demands on human-machine interaction (Wanasinghe et al., 2022). This often leads to mental overload in operators (Pan et al., 2017), as seen in the BP Toledo refinery incident, where an overwhelming alarm flood delayed critical responses and led to severe consequences (U.S. Chemical Safety Board, 2022). Operators’ performance in managing process control efficiently and safely depends heavily on the human-machine interface (HMI), especially its design and functionality (Bántay & Abonyi, 2024; Xu et al., 2018).
Industry has developed standards and HMI design guidelines for alarm systems, display design, and control room management (Ikuma et al., 2014). Subsequent revisions of the guidelines have proposed specific design recommendations to reduce cognitive load and support effective human-machine interaction (O’Hara & Fleger, 2020). The forthcoming section provides a brief overview of such guidelines. However, many leading technology providers evaluate HMI design primarily through iterative expert reviews, think-aloud protocols, subjective measures, and usability ratings (Tharanathan et al., 2012). These methods may overlook implicit, non-verbal indicators of cognition and behavior, such as how operators perceive, process, and respond to information under stress. This limits the ability to objectively verify whether the HMI truly meets design requirements aimed at reducing cognitive load. Therefore, a more comprehensive evaluation of HMI design is necessary, one that considers various dimensions, including task performance, workload, and situational awareness (Xu et al., 2018). Researchers particularly emphasize the need to integrate cognitive and behavioral characteristics when studying complex human-machine systems (Naderpour et al., 2015; Nazir et al., 2021). Process safety experts also stress the importance of methodologies that bridge this gap, enabling operators to process information efficiently and respond effectively (Bullemer & Nimmo, 1994; Shin, 2014). The emergence of Industry 5.0, which prioritizes human-centered approaches, further reinforces this notion. Together, these efforts highlight the need for user-centered HMI design solutions that enhance operator capabilities.
As sensor technology advances, eye-tracking has emerged as a promising approach for evaluating operators’ responses while interacting with HMIs (Lin et al., 2003). In control operations, eye-metrics could offer potential for developing performance indicators (Manca et al., 2012; Shahab et al., 2021), improving training design in VR-based settings (Kluge et al., 2014; Srinivasan et al., 2022), and enabling human digital twinning (Balaji et al., 2023). Although these methods deliver cost-effective solutions and demonstrate usefulness (Pavlas et al., 2012), most studies rely on participants who are not professional board operators, which limits the applicability of findings to real-world settings.
This paper provides an overview of specific industry standards and design recommendations that emphasize incorporating operators’ cognitive and physiological capabilities into HMI design. It also outlines design initiatives by major technological powerhouses and provides actionable perspectives for human factors and ergonomics practitioners on how eye-tracking metrics reveal operators’ engagement with interfaces and their perception of on-screen elements and support systems. Representative case studies from process control operations illustrate these applications. By involving actual operators in future research, these preliminary works can be strengthened to support the development of next-generation HMIs that are operator-centric and better aligned with industry needs.
Overview of Industry Guidelines on HMI Design
Advances in computer science have enabled industries to integrate computer-powered video consoles as a central interface in Distributed Control Systems (DCSs), transforming how operators interact with critical systems. However, deficiencies in alarm management and graphic display design have contributed to industrial accident severity. For example, in the Brenham accident in 1992, the graphical display did not present pressure and flow trends in an interpretable format, delaying recognition of a pressure drop and causing an overfill and ignition. In the Gramercy incident in 1996, multiple alarms occurred simultaneously, preventing the operator from evaluating each alarm effectively and delaying leak detection, which led to a substantial gasoline release. Similar incidents at Winchester in 2000 and Kingman in 2004 demonstrate failures in alarm evaluation and display interpretation (National Transportation Safety Board, 2005).
Regulatory bodies and industrial organizations provide high-level HMI design recommendations, including display formats, text styles, code labels, and color coding, to enhance readability, clarity, and salience. These features help reduce operator search time in both normal and abnormal situations. Figure 1 illustrates some of the high-level HMI design guidelines. We reference this information from the PETRONAS Technical Standards 14.00.02 (PTS 14.00.02, 2014), and the Fundamentals of Industrial Control (Coggan, 2005). Both guidelines emphasize developing HMI interfaces that support clear, consistent, and ergonomically sound operator experiences. They recommend thoughtful use of color (e.g., avoiding dark hues and using green, yellow, and red for alarms), appropriately sized text and symbols, and minimal clutter so that operators can quickly spot and interpret critical information. In this context, the Human-System Interface Design Review Guidelines (NUREG-0700, Revision 3) provide comprehensive documentation of high-level HMI design principles (O’Hara & Fleger, 2020). Illustration of some of the recommended high-level HMI design guidelines from the PETRONAS Technical Standard and the Instrument Society of America guidelines.
Nevertheless, an important part of HMI design is how users interact with the system, how tasks are performed, and how much workload can affect plant monitoring and control tasks (O’Hara & Brown, 2002). The International Society of Automation (ISA) developed ISA 18.2, which accounts for the interaction between the operator and the interface elements (American National Standards Institute ANSI & ISA, 2009). It is also important to identify interface features that demand excessive attention or cause distractions, which could divert cognitive resources from critical tasks and reduce overall performance (O’Hara & Brown, 2001). Therefore, designers must integrate design principles that address operators’ cognitive and physical needs, ensuring the HMI system supports effective decision-making and minimizes distractions and overload. Although we did not find universal best practices for interface design, we summarize key recommendations that emphasize accounting for operators’ cognitive and physiological capabilities to enhance HMI design and effectiveness in control systems. We sourced these design recommendations from the third revision of NUREG-0700 (O’Hara & Fleger, 2020). 1. Design HMIs to align with cognitive and physiological capabilities while ensuring tasks are engaging and maintaining a balanced workload that supports attention and high performance. 2. Present information clearly and quickly to ensure operators can easily recognize and understand the system status. Evaluate display formats and information density to ensure important details are readily perceived. 3. Assess the impact of new graphics on cognitive load and situation awareness while minimizing visual fatigue. Ensure that any new visual elements do not hinder operator performance. 4. Train operators in task management strategies for high-workload situations and develop test scenarios that assess operator performance and workload under varying conditions. 5. Adopt ecological interface design to align interfaces with how humans naturally perceive and process information.
To address these recommendations, eye-tracking could serve as a practical and objective tool for designers and engineers. It could reveal how users interact with the interface in real time, what draws their attention, what they overlook, and where cognitive bottlenecks occur. These insights could facilitate targeted improvements and help achieve HMI design enhancements following the recommended principles. A more detailed perspective on the application of eye-tracking metrics in human-centered HMI design, along with representative examples, appears in the later sections of this review. Prior to that, we present a brief outline of HMI design initiatives proposed by leading technological powerhouses.
Overview of HMI Design Initiatives by Technological Powerhouses
Technological companies actively develop future-ready HMI designs that are more interactive, user-friendly, and align with the industrial goal of reducing operator search time across all conditions. In fact, these advanced HMI solutions, deployed in real plant environments, achieve measurable improvements in control room operations (ABB, 2025). Figure 2 depicts HMI design initiatives from five of the world’s leading automation and digitalization powerhouses. For instance, Siemens provides “ Overview of advanced HMI initiatives developed by leading automation and digitalization providers.
Nevertheless, the HMI design effectiveness is generally evaluated by iterative expert reviews, think-aloud protocols, subjective measures, and usability ratings (Tharanathan et al., 2012). This evaluation approach may overlook implicit indicators of operators’ underlying cognitive state, limiting the ability to objectively confirm that the HMI truly meets design recommendations intended to lower cognitive load. Here, eye-tracking measures could offer additional benefits to the designers, strengthening the design evaluation process. Eye-tracking could provide a more objective way to evaluate how effectively these design choices work in practice. Practitioners can determine whether users notice, interpret, and act on key interface elements as intended. This could make eye-tracking a powerful tool for aligning interface design with real human interaction, mainly offering a deeper understanding of how operators perceive and process critical information on the screen. The next frontier in HMI design lies in developing human-centered interfaces that truly reflect operator perception and interaction (Mourtzis et al., 2023). This aspect is also central in the five design recommendations stated earlier. In the following sections, we use scientific studies as representative cases to show how insights from eye-tracking metrics shape human-centered HMI design.
Role of Eye-Tracking in Human-Centered HMI Design
Eye-tracking has proven valuable in human-computer interactions (Majaranta & Bulling, 2014), especially for understanding visual attention and cognitive processes (Glöckner & Herbold, 2011; Wade & Tatler, 2005). In Figure 3, we illustrate how eye-tracking is being applied at three progressively detailed levels of HMI design in process control rooms. Within the broadest scope, the “ Hierarchical integration of eye-tracking measures for optimizing HMI design.
Summary of Key Eye-Tracking Metrics in the Context of HMI in Control Rooms.
(I) Global Interaction Analysis
Eye-tracking metrics, particularly fixation counts, offer valuable insights into how operators interact with HMIs by highlighting which AOIs capture their attention (Salehi et al., 2018). For example, during a drilling operation where detecting a drill break and kick was critical, heat maps of operator gaze were recorded for both expert and novice drillers (see Figure 4) (Naqvi et al., 2020). The expert’s gaze, as shown in Figure 4(a), was predominantly focused on the Drilling Parameters (DP) on the display monitor, ensuring that key information, such as the Rate of Penetration (ROP) and Return Flow (RF), was closely monitored to detect abnormalities quickly. In contrast, the novice exhibited a scattered fixation pattern with insufficient attention to ROP and an overemphasis on less critical areas like the Manifold Panel, resulting in a detection delay around 5 times longer than that of the expert. These findings suggest that process tracing through eye-tracking can inform HMI configuration improvements, such as repositioning less important panels further and using subtle animations to highlight key parameters. This, in turn, could help novices to perform critical, time-sensitive tasks. The effectiveness of fixation counts in usability testing and enhancing operators’ situational awareness has also been demonstrated in other studies (Sanfilippo, 2017). Use of heat maps of fixation counts to evaluate the efficacy of HMI design through operator interaction with AOIs (features of this figure are adopted from the original version published by Naqvi et al. (2020)).
Another eye-tracking metric, the percentage of time spent looking at AOI, reveals that under high workload, operators spent less time on the main display and shifted attention toward the faceplate and alarm bar (Ikuma et al., 2014). Unlike fixation counts, which only indicate how often an operator looks at an AOI, the percentage of time spent looking at an AOI accounts for total viewing time on that area, providing insight into the overall attention it receives. Findings by Ikuma et al. (2014) suggest that the percentage of time spent looking at AOI on key interface elements offers a practical and objective metric to assess and refine HMI performance and its usability.
In the same regard as investigating attention on interface elements, dwell duration could be considered a more informative eye-tracking metric that captures not only where operators look, but also how long and how often they engage with important areas. Dwell duration captures repeated visits and sustained attention, where prolonged dwell duration may indicate that an operator is struggling with a decision. A high percentage of time spent on an AOI alone cannot distinguish whether the attention is sustained at a single glance or distributed across multiple glances. This distinction can be captured through dwell duration.
To put this into context, we provide an example depicted in Figure 5, where participants are tasked with increasing the feed flow to the distillation column in response to a flow reduction from a continuous stirred-tank reactor (CSTR). Appropriate attention to the control valve (V201), gauges for feed flow (F105), and temperature (T104 and T106), alongside the alarm summary and trend pane, is important for this task. The arrangement of these AOIs in HMI is particularly strategic for achieving the correct execution of tasks as soon as possible. The dwell duration plot in Figure 5 shows the prolonged focus on F105, its trend, and the alarm summary pane after the alarm at 26 seconds enabled quick recognition of the abnormal state. Afterward, frequent gaze shifts between the V201 slider and the F105 trend during control actions ensured precise monitoring and adjustment until the process stabilized (Sharma et al., 2016). This confirms that dwell duration provides a more refined understanding of operator attention, capturing not only where but also how long and in what pattern critical AOIs hold focus, making it a valuable metric for optimizing HMI design to enhance situational awareness and response efficiency. Eye-tracking dwell duration analysis during disturbance response, highlighting operator attention shifts across critical interface elements (features of this figure are adopted from the original version published by Sharma et al. (2016)).
These are some examples, while other studies have demonstrated the effectiveness of more advanced and sensitive metrics derived from eye-tracking for evaluating HMI design. For illustration, association and salience metrics derived from eye-tracking data provide distinct insights into HMI design effectiveness. The association metric quantifies the alignment between scenario-relevant AOIs and system elements, ensuring that the critical information is positioned for optimal visibility. Higher association values indicate a more substantial alignment between interface elements and task-relevant data. The metric is calculated based on the angle between the scenario and AOI vectors, where a smaller angle corresponds to a value approaching unity, signifying an improved association (Shahab et al., 2021).
The salience metric identifies interface components that are frequently referenced across multiple scenarios, aiding layout optimization. It measures the relative distance of an AOI from the “origin” in an asymmetric plot, where lower values indicate higher relevance (Shahab et al., 2021). For instance, the trend panel, which displays real-time data trends, serves as a consistently referenced AOI owing to its role in tracking variable changes. A salience metric approaching zero suggests that an interface element is integral across operational contexts. Refer to Shahab et al. (2021) for further details on the equations and calculations.
(II) Contextual Support Assessment
As mentioned earlier, the contextual support assessment examines how supplementary displays or software-based support systems influence HMI usability. From this perspective, a case study on the Chemical and Volume Control System (CVCS) used eye-tracking to compare two progressively advanced HMI configurations against a conventional HMI, evaluating their effectiveness in improving visual search efficiency, attention, and mental workload. HMI-1 is the conventional configuration. HMI-2 integrates a hybrid control interface with instrumentation and process control, and computer-based procedures that show upcoming steps, along with overview HMI displays. HMI-3 combines a hybrid board with a dedicated computerized operator support system (COSS) (Kovesdi et al., 2018).
Figure 6 presents the percentage change in eye-tracking metrics to evaluate the visual search efficiency of new HMIs, using HMI-1 as the baseline for comparison. HMI-2 significantly reduced visual search efficiency, as evidenced by longer scan path length and duration, higher fixation and saccade counts, and higher scan transition matrix density. HMI-3 demonstrated better visual search efficiency than HMI-2 but remained less efficient than HMI-1, as indicated by higher relative values of these eye-tracking metrics. Percentage change in eye-tracking metrics indicating visual search efficiency for HMI-2 and HMI-3, relative to the baseline HMI-1. (Stats in this figure are adopted from the original version published by Kovesdi et al. (2018)).
Moving beyond these visual search efficiency findings, both HMI-2 and HMI-3 were found to reduce cognitive effort compared to HMI-1. Specifically, pupil size decreased by approximately 4% and 30%, fixation duration dropped by 39% and 24%, while blink rates increased by about 1.4 and 1.25 times, for HMI-2 and HMI-3, respectively (Kovesdi et al., 2018). In terms of attentional allocation, participants showed significant attention to the AOI displaying the operator support system in HMI-3, as reflected by higher fixation counts and longer dwell durations on the AOI. This indicates that while using HMI-3, operators relied more on the support system than on their search efforts, and this reliance probably reduced their cognitive load. Other studies support this argument, showing that when operator decision support systems were added to process control HMIs, participants fixated significantly more on the support system’s suggestions, as revealed by eye gaze heat maps (Abbas et al., 2025).
In a similar context, one study compared three overview display configurations: (a) Heat Map: a grid of color-coded tiles showing process deviations; (b) Surface Chart: a topographical view where deviations form peaks and valleys; and (c) Visual Thesaurus: a gauge-based interface using bars, balls, and dials (Shi et al., 2021). The Surface Chart display yielded higher click accuracy, faster response times, and significantly greater fixation duration, count, and fixation/saccade ratio. Since both the fixation duration and fixation/saccade ratio reflect more cognitive processing and less visual searching, this indicates that the Surface Chart supports efficient attention to process information.
The authors in another study analyzed how auxiliary input devices in HMI systems affect cognitive load and situational awareness (Shi & Rothrock, 2022). Using the touchscreen interface, participants showed shorter fixation durations and counts, which indicated lower cognitive workload and improved situational awareness. Their lower fixation/saccade ratios suggest they spent more time searching than processing the information. Older adults had longer saccade durations, indicating more cognitive processing with touch screens, but higher saccade amplitudes, suggesting lower cognitive workload when handling abnormal situations. However, their fixation behavior mirrored that of younger adults. Hence, this presents preliminary evidence that saccadic measures may be more sensitive than fixation metrics to mental workload changes induced by variations in auxiliary HMI inputs among older adults (Shi & Rothrock, 2022). Nonetheless, this should be interpreted as exploratory, requiring further research to draw a definitive conclusion. Other studies compared gaze-based control with keyboard input for switching between multiple control screens (Smith et al., 2015).
(III) Element-Level Interface Optimization
For element-level interface optimization, we illustrate a case study where researchers redesigned the Chemical and Volume Control System (RCV) interface elements using cognitive psychology principles to enhance intuitiveness and operator-friendliness (Chen et al., 2019; Yan et al., 2017). Figure 7 shows some of the design reconsiderations introduced by the redesigned version, which include the following: (1) More salient solid lines depicting active flow that can quickly provide operators with the information they need. (2) Consistent grouping of task-relevant parameters and variables to ease the readability of information. (3) Closely arrange the valve information to be compared to the process data. (4) Removal of extra information that could easily distract the operator. Hypothetical redesigning of RCV process-computer interface using cognitive psychology principles (features of this figure are adopted from the original version published by Yan et al. (2017)).

Based on the NASA-TLX index, researchers showed that the redesigned RCV interface reduced mental demand by 14%, demonstrating its improved usability (Yan et al., 2017). Notably, eye-tracking measures revealed that operators using the redesigned interface exhibited significantly shorter fixation durations, smaller pupil sizes, and increased blink rates. These eye-based metrics not only align with the NASA-TLX findings but also highlight their potential as a useful real-time indicator of mental workload arising from element-level interface designs.
Other Relevant Research
This paper explores the effectiveness of eye-tracking metrics in designing and evaluating HMIs, providing representative examples from studies. It is worth acknowledging that numerous other studies have applied these metrics to assess variations in cognitive workload due to task difficulty. For instance, Iqbal et al. (2018) observed more dispersed eye movements, reflected in increased gaze entropy, under higher cognitive demand. During process disturbances, studies reported more prolonged fixation and saccade durations, indicating their potential for monitoring mental effort (Das et al., 2018). Huang et al. (2019) found that task difficulty was associated with more frequent fixations and longer dwell times. Studies also observed larger pupil sizes during mentally demanding tasks and smaller sizes during tasks with lower cognitive demand (Bhavsar et al., 2016; Noah & Rothrock, 2015; Wu et al., 2020). Ha et al. (2016) derived fixation-based metrics to infer operators’ thought processes during nuclear control room tasks. However, this review did not address these studies in the discussion, as it focused specifically on the application of eye-tracking metrics to human-centered HMI design rather than their secondary use in inferring operator states due to task complexity.
Limitations for Industrial Application and Future Directions
While eye-tracking metrics demonstrate strong potential for evaluating HMI design, certain findings highlight the need for contextual sensitivity during interpretation. For example, studies have reported both increases and decreases in saccade amplitude and velocity under varying task demands and performance levels (Abbas et al., 2025; Fan et al., 2022; Shi & Rothrock, 2022), reflecting variability based on task complexity or individual strategies. Similarly, although pupil size is often used as an indicator of operator state, some studies have found it insensitive to certain conditions (Lin et al., 2003). Others have linked pupil size to attention shifts and arousal caused by error, indicating it may not always reflect increased cognitive processing (Gao et al., 2013). Additionally, studies also reported non-linear pupil behavior with memory load (rising initially but decreasing when memory becomes overloaded) (Granholm et al., 1996; Peavler, 1974). These mixed findings do not undermine the utility of eye-tracking measures but present an important caveat that successful industrial implementation requires careful calibration and adaptation to context.
One limitation in the discussed studies is that most demonstrate the efficacy of eye-tracking metrics in various HMI design systems involving non-industrial participants, which may not fully reflect the responses of industrial operators (Abbas et al., 2025; Shahab et al., 2022). Validating these metrics with actual board operators remains essential to establish their practical relevance and reliability. Accordingly, future work could extend these preliminary efforts by involving real industrial operators.
For example, the study by Sharma et al. (2016) can be extended, where dwell patterns of experienced board operators during task execution could reveal expert cognitive strategies, offering a means of understanding decision-making processes that verbal reports cannot easily represent. Furthermore, analyzing visual attention across AOIs during process events may help identify decision-making heuristics of experts that inform evidence-based benchmarks or training guidelines for novices (Ikuma et al., 2014; Naqvi et al., 2020). Likewise, the approach proposed by Shahab et al. (2021) can be applied to quantify the alignment (association metric) and importance (salience metric) of interface elements based on actual operator behavior. This enables designers to tailor HMI layouts to real-world usage. This ensures critical information is optimally placed and frequently attended components receive appropriate emphasis, enhancing both intuitiveness and operational relevance.
Furthermore, real-world process control operations encompass a broad spectrum of operator profiles, including individuals with 30–40 years of experience (Koskinen et al., 2024). Eye-tracking metrics have demonstrated sensitivity to individuals of varying ages, as shown by Shi & Rothrock (2022), who reported that saccade durations and amplitudes significantly differ in older adults. Applying their approach to experienced board operators would further test their validity in capturing expertise grounded in years of plant-specific operational knowledge and ensure applicability to complex industrial settings.
Conclusion
Eye-tracking metrics offer valuable insights for enhancing HMI design by capturing operator perception of design elements, whether it be overall process tracing or element-level optimizations. We present findings grounded in scientific research demonstrating that fixation-related, scan path, and saccadic measures effectively evaluate attention allocation and search efficiency. Pupil size, blink rate, and saccade amplitude provide additional support in assessing mental workload variations influenced by changes in HMI configurations. However, various findings discussed in this paper should be interpreted as exploratory, and for meaningful impact on industries, studies must involve real board operators to ensure insights translate to actual work environments. As Industry 5.0 prioritizes human-centered innovation, leveraging insights from eye-tracking metrics could help design HMI systems that empower operators rather than overwhelm them.
