Abstract
Introduction
The concept of presence has commanded much research. Scholars have argued about its different definitions (Ghani et al., 2016). Sheridan (1992) described presence as “feeling like you are present in the environment generated by the computer” (p. 120). Parsons et al. (2017) divided presence into two main perspectives: one refers to an interactive experience with the media and the other, a phenomenon linked to the “organization and control of action” (p. 3). Waterworth et al. (2015) defined presence as “being physically present somewhere” (p. 36), whereas Jacobson (2002) described it as “engagement in a virtual world” (p. 1). In general, presence reflects how real one’s feeling is about the virtual environment depending on individual characters and media qualities.
Sense of presence is an intermediate mechanism of media technology, which helps us evaluate the media and understand user experience (Lessiter et al., 2001). The need to explore ways of creating a higher sense of presence is increasing. However, there is still no clear consensus on how to measure sense of presence, as reflected in existing self-report measures. Witmer and Singer (1998, 2005) developed the Presence Questionnaire (PQ) in 1998 and then re-validated it in 2005. They found that four factors (i.e., involvement, adaptation/immersion, sensory fidelity, and interaction quality) affect one’s involvement and immersion, which measure one’s sense of presence. Schubert et al. (2001) developed the Igroup Presence Questionnaire (IPQ) based on the embodied cognition framework and identified three factors (i.e., spatial presence, involvement, and judgment of reality) to measure sense of presence. Lombard et al. (2000) developed the Temple Presence Inventory (TPI), which includes eight factors measuring sense of presence: spatial presence, parasocial interaction, passive interpersonal presence, active interpersonal presence, engagement (mental immersion), social richness, social realism, and perceptual realism. Among these questionnaires, Witmer and Singer (1998, 2005) focused on environment interaction, whereas Lombard et al. (2009) on social presence. Scales that focus on spatial presence, involvement, and immersion cannot be used for measuring cross-media presence.
Based on previous research and existing questionnaires, Lessiter et al. (2001) developed the Independent Television Commission-Sense of Presence Inventory (ITC-SOPI). This study validated the ITC-SOPI because it can be used in measuring cross-media presence in subjective perception settings (Lessiter et al., 2001). This 44-item scale was originally developed in English. A 5-point Likert-type scale (1 =
The ITC-SOPI has not yet been validated in other countries, including China. Therefore, we translated the 44 items of the ITC-SOPI into simplified Chinese. Then we examined factor structure, internal consistency, and test–retest reliability. Wallach et al. (2010) believed that sense of presence, like empathy, is an emotional projection. In this sense, it can be understood as an emotional connection with a place and, therefore, is related to empathy. Stavropoulos et al. (2017) found that there is a strong correlation between anxiety and sense of presence. Based on their finding, we tested the correlations between the Chinese version of the ITC-SOPI and measures of empathy and generalized anxiety to evaluate the former’s convergent validity as the final purpose. Sense of presence was expected to be significantly related to high empathy and severe symptoms of anxiety.
We used VR as the medium for evaluating the Chinese ITC-SOPI. VR is a three-dimensional virtual situation generated by computer simulation. Under a laboratory environment, researchers can study complex human behaviors using VR to simulate a complex reality (Julia et al., 2015). The word “VR” was put forward by Jaron Lanier in the 1980s (Blanchard et al., 1990). Recently, with the growing popularity of commercial VR products, VR technology has been increasingly applied in social well-being fields, such as nursing, mental health, and learning promotion (Chirico et al., 2016). VR can provide high fidelity and elicit “presence” (Chirico et al., 2018; Steed et al., 2016), which means users experience increased presence, engagement, and usability (McMahan et al., 2012). Here, presence refers to an individual psychological response to a VR system (Bowman & McMahan, 2007). It is the intermediary mechanism in which the VR technology works (Price et al., 2011; Wiederhold & Wiederhold, 2005).
Method
Participants
This study was approved by the Ethics Committee of the Department of Psychology of the Tsinghua University (IRB NO. 2019THUPsy56). Of the 226 Chinese-speaking participants from the University Town of Shenzhen who volunteered to participate, 210 (133 males and 77 females) completed the Chinese ITC-SOPI, with a response rate of 92.9%. Participants’ average age was 23.05 years (
Translation and Adaptation
We employed the forward-backward translation procedure (Beaton et al., 2000). First, the original English version of the ITC-SOPI was translated into simplified Chinese separately by two translators who are bilingual in their native language Chinese and second language English. One translator was a psychologist and the other, a graduate student of the department of Chinese language and literature. Then, members of an expert panel discussed the two preliminary translations until they reached an agreement with the first synthesized version. Next, two fluent English speakers, a graduate student of a master’s in psychology and an undergraduate from the Department of English, translated the synthesized version back into English. Afterward, members of the expert panel compared the re-translated English version to the original. This process continued until the re-translated and original English versions were consistent. Finally, the expert panel, consisting of the two original translators and one professor with experience in scale development and translation, assessed the final version of the Chinese ITC-SOPI. We randomly selected five postgraduates majoring in psychology and data sciences in the Graduate School at Shenzhen, Tsinghua University to check the scale’s acceptability. After this process, the Chinese ITC-SOPI was finalized for data collection.
Process
Data were collected in November and December 2017. Participants were recruited from the University Town of Shenzhen by posting an announcement on campus and social media like WeChat. Potential participants were added in a WeChat group in which a detailed information on this experiment was provided. The VR staff members completed a 2-hr training that emphasized on standardizing the administration procedures to improve data quality. The experiment was held at the VR lab of the Graduate School at Shenzhen, Tsinghua University (Figure 1). First, study purpose and questionnaire characteristics were described to participants. Informed consent was obtained verbally. Participants were assured of the anonymity and confidentiality of their answers. Second, participants wore the VR headset, assisted by an experiment staff member. Third, participants played the four chosen games in sequence. Finally, participants were asked to fill out a written questionnaire independently. The total time of completing the experiment was approximately 45 min: 30 min to experience VR games and 15 min to complete the questionnaire.

Participants undertaking the experiment at the VR lab (participants provided consent).
The questions were reordered in the second data collection to obtain test–retest reliability. Participants were not informed that they would be completing the same questionnaire. They all confirmed that they did not have any VR experience between the two tests.
Instruments
Virtual reality
Data were obtained using the Chinese version of the ITC-SOPI. A detailed information on ITC-SOPI was provided in the introduction. This section discusses a more detailed information on VR games. The current VR headset that provides the best experience is the external Head Mount Display, such as Oculus Rift, HTC Vive, and Sony PS VR. Among these headsets, HTC Vive can provide a more fully immersive first-person experience and more realistic interaction. That is why, we selected HTC Vive as the media display. Participants’ experiences were based on the HTC Vive headset, STEAM, and VIVEPORT APP market. The PC specifications are as follows: Lenovo Y720, CPU Intel® Core™ i7-7,700, Operating System Windows 10, GPU NVIDIA® GeForce® GTX 1070. Considering the item content in ITC-SOPI, we selected the following four games with a realistic environment and interaction for participants to experience:

Games (theBlu, Desert Ride, Fist to Legend, and Fruit Ninja) used in the experiment.
Interpersonal Reactivity Index-C
The Interpersonal Reactivity Index-C (IRI-C) is a Chinese version of the Interpersonal Reactivity Index. Empathy includes emotional and cognitive empathy. The IRI-C evaluates empathy from a cognitive and emotional perspective, making the assessment of empathy more comprehensive (Davis, 1980, 1983). The IRI-C consists of 22 items scored on a 5-point Likert-type scale, ranging from 1 (
Generalized Anxiety Disorder-7
The Generalized Anxiety Disorder-7 (GAD-7) is a seven-item scale measuring generalized anxiety (Spitzer et al., 2006). It measures participants’ level of anxiety, such as whether they are nervous, worried, or annoyed, in the past 2 weeks. Response options include “not at all,” “several days,” “more than half of the days,” and “nearly every day,” which are scored as 0, 1, 2, and 3, respectively. In a previous study, Cronbach’s α of GAD-7 was .89 and the correlation coefficient between items and total score was higher than .63 (Löwe et al., 2008). In this study, Cronbach’s α of GAD-7 was .84.
Data Analysis
In our dataset, 11 of 210 completed responses were missing completely at random (MCAR). Therefore, we used mean imputation for missing data throughout the analyses (missing processing for the factor analysis).
Mplus version 7.2 was used for factor analysis and IBM SPSS Statistics version 20.0, for other analyses. Before conducting factor analysis, item-total correlations were examined to detect low-performing items. Based on a presence theory’s suggestion on the use of a four-factor structure and the previous development of the scale, we employed the confirmatory factor analysis (CFA) to validate the factor structure of the Chinese ITC-SOPI. Data were not normally distributed. The kurtosis and skewness values were 1.12 and −0.17 for spatial presence, 0.51 and −0.26 for engagement, 0.01 and −0.20 for ecological validity, and 0.38 and 0.58 for negative effects. Therefore, the maximum likelihood (ML) estimation was not appropriate for analysis (Finney & DiStefano, 2013). We used the weighted least square with mean and variance correction (WLSMV), which is the default estimation method for ordinal data, to evaluate the fitness of the original four-factor model. These fit indices were considered. Generally, chi-square/degrees of freedom <3, Tucker–Lewis Index (TLI) ≥0.900, comparative fit index (CFI) ≥0.900, and root mean square error of approximation (RMSEA) ≤0.080 were considered acceptable (Bentler & Bonett, 1980; Browne & Cudeck, 1992; Hu & Bentler, 1999). After the factor structure was confirmed, internal consistency (i.e., Cronbach’s α) and test–retest reliability were examined through factor analyses.
The scale requires summation in four sub-dimensions (Lessiter et al., 2001). Therefore, each of the four-factor scores (each generated by calculating a mean of all completed items contributing to each factor) and all of these four factors (spatial presence, engagement, ecological validity, and negative effects) were analyzed individually. These scores were used to investigate the criterion validity (i.e., relationships between factor scores and other scales).
Results
Factor Analysis
The Kaiser–Meyer–Olkin score was 0.881 (0.50 <
Item Analysis and Reliability
The item-total correlations for the 44 items of the Chinese ITC-SOPI ranged from 0.17 to 0.68. Items A1 (“I felt sad that my experience was over”), A2 (“I felt disorientated”), and B17 (“I paid more attention to the displayed environment than to my own thoughts”) had relatively low item-total correlations (
Means, Standard Deviations, Corrected Item-Total Correlation, and Cronbach’s α of the Deleted A1, A2, and B17 Items.
The mean, standard deviation, Cronbach’s α, Cronbach’s α with a 95% confidence interval, and test–retest reliability of each factor are presented in Table 2. Internal consistency was examined using Cronbach’s α. Cronbach’s α ranged from 0.75 to 0.87, suggesting good levels of internal consistency for the four factors (Mellinger & Hanson, 2017). Test–retest reliability is an indicator of the stability of ITC-SOPI scores over time. A subset of participants completed the ITC-SOPI for 2 weeks after the initial examination. The intraclass correlation coefficient (ICC) between the two sessions was 0.82 for spatial presence, 0.84 for engagement, 0.90 for ecological validity, and 0.91 for negative effects, which demonstrated high test–retest reliability (Shrout, 1998).
Means, Standard Deviations, Cronbach’s α, and Intraclass Correlation Coefficient (ICC) of the Questionnaires.
Convergent Validity
Convergent validity indicates the validity of an instrument relative to other standards. We computed Pearson correlations between the total scores of the four factors of the Chinese ITC-SOPI and IRI-C and these scores and GAD-7 to assess concurrent validity. As presented in Table 3, all the four factors of the Chinese ITC-SOPI showed significantly positive correlations with IRI-C and GAD-7. Specifically, spatial presence and engagement were significantly correlated with IRI-C; ecological validity was significantly correlated with both IRI-C and GAD-7; and negative effect was significantly correlated with GAD-7.
Pearson Correlations Between Scores of Spatial Presence, Engagement, Ecological Validity, Negative Effects, and IRI-C and GAD-7.
Discussion
This study validated the Chinese version of the ITC-SOPI among a convenience sample of 210 participants from the University Town of Shenzhen. We used a standard translation process, which involved forward-backward translation, an expert panel, and pilot testing. Our study provided Chinese-speaking researchers with a self-report measure of sense of presence in cross-media. The issue of how to accurately and conveniently evaluate sense of presence is necessary as it is an important intermediary variable in psychological studies, when researchers apply media, especially immersive media like VR, to their experiments (Price et al., 2001; Wiederhold & Wiederhold, 2005). The Chinese ITC-SOPI was validated with the following psychometric evidence: factor structure, internal consistency, test–retest reliability, and convergent validity. This is a pioneering study on measuring sense of presence in mainland China using the Chinese ITC-SOPI.
The various fit indices indicated that the four-factor model was a good fit in the Chinese version of ITC-SOPI. The Chinese ITC-SOPI consisted of four factors: spatial presence, engagement, ecological validity, and negative effects. This study’s factor analysis results are similar to those in the previous research (Lessiter et al., 2001), indicating that the scale had good structural validity in the Chinese context.
The item-level analysis indicated that items A1 (“I felt sad that my experience was over”), A2 (“I felt disorientated”), and B17 (“I paid more attention to the displayed environment than to my own thoughts”) had relatively low item-total correlations; therefore, deleting these items can significantly promote relevant internal consistency. Especially, in the Chinese context, A1 lacks the reason of “disorientated” and can be understood as “sad” because the experience was over or terrible; A2 also lacks the reason of “paying more attention to the displayed environment” and can be understood as disorienting because the experience was so real or bad. Hence, these three items, especially A1 and A2, can be ambiguous in the Chinese-speaking context. A different understanding of A1 and A2 may lead to an opposite result.
The Chinese ITC-SOPI had high internal consistency. Its Cronbach’s α was .87 for spatial presence, .81 for engagement, .75 for ecological validity, and .80 for negative effects. The results were similar to the original study conducted by Lessiter et al. (2001). Similarly, an acceptable test–retest reliability was obtained. Both separate ICC factor scores between the two sessions were highly correlated: 0.82 for spatial presence, 0.84 for engagement, 0.90 for ecological validity, and 0.91 for negative effects, which demonstrated that the results in the Chinese ITC-SOPI were stable over time.
Significant relationships between the factor scores in the Chinese ITC-SOPI and measures of empathy and anxiety were highlighted. High empathy anxiety levels were associated with a higher score of sense of presence. This result indicated that the Chinese ITC-SOPI had good criterion validity. In other words, people with high empathy and high anxiety may also have high sense of presence. More specifically, spatial presence and engagement were closely correlated with IRI-C; ecological validity was closely correlated with IRI-C and GAD-7; and negative effect was closely correlated with GAD-7. These findings are consistent with the results of other previous studies (Browne & Cudeck, 1992; Wallach et al., 2010).
Limitations
Although our findings showed good reliability and validity of the Chinese ITC-SOPI, study limitations should be considered. First, participants were recruited only from the University Town of Shenzhen. These young and highly educated people may not represent all people in China. Moreover, the sample size of 210 is relatively small compared with the original study (Lessiter et al., 2001). Given these limitations, our research results may reflect only Chinese university students. To improve the generalization of this scale, further studies should be replicated in different settings and populations, including those with low education and older adults who lack technological experiences in general, to verify whether the same results are obtained. Second, we only employed two scales (IRI-C and GAD-7) for criterion validity. More variety in measures of personality trait should be included to examine convergent validity. Third, although standardized translating procedures were used, the translation may have deviations from the original. Therefore, in the future, researchers should verify the accuracy of translated items. Finally, we did not analyze whether each item functioned appropriately. Therefore, a multidimensional Rasch analysis is necessary to examine item fit (i.e., infit and outfit).
Conclusion
The Chinese version of ITC-SOPI was validated with high internal consistency, good test–retest reliability, acceptable convergent validity, and a four-factor model, suggesting that the scale can be used for researchers to evaluate sense of presence in a Chinese-speaking context.
