Abstract
Introduction
Mobile health (mHealth) is used in numerous contexts across many countries. 1 The technology was first developed for chronic diseases but has since been employed for health promotion and self-management as well. 2 Aside from private use, workplaces have also started to use apps as a means of workplace health promotion (WHP) because they are considered a cost-effective solution for supporting employee health and enhancing productivity, as well as employee satisfaction.3,4
Studies have evaluated the various use cases, identifying measurable effectiveness. For instance, research has demonstrated the effectiveness of mHealth in the use of physical activity trackers and wearables at the workplace.5,7–11 Besides physical activity and sedentary behavior,8,12 studies have also targeted mental health (e.g., in terms of stress perception or resilience).6,13–15 The intervention groups in these effectiveness studies have extended beyond office workers to include, shift workers, truck drivers, and airplane pilots.10,16,17
Studies have also illustrated perceived advantages of mHealth e.g. flexibility, personalization, possibilities of real-time progress tracking and feedback, the ability to reach a larger target group, nonstigmatization, 24-hour accessibility, usage independently from time and location, the existing integration of smartphones in daily life, prolonged user engagement, and cost-efficiency.4,6,11,18–23 Findings have further suggested that employers also benefit because the working conditions and health of employees can be tracked to implement health measures tailored to the workforce, hence improving work safety by preventing accidents. 10
However, up until now, detailed app usage behaviors of employees have not yet been extensively studied. The potential of WHP apps has been mainly examined by qualitative and quantitative studies measuring potential acceptance18,21 and by developing theoretical frameworks and models for adoption.24,25 Clearly, more detailed analyses of usage behaviors and actual effectiveness are required. However, according to Bol et al., it is difficult to compare studies focusing on usage behavior patterns because the apps differ widely in terms of their content, focus area, and target groups. 26 Still, Bol et al.‘s study emphasizes that those analyses are important, and a focus on the differences among individuals is crucial to develop better apps. Research indicates a need to perform qualitative research regarding the motivation of end-users. 21 McCallum et al. have argued that researchers should use more log data from the app to investigate the full potential of mHealth and make conclusions about usage behavior. 7
To fill this knowledge gap, the present study aims to evaluate app usage behavior and effectiveness based on health improvements using an explorative approach in a case study. Subgroup analyses and comparisons were conducted to assess users’ specialties and preferences. The basis for the evaluation is usage data gathered by app log and medical data acquired in medical check-ups before and after app usage. The study answers the following two research questions: 1. What are the differences in usage behavior (duration, frequency) among different demographic groups? 2. Did app usage have a significant impact on the measured medical outcomes?
The present research seeks to identify the factors that contribute to the usage of an app; examining this can help support the development of these apps and incorporate the preferred content across different target groups.
Related work
There have been few contributions in the field of app usage behaviors and the effectiveness of mHealth in terms of health improvements for WHP. The relevance of research arises from the fact that the costs for absent employees are increasing, there is increasing labor market competition, and there is a shortage of qualified workers because of demographic changes. 4 Thus, employers need measures to increase employee satisfaction, lower employee turnover and support the health of its their employees. One option is corporate health apps, which were first started by various employers and are now being examined by researchers to assess their effectiveness. 27 Literature reviews have pointed out increased physical activity, better productivity, and higher engagement; however, often, the effects are not significant.28,29 Additionally, a study by Lowenstyn et al. has also reported the potential effectiveness of a medical check-up in combination with the use of web-based information 30 as a new promising measure, supporting our study aim. Some authors have further argued that more studies on the effectiveness of these apps by using a larger number of participants and longer time frames are required. 28 Other studies have contributed in the field of adoption to gain insights into the potential usage and acceptance of end-users.24,31
To explain a range of usage behaviors and detect usage approaches, studies have combined available theoretical frameworks and models, such as the theory of planned behavior (TPB) and technology acceptance model (TAM). 32 Sari et al. modify the unified technology acceptance and use theory (UTAUT) to explain WHP app usage, 24 while Melzner et al. further develop a model based on the TPB, the TAM, and the health belief model (HBM). 25 For their proposed model, they find limited overall validity in terms of variance explained; however, based on the structural equation model, the importance of normative belief and, thus, colleagues’ support in the context of WHP apps is considered important. 33 Yang et al. have conducted a literature review to gather the factors that are relevant for adherence to mHealth apps; they summarize the predicting factors as being adherence, dimensions of adherence, and potential outcomes. 34 The four dimensions to measure adherence to mHealth apps are shown to be the following: “breadth (device functions used and completion of modules), depth (meeting tasks or challenges and behavior change), length (device use time and length of use), and interaction (active or passive interaction with the program elements)”. 34 These dimensions match the focus of our study on different usage behaviors in the app, while also combining them with medical outcomes. However, not all dimensions will be covered in our paper.
In addition to these general theories and theoretical frameworks, studies have also underlined the relevance of demographics to WHP effectiveness in practice. Gender is one factor that has been shown to influence behavior. Specifically, women use apps and the internet to search for more in-depth information about health and tend to use nontechnical health promotion programs in the workplace.35,36 whereas men appear more eager to test new technologies. These findings imply that men are more heavily influenced by perceived usefulness and that women are more attracted by ease of use and subjective norms.37,38 Additionally, findings have suggested that more men have fitness apps installed on their phones, while women seem to be more interested in nutrition and reproduction apps. 26
Age may be another influencing factor of mHealth usage. Older employees have been found to be more engaged in non-technology-based programs, placing greater value on ease of use compared with younger participants, which might be explained by less experience with technology among older individuals. Accordingly, younger individuals have had more mHealth apps installed on their phones. 26 Younger employees are also more interested in content related to physical activity and nutrition, stress management, and technology.39–41
Methods
In accordance with McCallum’s recommendations, the present research considers actual usage data from a WHP app. 7 In this case, the data were derived from an intervention conducted at a German IT company. The current study analyzes data from medical examinations compared with data from activities executed within an app. This specific app can be considered an information-based intervention that does not involve connected wearable functionalities. All data used in the app were obtained from either the check-up and app provider based on the medical check-up results (medical data), self-administered data from questionnaires (subjective health status), or log data of the user in the app tracked by the app provider (app usage data). No permissions for other tracking apps or gadgets were utilized. After the first medical check-up, the users were equipped with the WHP app, which demonstrated their personal results and provided personalized health recommendations and challenges.
Two groups of participants were distinguished. The first group (456 participants) consisted of users who participated in only one medical check-up and the app afterwards. The second study group (99 participants) completed pre- and post-screening within 1 year (December 2017–December 2018) at two different company locations; this included the initial and follow-up medical check-ups, which were conducted either in December 2017 and December 2018 (one-year follow-up period) or in March 2018 and October 2018 (seven-month follow-up period). However, for the comparisons between the two check-up results, no differences were considered between the two follow-up times.
Application and intervention
The intervention was supported by one supplier who executed the comprehensive medical check-up and provided the app. Neither the check-up nor app was explicitly developed for the study, and the design process was not scientifically reported. The provider was chosen by the company based on the comprehensive medical check-up, as well as the technical component. Generally, the intervention consisted of two parts: the medical check-up and app usage afterward.
Recruiting and study group
The study group was composed of employees from a German IT company with around 2500 employees. All employees were informed about the program via the usual communication channels of the company (email and intranet). They were also informed about data usage in the check-up and app and for scientific research (in an anonymous form). Data usage was approved by the internal privacy department of the company, as well as employee representatives. Communications were handled by the human resources department responsible for corporate health management.
Participation in the program was voluntary. No information was collected regarding the reasons for why employees decided to participate. The incentives for participation were that the check-ups took place during working hours and that all costs were covered by the employer. The participation rate was limited by the number of contracted check-ups with the provider. Participation was approved on a first-come, first-serve basis, which might have caused selection bias toward more motivated employees.
Statistical analysis
First, the entire study group (555 participants) was analyzed using descriptives and charts to evaluate usage duration and content (Part A). Then, detailed analyses were carried out concerning the differences in medical check-up results between pre- and post-check-ups for only those 99 participants who participated in both screenings (Part B). All factors were tested for significant differences between the baseline and follow-up screenings using Spearman’s correlation. Furthermore, a t-test was employed to evaluate the differences between medical values because a normal distribution was assumed based on the sample sizes in both groups, as well as the characteristics of the medical values. However, the usage rates were not normally distributed and, thus, needed to be evaluated with nonparametric measures, which were performed using SPSS 26.0.
Results
To answer the research questions, the study was divided into Parts A and B. Part A accounted for all users and focuses on usage behavior after the initial medical check-up and initial usage of the app. Part B investigated the effects of the intervention by monitoring only those participants who participated in two health screenings and, thus, provided pre- and posttest values (99 participants).
Study part A
The datasets from a total of 555 participants were analyzed. Thirty datasets with missing data in the demographics were excluded. Of the valid cases, 456 participants participated in only one screening, whereas 99 (16.9%) completed two screenings. The participants were 39.37 years old (SD = 9.55) on average and ranged from 22 to 67 years old. They consisted of 219 female employees and 336 male employees (60.5%). Their body mass index (BMI) ranged from 17.1 to 40.7, with an average of 24.71 (SD = 3.86). Their body fat percentage was 22.72 (SD = 7.1) on average and ranged from 5.1% to 46.7%. Blood pressure (BP) measures ranged from 100 to 198 (systolic BP left) and from 100 to 196 (systolic BP right), with an average of 129.07–130.86 (SD = 13.75). The average cholesterol/HDL ratio was 3.19 (SD = 1.13), and the average triglyceride measurement was 145.90 (SD = 80.23)
Correlation table for women (
Correlation table for men (
Different usage behaviors in the app
Total activities within the app over time (normalized per user).
Average duration of app usage according to age group.
*Calculation: mean of average time spent on page per individual.
Average duration of app usage according to gender group.
*Calculation: mean of average time spent on page per individual.
Age groups
The time spent on the page increased with the age of the users (except for ages above 60, although this category included only two users). On average, younger users spent less time on one page compared with older users, as confirmed by the Spearman rank correlation (
Gender Groups
On average, women spent slightly more time on the app pages than men. However, it should be noted that, on average, men logged 40.29 entries that were included in this analysis, whereas this figure was 35.67 for women.
Study Part B: Differences in the medical indicators before and after app usage
Demographics of the study participants who participated in two screenings (
Differences in health values between the first and second screenings.
The difference in calculated body age between the first and second screenings was also determined. The calculated body age accounted for a one-year difference (calculation = [Value_Screening1] − [Value_screening2 – 1]) because all participants aged 1 year during the time period. On average, positive developments in the calculation of body age were detected.
Differences: Real age at check-up and calculated age.
aThe calculation is based on the following formula: Value Check-up 1 − (Value Check-up 2 − 1 year); 1 year is deducted to account for the year of aging between check-ups
The metabolic system, especially the liver values, improved. For body composition, muscle mass was notably strengthened, whereas body fat and weight also decreased significantly. These results confirm the findings of other studies on weight management. 43 On average, the participants lost 0.78 kg. The circulatory values, especially the BP values, also improved, which is in line with other studies demonstrating positive results of screening on BP. 44 In a study using web-based interventions, a notable reduction of 0.3 years was found for cardiovascular age (including systolic BP and cholesterol). 45
Discussion
The above results support various conclusions and implications for theory and practice.
When applying the framework of Yang et al. on adherence to mHealth interventions, the app used in the present study provided deep insights by measuring the length and depth of app usage and adherence by duration of usage and content used. 34 These factors were complemented by predictor factors (correlations between demographics and adherence, as well as outcome), as well as medical data for the outcomes (using objective measures instead of self-reported outcomes), to provide deep insights into these relations.
During the evaluation of usage frequency over time, the screening dates were identified as incentives. When considering the distribution of activities over 1 year, upcoming screenings caused a higher frequency of interaction with the app, which, in turn, stimulated awareness of one’s health status. A possible explanation for this finding is that a fear of receiving negative results motivated the users to be more active. Indeed, Jimenez and Bregenzer have recommended a combination of instant feedback and direct health expert feedback to enhance motivation for participation. 46 Balk-Moller et al. have further confirmed that users considered clinical examinations as incentives to change their behavior. 47 Regardless, app usage clearly dropped after a few months, which has been similarly reported by other studies, including technical components. 30 For instance, in a literature review by Buckingham et al. (2018), the attrition rate regarding a follow-up screening ranged from 0% to 74%, and four of the 30 reviewed studies recorded attrition rates above 40%. 8 Of course, the follow-up period varied widely.
However, these high attrition rates do not necessarily need to be considered a limitation or as nonadoption of the app. Because apps are meant to support and incentivize a change toward healthy behavior, permanent long-term usage of the app might not be necessary. It is more important that users become educated about their potential health problems, learn how to change them, and sustain healthy behaviors. Still, studies have indicated that more research is needed to understand the relationship between long-term usage and behavioral change. 48 The incentive of screenings and certain check-up results might reinforce the importance of combining screening with technology, as promoted by studies hypothesizing that workplace screening may be effective in stimulating lifestyle improvements. 30
Focusing on our first research question and the effects of demographics, we can state the following: we studied a typical IT group consisting of mostly young male employees. 49 Men were more likely to participate in a second screening, which, based on other studies, might be explained by the technical component of the intervention. 38 However, other research has claimed that women tend to search more extensively for health information. 35 Generally, positive correlations between elevated check-up results and app usage have led to the conclusion that alarming results e.g. high BMI triggered usage of the app. This phenomenon supports the purpose of WHP but contradicts assumptions that healthy employees are more likely to participate in WHP.50,51 Interestingly, certain findings (especially for BMI and body fat) were significant for women but not for men, implying that the potential triggers for usage might differ. For women, elevated weight and fat measures are apparently a more important incentive to use the app compared with other elevated biomarkers such as cholesterol or BP.
Additionally, the women seemed to spend more time on the app pages, even though this difference was small. Research on usage frequency (not duration) confirms these slight differences.26,52 Moreover, older users also spent more time on individual pages on average. Studies focused only on frequency (e.g., when comparing usage in a meditation app) have confirmed that other content in the app was used more by older users than younger ones. 52
The effects of the intervention showed that the participants generally improved their medical results. When examining the check-up results of the participants independently of their app usage, the values for body composition and metabolic system reflected the most significant improvement over the course of the intervention. This observation has been based on the calculated age compared with the real age, as well as the differences between the first and second screenings. A similar study on a website including screenings and technical usage has yielded similar results to support technical usage. 30 The greatest improvement of those values can be explained by the fact that a healthy lifestyle (i.e., proper nutrition and physical activity) has the strongest influence on such values. In the individual analysis of certain values, significant differences were found in BMI, body fat, and systolic BP. These results clearly signify the positive outcomes of the intervention. Because of the individual design of the intervention, detailed comparisons with other studies would be difficult.
The present exploratory study has conclusively indicated that users need incentives, such as another screening or low health values, for app usage. The use of screening in combination with the app improved the health of most participants, thus achieving positive results for WHP.
Limitations
When interpreting the results of the analysis, certain limitations must be considered. First, recruitment took place through the company process. Participation was approved on a first-come, first-serve basis, which may have introduced bias by involuntarily excluding participants. Additionally, self-selection bias by more motivated employees might have occurred. Regardless, the representativeness of the samples was tested separately for those who attended only one screening and those who completed two screenings. Besides app usage, the company applied other interventions, such as sports courses—during the same time period, but these were performed independent of the present study—which could have influenced the medical indicators. Regarding the calculated body age, the participants aged during the study; therefore, a standardized value of 1 year was subtracted from the difference between calculated body ages to reflect the aging between check-ups. Generally, the sample consisted of only two company samples derived from the same company. Finally, because the intervention was conducted in a practical setting, its effectiveness cannot be directly proven because other influencing factors might have played a role.
Conclusion
The study is among the first to use usage data of a WHP app and medical check-up results to evaluate the usage and effects of an mHealth intervention. The intervention was found to be effective in improving various health parameters, especially regarding body composition and the metabolic system. In a comparison of the activities performed in the app, the screening itself or the expectancy of an upcoming screening seemed to strengthen the motivation to use the app, which emphasizes the importance of medical check-up elements within technical health solutions. Interestingly, the differences in usage behaviors in the gender groups and age groups were further detected. Thus, the present study contributes to theories on app adoption and the potential motivators for initial app usage. Furthermore, the implications for implementation are also evident because an app intervention as a standalone product appears to be less valuable than an intervention that combines screening with technology and incentives for usage.
