Sage Journals: Discover world-class research

Abstract

Non-financial incentives such as badges, ranks, and status are often used to encourage user participation on online platforms. This study focuses on the effect of one such incentive, “status,” in the context of a third-party restaurant-review platform. In contrast to previous research that has mainly focused on the effects of such incentives on subsequent contributions from users who gained statuses, we explore how the intrinsic and perceived quality of content generated by users is impacted after users lose their statuses. Using natural language processing techniques to extract quality metrics from online reviews in our dataset, we exploit a quasi-experimental setting and demonstrate that even though the intrinsic quality of reviews significantly decreases after a reviewer is demoted by a platform, consumers on the platform nonetheless perceive these reviews as disproportionately useful. We draw on inequity theory and the elaboration likelihood model to theoretically support our empirical results, as well as conduct mechanism analyses to rule out alternative explanations. Furthermore, we find that temporal associations with a platform or with an elevated status do not moderate the effect of status loss on the intrinsic and perceived quality of reviews written post-demotion. The implications of our findings are significant for platform managers who manage the design of status-driven recognition systems and must determine how the change in status should be displayed on the platform.

Keywords

User-Generated Content Non-Financial Incentives Platform Policy Gamification Natural Language Processing Online Reviews

1. Introduction

With the growing influence of online reviews on consumer decisions and platform operations (Abrahams et al., 2015; Ceran et al., 2016; Cui et al., 2018; Kumar et al., 2018), one of the major operational challenges that online review platforms face is ensuring the quality of reviews (Chen et al., 2016; Lee et al., 2018). Therefore, if these platforms wish to succeed, they must establish mechanisms that motivate reviewers to contribute not only frequently but also with high quality (Dellarocas, 2010; Sun et al., 2021). To this end, platforms have devised various incentives to attract and retain reviewers who produce high-quality reviews (Burtch et al., 2018). While financial incentives for writing reviews are available (e.g., Khern-am-nuai et al., 2018), several review platforms commonly use non-financial incentives that recognize reviewer contributions (e.g., Goes et al., 2014). Among these non-financial incentives, recognition-based incentives (e.g., badges, status, and rank) are now widely employed by online review platforms (Anderson et al., 2013; Cavusoglu et al., 2015).

While considerable research has explored the effectiveness of recognition-based incentives (Anderson et al., 2013; Goes et al., 2016), recent studies have only begun to examine the impact of a transient, status-based incentive system on user contributions (Bhattacharyya et al., 2020; Zhang et al., 2020). In a transient, status-based incentive system, user statuses are not permanent, as they can be promoted or demoted from their statuses. In turn, platforms also struggle to maintain the exclusivity of status while minimizing the risk of alienating demoted reviewers. Although previous studies have investigated how a reviewer’s performance changes after achieving a certain status (Bhattacharyya et al., 2020), the impact of losing status on review quality has been largely unexplored. In this article, we address such a research gap by proposing the following research questions:

RQ1: What is the impact of status loss on the quality of reviews written by users who lose their status?

RQ2: How do readers perceive the change in review quality following the loss of a reviewer’s status?

In our context, we empirically investigate the implications of status loss on review quality in the context of a third-party, online review platform by using a unique approach that distinguishes between intrinsic and perceived measures of quality. Intrinsic quality measures reflect the objective quality of reviews and are a supply-side construct on an online platform, while the perceived quality of reviews is a demand-side manifestation and is quantified by how informative and helpful review readers (i.e., platform consumers) perceive these reviews to be. This distinction in quality measures, although rarely addressed in the literature, is nonetheless important because the intrinsic and perceived quality metrics of a review come from separate entities, reviewers, and a platform’s review consumers (hereafter referred to as platform consumers) respectively, and can affect a platform’s value differently.

We analyze the impact of status loss on these intrinsic and perceived qualities of reviews by using Yelp as the context. Yelp confers the transient performance-based status Elite and decides at the end of every year which individuals will be promoted to or demoted from this status. To operationalize intrinsic quality, we use six measures generated by natural language processing (NLP) techniques that have been commonly used in prior studies to represent objective review quality: review length (Yu et al., 2024), reading ease (McCloskey, 2021), reading grade (Carlson et al., 2015), the Gunning Fog index (Yin et al., 2016), the Dale Chall readability score (Khreiche, 2020), and lexical density (McCarthy and Jarvis, 2010). To measure the treatment effect of status loss on intrinsic quality, we exploit the fact that status-demotion heuristics are exogenous to reviewers, which offers us a quasi-experimental setting. Furthermore, to measure the effect of status loss on perceived quality, we use review helpfulness votes as a proxy, as reflected in the literature (Khern-am-nuai et al., 2018). Our study takes advantage of Yelp’s practice of displaying a reviewer’s past Elite status, even if the reviewer is no longer Elite, a norm that is common on user-generated contents (UGCs) platforms such as CodeChef and HackerRank as well as online games such as Guild Wars 2 and Destiny. Thus, when consumers read a review written by a demoted reviewer, they can easily recognize that the reviewer was formerly an Elite status holder. Consequently, we can identify whether the perceived quality of reviews posted by demoted reviewers is affected differently than their intrinsic quality due to the display of past statuses. We note that a platform’s decision to demote a reviewer is completely exogenous to platform consumers, which again provides us with a quasi-experimental setting. Subsequently, we leverage propensity-score matching (PSM) and the difference-in-differences (DiD) estimation strategy, for which the treatment group consists of demoted reviewers, while the control group consists of matched, current Elite reviewers. We find that platform consumers perceive reviews written by demoted reviewers and those by current Elite reviewers as statistically similar in quality. In conclusion, although reviews from demoted reviewers may exhibit a decline in intrinsic quality, there is no corresponding decrease in perceived quality. This suggests that perceived quality is asymmetrically rated higher than intrinsic quality. Thus, our study highlights this discrepancy between intrinsic and perceived quality metrics, which may subsequently impact the value and sustainability of a platform over the long run.

Our main findings have direct implications for platform operations, which have emerged to be one of the key issues in the operations management literature (Khern-am-nuai et al., 2024; Sun and Xu, 2018; Yan et al., 2019). The apparent loss in intrinsic quality would be less of a problem for platforms if these reviews were consumed less or perceived to be commensurately less helpful. However, we find that reviews written by demoted reviewers are asymmetrically perceived to have higher quality by platform consumers. We use inequity theory and the elaboration likelihood model (ELM) to theoretically explain our empirical findings. In particular, inequity theory suggests that the decline in intrinsic quality may be driven by the feeling of unfairness perceived by demoted reviewers, who respond by posting lower-quality reviews. Using mechanism analyses, we rule out the possibility that this result could be driven by alternative explanations such as a pre-demotion loss of interest in the platform among demoted reviewers or change in the review-writing strategies of demoted reviewers post-demotion (i.e., by opting to write reviews on business units distinct from those they previously covered before their demotion). We conclude that the disparity (i.e., the perceived quality of reviews posted by demoted reviewers is unjustly higher than their intrinsic quality) can be attributed to the platform’s strategy of displaying these demoted reviewers’ past Elite status. The ELM suggests that such a difference in the two quality measures could arise if review consumers give more weight to peripheral cues (i.e., the badge that shows that a demoted reviewer once held status) than to central cues (i.e., the intrinsic quality of the reviews). We use two additional analyses to lend support to this mechanism. First, we compare if reviews of similar intrinsic quality on Yelp were perceived differently if they were written by demoted reviewers as opposed to reviewers who never held Elite status. We find that the perception of review quality, after accounting for intrinsic quality measures, is statistically higher for demoted reviewers. Second, we use a randomized experiment, which conclusively proves that the perception of review quality is indeed guided by peripheral cues rather than central cues of quality. The experiment also demonstrates that our results from the main analyses are not driven by the potential failure to capture intrinsic quality by the measures that we use in this study. Additionally, we study the role of temporal effects by using two different moderators: how long a reviewer has been with a platform and how long a reviewer held a certain status before losing it. We find that neither of the moderators plays a significant role in moderating the effect of status loss. Next, we review the related literature and discuss the theoretical underpinning of our article.

2. Background Literature and Theoretical Underpinning

2.1. Related Literature

2.1.1. UGC Platform Strategies and User Performance

The literature in this stream of research usually examines UGC from a supply-side perspective (e.g., Duan et al., 2008; Hu et al., 2009; Pavlou and Gefen, 2004; Wu and Zhao, 2023). For instance, past literature has identified factors that platforms can use to drive individuals to ask or answer questions in knowledge-sharing communities such as StackOverflow (e.g., Pu et al., 2022). Meanwhile, researchers have also studied the designs of UGC platforms. For example, Burtch et al. (2022) studied the impact of peer awards on the contribution behavior of Reddit users. In the same vein, Rishika and Ramaprasad (2019) demonstrated that the contribution behavior is significantly affected by the symmetricity of social ties, social embeddedness, and tie strengths in the online community.

Another sub-stream of research in this area focuses on platform strategies regarding the use of incentives to encourage content contributions. Particularly, numerous prior works study the implications of incentives on the content contribution behavior, including monetary incentives (Burtch et al., 2018; Khern-am-nuai et al., 2018; Qiao et al., 2020) and non-monetary ones (Rasool and Pathania, 2023; Yu et al., 2023; Zhang et al., 2020), and how incentives impact sellers on the platform (Fradkin and Holtz, 2023; Qiao and Rui, 2023). Our study is closely related to prior works in this sub-stream that involve the status of platform contributors. In particular, the literature suggests that status is a strong motivational factor (e.g., Lefevere, 1983). Relatedly, Cheng et al. (2020) empirically demonstrated the economic implications of the social reputation system. They find that the reputation badges of the sellers have significant and positive impacts on sales. Outside the e-commerce context, the implications of status loss point to conflicting results. Notably, most of these studies have investigated domains in which individuals are bound by employment or a commitment that makes individuals’ participation non-voluntary; in such settings, the effects of losing status are negative (e.g., Marr and Thau, 2014). Meanwhile, status loss involving participants who act voluntarily has been studied in contexts such as gaming platforms and loyalty programs where the loss of status can either yield negative effects (e.g., Duguid and Goncalo, 2015; Wagner et al., 2009) or positive effects (e.g., Deodhar et al., 2019; Pettit et al., 2010).

Our study contributes to the literature on platform strategies and user performance by examining the effectiveness of transient status awards in two distinct ways. First, we investigate the impact of status demotion in online review platforms, for which the primary benefit of status is social recognition, and in which individual engagement is mainly driven by psychological factors such as altruism and moral responsibility, rather than the financial incentives featured in loyalty programs (Ke et al., 2020; Lampel and Bhalla, 2007). In the context of our study, Yelp explicitly prohibits participating businesses from offering monetary incentives for reviews.¹ This nuance in our empirical setting ensures that the identification of the effects of status loss on review quality are not confounded by the financial motivation of users to write reviews. Second, we study a status evaluation system in which the criteria for change in status is endogenous to the evaluator (i.e., the platform), and, thus, unknown to the reviewers. Prior studies on status loss have typically used settings in which the criteria for gaining and maintaining status are publicly known (Liang et al., 2017). However, when the criteria for demotion are endogenous, a loss of status can breed perceptions of unfairness and injustice among affected reviewers, which can impact their subsequent contributions. We expand upon this perspective in Section 2.2 where we discuss theories that underpin our empirical investigations.

2.1.2. Display of Status and Perceived Review Quality

User-generated reviews play an important role in the decision-making process of potential consumers, who rely heavily on UGC in the absence of other information (Dellarocas, 2003). The volume of reviews, tone of reviews, star ratings, and intrinsic as well as perceived quality of reviews all seem to affect the potential sales of the focal product/service (Floyd et al., 2014). However, a potential consumer using these reviews may not be acquainted with the reviewer(s). Thus, that consumer may question the credibility of the content generated by the latter. Therefore, the large variance in the veracity of this indirect experience coupled with the ever-growing volume of reviews requires consumers to use cues to quickly identify the most credible and relevant content (Yin et al., 2014). To help consumers search for credible information more efficiently, these platforms have instituted several measures based on peer evaluations of reviews or perceived quality measures, which then serve as proxies for the intrinsic quality of these reviews (Cao et al., 2011; Mudambi and Schuff, 2010; Pan and Zhang, 2011).

Intricately linked with perceived review quality is reviewer credibility. With respect to designing online review platforms, Dellarocas (2010) underscores the importance of “what information should be included” in a reviewer’s profile. As mentioned previously, recognition-based incentive systems are prevalent in online review platforms, and it is a common practice to show reviewers’ statuses or badges on the reviewers’ profile pages (e.g., “Top Contributor” on TripAdvisor, “Elite” on Yelp). Many platforms that employ a transient status-based reward system often show the past status of reviewers. For example, Yelp shows the years during which reviewers held “Elite” status, even if they currently do not hold that status. Similarly, HackerRank displays top leaderboard accolades that a user received previously. Such a practice is also commonly observed in massively multiplayer online role-playing games, such as Guild Wars 2 and Destiny. While this practice most likely stems from a platform’s attempt to keep demoted users motivated, it can have externalities on the platform that remain unexplored, constituting a research gap that our study intends to address.

2.1.3. UGC Quality Control

While consumers have relied on high-quality reviews to make future consumption decisions, companies for whom these reviews have been written have used such responses to mitigate consumer concerns, thereby increasing consumer satisfaction (Abrahams et al., 2015). However, operationally, there are quality-related challenges that an online review platform faces. Chen et al. (2016) observed that online reviews are at best incomplete, for they lack the opinions of consumers who never write reviews, which could lead to reporting bias. Chen et al. (2016) also find that positive experiences are reported more than negative experiences. Online review platforms are also subjected to sentiment manipulation from strategic parties, as shown by Lee et al. (2018) in the context of movie operations, and such manipulation could lead to a decline in information quality and loss of consumer welfare.

Our work contributes to the literature on the quality management of online reviews by providing insights into how the dynamics of intrinsic quality and perceived quality are impacted differently by a platform’s strategies on review quality management, specifically the decision to demote a reviewer from a transient status (“Elite” status in our empirical context). First, we empirically demonstrate that a platform’s strategy to demote reviewers leads to a decline in the intrinsic quality of reviews. Second, we show that a platform’s strategy to display the past status of the demoted reviewers leads to perception bias, in which a “poorer” review written by a demoted reviewer is unduly perceived to be of higher quality. In addition, our study contributes to the literature on the unintended consequences of platform policy (e.g., Joglekar et al., 2016; Anderson et al., 2023; Mayya and Viswanathan, 2024) by demonstrating that the policy to demote Elite reviewers not only directly impacts the quality of reviews written by demoted reviewers but also indirectly impacts how these lower-quality reviews are perceived by review readers. Last, our work is also related to the preferred partnership literature (Meehan and Wright, 2013; Sahaym et al., 2023) as we unveil how the removal of the “preferred partner” tag (i.e., the Elite tag in our empirical context) by one entity impacts relevant stakeholders.

2.2. Underpinning Theories

In our review of the literature, two relevant theories emerge as potential mechanisms that could inform our empirical analyses. In this subsection, we expand upon these theories and discuss how they underpin the contribution behavior of demoted reviewers and how consumers perceive the quality of reviews.

2.2.1. Inequity Theory

Recall that in our empirical setting, the criteria for status demotion are endogenous to the platform, which may induce perceptions of unfairness among demoted reviewers. Such perceptions and corresponding changes in behavior are typically explained by inequity theory.

Inequity theory, developed by Adams (1963) and based on Festinger’s theory of cognitive dissonance (Festinger, 1957), explains how individuals perceive and evaluate fairness with respect to their inputs and outcomes compared to those of others. The theory states that individuals evaluate the value (Outcomes, such as salary, increment, promotion, and status) they get from a task with respect to what they have invested into conducting the task (Inputs, such as effort and skills). Individuals then evaluate fairness or equity by comparing the ratio of their outcomes-to-inputs to that of relevant others (Pritchard et al., 1972). Subsequently, if inequity is perceived, then affected individuals may engage in inequity reduction by working harder to change their inputs or outcomes. In contrast, perceived inequity may demotivate individuals with respect to both the task and the evaluation environment, resulting in a reduced effort that leads to a decline in performance and, in the extreme, departure from the task (Goodman and Friedman, 1971; Pritchard, 1969).

In the context of our study, reviewers expect to be recognized with fairness by the platform to which they are contributing. Hence, if reviewers believe that their contributions are comparable to those of other reviewers who held similar status in a given period, then the former would expect the platform to allow them to retain their status in the next period. Inequity theory in online community settings has been applied primarily to identify different factors that motivate contribution behavior. For example, Chou et al. (2016) showed that a sense of virtual community is fostered by the perception of online justice or fairness, which leads to value co-creation behavior. Drawing upon inequity theory, Feng and Ye (2016) suggested that in online communities, individuals who consume reviews perceive themselves to have gained knowledge unfairly from the efforts of others and, therefore, engage in reciprocal contributions to restore equity. Conversely, Bhattacharyya et al. (2020) used the inequity theory to explain that eligible but unacknowledged members of an online review platform may reduce their contributions because of a sense of recognition inequality. In our context, we employ inequity theory to illustrate how the intrinsic quality of reviews is affected for individuals when they are demoted from a status on an online community platform.

2.2.2. Elaboration Likelihood Model

As previously discussed, consumers of an online review platform typically develop their perceptions of review quality based on various factors. Our study focuses on one of these factors, reviewer status, as the primary variable of interest in our empirical analysis. In that regard, previous studies have extensively drawn upon the ELM to explain the formation of perceptions among consumers.

ELM proposes that a recipient’s perception of information is influenced by both the content of the information and the context in which it is presented (Petty et al., 1986). Zhu et al. (2014) draw upon ELM to theorize that consumers seeking information could focus on “central cues” and be influenced by “argument quality,” or they could value “peripheral cues” more and be influenced by “source credibility” (Khern-am-nuai et al., 2023; Petty et al., 1981; Petty and Cacioppo, 2012; Sussman and Siegal, 2003). More precisely, Zhu et al. (2014) draw upon the source-credibility model to discuss the evolution of opinion leaders in online communities. Source credibility is defined by Ohanian (1990) as “a communicator’s positive characteristics that affect the receiver’s acceptance of a message.” The source-credibility model dates to the seminal work of Hovland et al. (1953), who found two factors leading to the perceived credibility of the communicator, namely, “expertness” and “trustworthiness.” Hovland et al. (1953) defined “expertness” as “the extent to which a communicator is perceived to be a source of valid assertions” and define “trustworthiness” as “the degree of confidence in the communicator’s intent to communicate the assertions he considers most valid.” Subsequently, Ohanian (1990) added a third factor called “source attractiveness,” which was proposed in the source valence model by McGuire William (1985), in contrast to the previously mentioned two-factor source credibility model from Hovland et al. (1953).

In the context of online review platforms, we posit that the credibility of the reviewer (or source) plays a critical role in shaping platform consumers’ perceptions of a review’s quality. In the online context, while a potential consumer may not have any personal acquaintance with a reviewer, the social status or recognition conferred by peers or the platform can act as a direct substitute for all three parameters (expertise, trustworthiness, and source attractiveness) of the source credibility model. In a similar vein, signaling theory suggests trust and credibility about others’ actions (e.g., written reviews) are reinforced when the superiority of the other is demonstrated through a signal (e.g., platform-conferred status recognition) that is difficult to replicate and is costly to obtain (Donath, 2007). Therefore, while “argument quality” determines the intrinsic quality of reviews, “source credibility” impacts perceived quality. In our specific context, we leverage ELM to discuss how a change in reviewer status, a peripheral cue, influences consumers’ perceived quality of such reviews.

In summary, our empirical investigations are guided by the inequity theory and the ELM. We summarize how these theories underpin the constructs used in our empirical analyses in Figure 1.

Figure 1.

Conceptual model of our study.

3. Research Context and Data

In this section, we describe our research context and the data we use in our empirical analyses.

3.1. Empirical Setting

We use Yelp as our focal online review platform. With over 70 million active reviewers and 200 million reviews, it is one of the largest review platforms on the Internet (Smith, 2021). Reviewers on Yelp can leave a review for a product or service that they have used and can help the business with feedback and word of mouth. Around 60% of Yelp businesses are restaurants, cafes, or food and beverage services.

At the end of each year, reviewers can nominate themselves for “Elite” status. The nominations are evaluated by Yelp’s local area manager, and this process is known to be subjective (Nilsson et al., 2018). Therefore, it is not guaranteed that two reviewers with similar contributions will be conferred (or not conferred) status similarly. Each year, Elite reviewers account for about 2% of Yelp’s reviewer base, and the reviewers who receive this status retain it for the next calendar year. These reviewers are automatically renominated for the continuance of their status. However, some of them may not be conferred the same status in the following year if they do not meet the criteria that are endogenously decided upon by Yelp. More importantly, a reviewer who is declined a promotion (or is demoted from an existing status) is unaware of the exact criteria behind the platform’s decision (Bhattacharyya et al., 2020). We leverage this process to develop a quasi-experiment-based identification strategy in our subsequent empirical analyses. We primarily use Yelp’s academic dataset,² which provides information for different business units from 11 cities between 2006 and 2018. There are over 6 million reviews written by 1.2 million reviewers for 192,609 businesses in total. For each review, the dataset provides the date of a review, text written by the reviewer, the star rating conferred by the reviewer, the compliments that a review received (e.g., useful, funny, and cool), and the business for which the review is written, as well as the business’s characteristics and categories. Reviewer details consist of the reviewer’s location, years of activity on Yelp, number of reviews written, years the reviewer maintained Elite status (if any), and overall compliments received.

Our dataset consists of reviews written in 11 cities only. However, it is important to note that reviewers in our dataset may write reviews in other locations that are not a part of the Yelp academic dataset, which imposes two issues. First, we are interested in studying reviewer behavior by using an observable dataset, and it would be problematic to include reviewers whose behavior is largely unobserved. For example, we find reviewers for whom we observe only one review in our dataset. However, our dataset is restricted to only 11 cities, and it is possible that such reviewers wrote many more reviews for businesses outside of these 11 cities. Second, the literature has shown that review-generating behavior is significantly different when reviewers write reviews outside their base location (Kokkodis and Lappas, 2020). We take two steps to alleviate these issues. First, we follow the approach used by Bhattacharyya et al. (2020) to filter out reviewers who have written > 25% of their reviews outside our dataset (i.e., outside the 11 cities captured in the Yelp Academic dataset). This practice ensures that the reviewers included are not tourists. In addition, we use the unique reviewer ID information from the dataset and program a web crawler that collects additional data for each reviewer. These additional data include all the reviews written by focal reviewers beyond the cities covered in our primary data. We augment our existing dataset with these additional data and use it for our main analysis. In total, there are 495,430 reviews written by reviewers who have achieved Elite status at least once.

3.1.1. Intrinsic and Perceived Review Quality Measures

In the extant literature on online reviews, researchers have used several variables to proxy the intrinsic quality of reviews. In this article, we use the following six variables that are commonly used in the literature to capture intrinsic review quality:

Review length: This measures the number of alphanumeric characters in a review. The past literature has shown that longer reviews contain more information and therefore are commonly considered to have higher quality (Mudambi and Schuff, 2010).

Flesch reading ease: This is a score that measures the ease of reading a text. It considers the total number of sentences, words, and syllables included in a document to generate this score. A higher score means that the textual content is simple to read. As such, prior studies commonly denote reviews with low Flesch reading ease scores as high-quality reviews (Garnefeld et al., 2021; McCloskey, 2021).

Flesch-Kincaid reading grade: This metric measures the grade level of education required to produce textual content. Accordingly, prior studies tend to interpret a review with a higher Flesch-Kincaid grade (i.e., a review produced by a writer with a higher level of education) as a higher-quality review (Carlson et al., 2015; Manchaiah et al., 2020).

Gunning fog index: Similar to the Flesch-Kincaid reading grade, the Gunning fog index measures the education grade of the writer based on the text. Accordingly, the literature has associated a review with a higher Gunning fog index score (i.e., higher-education grade of the review writer) with higher review quality (Khern-am-nuai et al., 2018; Yin et al., 2016).

Dale Chall readability score: This score also measures the grade level required to write the textual content. A score below 4.9 means that the review is written by someone with grade 4 or below capabilities, while a score of above 10 means that the review is written by someone with grade 16 or above capabilities. As such, prior studies usually treat reviews with a high Dale Chall readability score as high-quality reviews (Khreiche, 2020; Zhang et al., 2022).

Lexical density: This measures how many different lexical words are present in a text divided by the total number of words in the review. Therefore, a higher quality review tends to have a higher lexical density score (McCarthy and Jarvis, 2010).

We extract these intrinsic quality measures from the review text using a package called TextStat in Python (Bansal and Aggarwal, 2021). The operationalization of these variables is summarized in Section A of the E-Companion, where we also provide visualization examples of these measures.

To measure the perceived quality of a review, we create two measures. On Yelp, review readers can vote reviews as “useful,” “cool,” and “funny.” In existing literature on online reviews, these votes are typically considered collectively as indications of review quality (e.g., Bakhshi et al., 2015; Li et al., 2019). Therefore, our first measure, termed Compliments, aggregates all types of votes for a post. Specifically, we define compliments as the sum of useful, cool, and funny votes. Furthermore, we recognize that some previous studies, particularly those utilizing Yelp data, have particularly focused on review usefulness (e.g., Srivastava and Kalro, 2019; Ceylan et al., 2024). Thus, to align with this prevailing approach, we also separately analyze the “useful” votes. Accordingly, we introduce our second variable for perceived quality, termed Useful, which represents the count of votes designated as “useful.”

In Table 1, we present the summary statistics of our variables of interest, that is, the intrinsic quality measures and perceived quality measures for the 9,879 reviewers who lost statuses during our observation period. We also include our control variables, namely, the average star rating, the number of years since the reviewer has contributed to Yelp, and the number of years since the reviewer has/had Elite status.

Table 1.
Summary statistics of the variables of interest (per reviewer).

Pre-treatment Post-treatment

Variable type Variable Mean Std Dev Min Max Mean Std Dev Min Max

Intrinsic quality Review length 885.4 428.0 19.0 4896.0 843.5 487.7 19.0 4974.0

Reading ease 42.0 58.5 −2513.6 121.1 48.2 58.9 −2513.6 122.9

Reading grade 20.1 21.9 −3.5 367.1 17.6 21.3 −3.5 367.1

Gunning fog index 22.4 22.3 0.4 252.2 19.9 21.6 0.4 266.2

Dale Chall readability score 8.2 2.8 0.1 36.2 7.9 2.7 0.1 38.2

Lexical density 87.1 16.5 0.0 1575.0 85.9 22.6 0.0 1575.0

Perceived quality Useful votes 1.1 6.2 0.0 1122.0 1.5 2.2 0.0 44.8

Compliment votes 2.2 12.8 0.0 2255.0 3.3 5.1 0.0 125.8

Control Number of reviews 50.4 50.6 1.0 776.0 14.4 18.3 1.0 350.0

Average rating 3.8 0.4 1.0 5.0 3.9 0.7 1.0 5.0

Years on Yelp 3.9 1.8 0.0 13.2 5.1 1.9 0.0 13.2

Years as Elite 2.5 1.3 0.0 8.0 3.5 1.4 0.0 9.0

Note: The summary statistics above are generated from 9,879 reviewers who lost their statuses during our observation period.

		Pre-treatment	Post-treatment
Intrinsic quality	Review length	885.4	428.0	19.0	4896.0	843.5	487.7	19.0	4974.0
	Reading ease	42.0	58.5	−2513.6	121.1	48.2	58.9	−2513.6	122.9
	Reading grade	20.1	21.9	−3.5	367.1	17.6	21.3	−3.5	367.1
	Gunning fog index	22.4	22.3	0.4	252.2	19.9	21.6	0.4	266.2
	Dale Chall readability score	8.2	2.8	0.1	36.2	7.9	2.7	0.1	38.2
	Lexical density	87.1	16.5	0.0	1575.0	85.9	22.6	0.0	1575.0
Perceived quality	Useful votes	1.1	6.2	0.0	1122.0	1.5	2.2	0.0	44.8
	Compliment votes	2.2	12.8	0.0	2255.0	3.3	5.1	0.0	125.8
Control	Number of reviews	50.4	50.6	1.0	776.0	14.4	18.3	1.0	350.0
	Average rating	3.8	0.4	1.0	5.0	3.9	0.7	1.0	5.0
	Years on Yelp	3.9	1.8	0.0	13.2	5.1	1.9	0.0	13.2
	Years as Elite	2.5	1.3	0.0	8.0	3.5	1.4	0.0	9.0

3.2. Research Design

Using the data we described in the previous section, we construct our empirical models. Since our first objective is to establish the causal effect of status loss on the intrinsic quality of future reviews, we face the endogeneity issue of who gets demoted. Fortunately, Yelp uses an endogenous promotion/demotion process (i.e., it does not reveal the exact criteria it uses to promote a reviewer to Elite status or to demote a current Elite reviewer). We leverage this selection process to set up our research framework as a quasi-experiment, wherein we use PSM to control for potential endogeneity concerns and subsequently a DiD technique to estimate the treatment effect(s). These techniques have been extensively used in studies that analyze causal inferences (e.g., Kokkodis and Lappas, 2020; Mayya et al., 2021; Sharma et al., 2020). Within this framework, our analysis is akin to a two-group experiment, in which the reviewers who lost status are considered to have received the treatment while those who did not lose status constitute our control group. When we compare both the difference between the treatment and control groups and the corresponding difference between the pre-treatment and post-treatment periods, we can causally identify the effect of the treatment (i.e., loss of status) on intrinsic quality and the perceived quality of reviews. It is also worth noting that Yelp does not distinguish reviews written by friends and those written by other reviewers, which ensures that our estimations are not impacted by potential peer effects.

3.2.1. Data Structure

In our article, we construct the data at the reviewer level (i.e., each observation corresponds to a reviewer). For each time a reviewer lost status in our dataset, we create an observation that consists of two time periods. The first time period ( $t = 0$ ) corresponds to the year before a reviewer lost status. The second time period ( $t = 1$ ) corresponds to the year after a reviewer lost status. Note that the year a reviewer lost status may be different for different reviewers. To ensure that our analyses are conservative and to avoid reviewers who lost the Elite status just after gaining it, we only include reviewers who have had the status for at least 2 years before losing it. For example, a reviewer may have had the Elite status in 2008 and 2009, but then lost that status at the beginning of 2010; in such a case, we consider the behavior of this reviewer in the years 2009 and 2010, such that 2009 represents the pre-treatment year and 2010 represents the post-treatment year. Of note, we observe that 288 reviewers have lost status multiple times; for these reviewers, we only include the first time they their lost statuses for our analysis. With this construction, our treatment group has a total of 8,723 reviewers who lost Elite status. This set of demoted reviewers constitutes our treatment group.

Meanwhile, to create the control group for our analyses, we implement the following procedures. A reviewer who never lost status during the time frame of our data set becomes a candidate for the control group. Moreover, in our setting we allow a demoted reviewer to be a control-group candidate, albeit not in the same year that the reviewer lost Elite status. More specifically, for these reviewers we only consider the not-yet-demoted years as part of the potential control group observation. Furthermore, for such a reviewer, the following years cannot be observations for the pre-year in the control group: the first year that the reviewer achieved Elite status, the year at the end of which that reviewer lost status, and the year after the reviewer lost status. With these heuristics in place, for the qualified control-group candidates, we construct an observation per reviewer per year that consists of two time periods. The first time period ( $t = 0$ ) corresponds to the pre-treatment year. The second time period ( $t = 1$ ) corresponds to the post-treatment year. We use these observations to develop the control group for our study, as we will explain in detail in the next subsection.

3.2.2. Propensity-Score Matching

To perform our empirical analyses, we must control for the endogeneity related to how an Elite reviewer is demoted. In parity with usual practices in quasi-experimental settings, we use PSM to find the matched control group of current Elite reviewers to compare with our treatment group of reviewers who had lost Elite status. Since our objective is to identify a pair of reviewers who write reviews of similar quality in the pre-treatment period, we follow prior studies and use variables that represent review quality as matching covariates (e.g., Qiao et al., 2020). Specifically, as we discussed earlier in Section 3.1.1, these variables include review length, the Flesch reading ease, the Flesch-Kincaid reading grade, the Gunning fog index, the Dale Chall readability score, and lexical density, all of which capture intrinsic review quality. We also include usefulness votes and total compliments, which represent perceived quality.³ In addition, we include more covariates to improve the robustness of our matching by considering variables used in studies that employed reviewer-level matching in the context of online reviews. These variables include average star rating, the number of years since a reviewer has contributed to Yelp, and the number of years since a reviewer gained/lost status (e.g., Khern-am-nuai et al., 2018). We performed this matching using all data in the pre-treatment period, for which we had 8,723 treatment reviewers and 15,689 potential control group reviewers. We estimate the propensity score using a logit function and use nearest-neighbor matching without replacement to obtain our final matched data, which contain 8,723 unique treatment reviewers and 5,925 unique control reviewers. Given that the majority of reviewers experienced a status loss within our 12-year observation window, we permit demoted reviewers to serve as control users during the time they retained Elite status. Consequently, a demoted reviewer may function as a control user(s) multiple times in our matching process, with their Elite years serving as control observations. Thus, the overall count of unique control reviewers in our matched dataset is lower than that of treated users.

We next validate our matching by testing the balance between the variables of the treatment group against those of the matched control group. It is important to note that we perform PSM because we aim to use our DiD analysis as our primary empirical specification. However, because our dataset consists of only two time periods, the balance test essentially serves as a test for the parallel trend assumption as well. Here, we evaluate the balance between the treatment group and the control group in PSM-based matching using the standardized mean difference (SMD) (Cohen, 1988). As shown in Table 2, we achieve excellent matching between the treatment group (reviewers who lost Elite status) and the control group (current Elite reviewers) since none of the variables has an SMD higher than 0.20, which is the threshold that indicates a lack of balance between the treatment group and the control group (Zhu et al., 2024).

Table 2.
Covariate balance between treated and control groups.

Variable Lost Elite Current Elite SMD

Useful 1.079 1.024 0.022

Compliments 2.149 2.061 0.015

Average stars 3.884 3.871 0.021

Review length 871.731 870.553 0.002

Reading ease 45.819 45.232 0.008

Reading grade 18.555 18.864 −0.012

Gunning fog 20.815 21.169 −0.013

Dale Chall readability score 7.981 8.022 −0.012

Lexical density 86.638 87.336 −0.028

Years on Yelp 4.505 4.474 0.015

SMD = standardized mean difference.

Variable	Lost Elite	Current Elite	SMD
Useful	1.079	1.024	0.022
Compliments	2.149	2.061	0.015
Average stars	3.884	3.871	0.021
Review length	871.731	870.553	0.002
Reading ease	45.819	45.232	0.008
Reading grade	18.555	18.864	−0.012
Gunning fog	20.815	21.169	−0.013
Dale Chall readability score	7.981	8.022	−0.012
Lexical density	86.638	87.336	−0.028
Years on Yelp	4.505	4.474	0.015

3.2.3. Difference-in-Differences

Having identified our matched data, we now have measures for the intrinsic quality and perceived quality of reviews written before versus after the treatment for both the treatment group and the control group. Here, we utilize the DiD regression specification that is commonly used in the literature (e.g., Kumar et al., 2022) with the following equation: $D V_{i t} = γ D e m o t e d_{i, t} + β X_{i t} + δ_{t} + ϵ_{i t} .$ (1)In equation (1), subscript i denotes the individual reviewer, and subscript t denotes the year in the data. $D V_{i}$ represents our dependent variable of interest. Similar to other studies with data structure same as ours (e.g., Pu et al., 2020; Gao et al., 2024; Zhu et al., 2024), $D V_{i}$ is constructed as the average value of corresponding intrinsic quality and perceived quality measures of each review posted by individual $i$ in time period $t$ . $X_{i t}$ denotes the control variables that we use in our analyses, and $δ_{t}$ controls for time-fixed effects.⁴ $D e m o t e d_{i, t}$ is an indicator variable that denotes whether a reviewer $i$ has status in period $t$ . Therefore, for a demoted user, this variable takes the value 1 in periods following demotion. However, for the reviewers who never lost status during the time frame of our dataset, $D e m o t e d_{i, t}$ is always zero. We note that our dependent variables of interest change based on our research objectives. For our first research question, we are interested in discerning the effect of status loss on the intrinsic quality of reviews. To estimate this causal effect, we run six different regressions; for each regression, we choose one of the six intrinsic quality measures as the dependent variable in equation (1). For our second research question, we explore the effect of status loss on the perceived quality of reviews. To explore this effect, we use the variables “Compliments” and “Useful,” each as a dependent variable in equation (1), to run two sets of regressions.

In the context of a transient, status-based reward system, researchers have underscored the importance of studying the temporal effects on the quality of reviews. For instance, Zhang et al. (2020) showed that an individual’s review quality stabilizes over time, as supported by learning theory, which proposes that repeated experiences linger in the subconscious memory (Hofer et al., 2009; Mowrer, 1960). Interestingly, Bhattacharyya et al. (2020) find that reviewers who hold status for long periods tend to be less productive, which negatively impacts quality. This drop in performance is attributed to reinforcer satiation that diminishes the strength of the reinforcement (i.e., the urge to maintain status) on an individual’s behavior with the repeated occurrence of the reinforcement (Bhattacharyya et al., 2020; Murphy et al., 2003). Therefore, the intrinsic quality of reviews after a status loss could be dependent on temporal associations between a reviewer and a platform. In our context, these temporal associations could include the length of time a reviewer has participated with a platform, or it could reflect how long a reviewer held Elite status before losing it. We call these time-related variables “temporal moderators.” Furthermore, features related to temporal associations with an online review platform are commonly highlighted characteristics of a reviewer. When reviewers exhibit these characteristics, they strengthen peripheral cues, which possibly then affects a reviewer’s expertness-trustworthiness-attractiveness. Therefore, we examine two temporal moderators (expressed as Moderator in equation (2)): (i) the number of years a reviewer contributes to the platform (YearsOnYelp) and (ii) the number of years that the reviewer has achieved Elite status (YearsAsElite). We include these variables in our DiD specification as interactions with the treatment variable. Formally, our second model specification is as follows: $\begin{aligned} D V_{i t} & = γ D e m o t e d_{i, t} + β X_{i t} + θ M o d e r a t o r \\ + η (D e m o t e d_{i, t} \times M o d e r a t o r_{i}) + δ_{t} + ϵ_{i t}, \end{aligned}$ (2)in which $θ$ denotes the effect of the temporal moderator and $η$ denotes the effect of the temporal moderator on the treatment effect. The rest of the coefficients are the same as the ones defined in equation (1).

4. Empirical Results

In this section, we present the results from our empirical exercises. For ease of exposition, we express $D e m o t e d_{i, t}$ as StatusLossEffect. Therefore, the variable StatusLossEffect takes the value of $1$ for observations of demoted reviewers in the post-treatment time period and zero otherwise.

4.1. Main Results

In this subsection, we discuss two sets of results. First, we report findings related to the effect of changes in status on the intrinsic review quality of demoted reviewers, as well as how consumers’ perceived quality of reviews written by the demoted users is affected. To obtain these results, we use the DiD specification in equation (1). Table 3 shows the estimated effect of status loss on the six intrinsic quality measures alongside the two perceived quality measures.

Table 3.
Effect of status loss on review quality of lost Elite compared to current Elite reviewers.

Intrinsic quality Perceived quality

Reading Reading Gunning Dale Chall Lexical Review

ease grade fog readability score density length Compliments Useful

StatusLossEffect 3.321* −1.359* −1.357* −0.148 −1.294* −51.898* 0.093 −0.018

(−1.273) (−0.487) (−0.498) (−0.063) (−0.441) (−10.493) (−0.111) (−0.048)

YearsOnYelp −1.327* 0.493* 0.500* 0.067* 0.238* −8.492* −0.025 −0.016

(−0.208) (−0.08) (−0.082) (−0.01) (−0.072) (−1.716) (−0.018) (−0.008)

Average rating 4.666* −1.901* −1.917* −0.167* 1.809* −54.006* 0.068 −0.062*

(−0.529) (−0.202) (−0.207) (−0.026) (−0.183) (−4.359) (−0.046) (−0.02)

Constant 22.045 28.956* 31.837* 8.937* 77.640* 737.611*** 1.047 0.999

(−24.297) (−9.288) (−9.512) (−1.202) (−8.421) (−200.245) (−2.128) (−0.913)

Time fixed effect Yes Yes Yes Yes Yes Yes Yes Yes

$R^{2}$ 0.030 0.031 0.030 0.025 0.008 0.011 0.002 0.002

N 34,892 34,892 34,892 34,892 34,892 34,892 34,892 34,892

Note: * $p < 0.10$ ; $p < 0.05$ ; * $p < 0.01$ ; HC1 robust standard errors are in parentheses.

	Intrinsic quality	Perceived quality
StatusLossEffect	3.321***	−1.359***	−1.357***	−0.148**	−1.294***	−51.898***	0.093	−0.018
	(−1.273)	(−0.487)	(−0.498)	(−0.063)	(−0.441)	(−10.493)	(−0.111)	(−0.048)
YearsOnYelp	−1.327***	0.493***	0.500***	0.067***	0.238***	−8.492***	−0.025	−0.016**
	(−0.208)	(−0.08)	(−0.082)	(−0.01)	(−0.072)	(−1.716)	(−0.018)	(−0.008)
Average rating	4.666***	−1.901***	−1.917***	−0.167***	1.809***	−54.006***	0.068	−0.062***
	(−0.529)	(−0.202)	(−0.207)	(−0.026)	(−0.183)	(−4.359)	(−0.046)	(−0.02)
Constant	22.045	28.956***	31.837***	8.937***	77.640***	737.611***	1.047	0.999
	(−24.297)	(−9.288)	(−9.512)	(−1.202)	(−8.421)	(−200.245)	(−2.128)	(−0.913)
Time fixed effect	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes
$R^{2}$	0.030	0.031	0.030	0.025	0.008	0.011	0.002	0.002
N	34,892	34,892	34,892	34,892	34,892	34,892	34,892	34,892

With regards to the impact of status loss on the intrinsic quality measures of a review, Table 3 reveals that reviewers who have been demoted tend to produce reviews with lower intrinsic quality compared to those who retain their status. Such a decrease is consistent across all intrinsic quality measures we employ. We note that all but one of these intrinsic measures are statistically significant at $p < 0.01$ , with the exception being the Dale Chall readability score, which is statistically significant at $p < 0.05$ . Specifically, reviews contributed after being demoted contain about 52 fewer characters (around 6% less in length). Meanwhile, the Flesch reading ease score increases by about 3.32 points, indicating that the content becomes simpler to read. This finding is consistent with the significant decrease in the Flesch-Kincaid reading grade, the Gunning fog index, and the Dale Chall readability score, all of which indicate that the education required to read/write treated review text is significantly lower. Last, the decrease in the lexical density score indicates that the treated review text contains significantly fewer lexical words.

With regards to the impact of status loss on the perceived quality of a review, Table 3 shows that the StatusLossEffect is statistically insignificant at $p < 0.10$ for both of our perceived quality measures, namely, “Compliments” and “Useful” votes. These results make statistical inferences inconclusive: following the ELM, a consumer should have perceived a drop in values of central cues (as the intrinsic quality of review decreases) as well as a drop in peripheral cues (as Elite status goes away), which should have led to a decline in the perceived quality of reviews written by demoted reviewers. We posit that these insignificant results stem from the platform’s practice of showing the past Elite status of a demoted reviewer, which does not allow the weight on peripheral cues to decrease, even though the intrinsic quality demonstrated by that demoted reviewer declines. In addition, a consumer may not be able to discern the drop in intrinsic quality as much (i.e., the perceived central cue is not significantly affected). We revisit this theory in Section 5, where we discuss in detail the mechanisms in the manifested results.

4.2. Effect of Temporal Moderators

Our main results demonstrate that demoted reviewers tend to contribute reviews with significantly lower quality after a loss of status. However, despite this lower intrinsic quality, there are no significant changes to the perceived quality of reviews written by demoted reviewers from the perspective of review consumers. A natural question thus arises: Are these effects moderated by the experience of demoted reviewers? To answer this question, we identify two moderators commonly used in the literature to represent reviewer experience (e.g., Bradley, 2007; English et al., 2010). Here, we explore the effect of two temporal moderators: (i) the number of years a reviewer contributes to a platform (YearsOnYelp) and (ii) the number of years that a reviewer has Elite status (YearsAsElite). Tables 4 and 5 show the impact of these temporal moderators on the effect of status loss on review quality of demoted reviewers.

Table 4.
Heterogeneous effect on review quality of lost Elite compared to current Elite reviewers, based on the number of years a reviewer contributes to Yelp.

Intrinsic quality Perceived quality

Reading Reading Gunning Dale Chall Lexical Review

ease grade fog readability score density length Compliments Useful

StatusLossEffect 2.761* −1.166 −1.176 −1.200 −1.509* −46.264* −0.002 −0.045

(−1.529) (−0.584) (−0.598) (−0.785) (−0.53) (−12.599) (−0.134) (−0.057)

StatusLossEffect 0.996 −0.344 −0.322 −1.017 0.383 −10.026 0.169 0.047

$\times$ YearsOnYelp (−1.506) (−0.576) (−0.59) (−0.773) (−0.522) (−12.411) (−0.132) (−0.057)

YearsOnYelp −1.368* 0.507* 0.513* 0.596* 0.223* −8.083* −0.032* −0.018

(−0.217) (−0.083) (−0.085) (−0.112) (−0.075) (−1.79) (−0.019) (−0.008)

Average rating 4.663* −1.900* −1.916* −2.348* 1.807* −53.979* 0.068 −0.062*

(−0.529) (−0.202) (−0.207) (−0.272) (−0.183) (−4.359) (−0.046) (−0.02)

Constant 22.048 28.955* 31.836* 36.697* 77.641* 737.583*** 1.048 0.999

(−24.298) (−9.288) (−9.512) (−12.48) (−8.421) (−200.246) (−2.128) (−0.913)

Time fixed effect Yes Yes Yes Yes Yes Yes

N 34,892 34,892 34,892 34,892 34,892 34,892 34,892 34,892

Note: * $p < 0.10$ ; $p < 0.05$ ; * $p < 0.01$ ; HC1 robust standard errors are in parentheses.

	Intrinsic quality	Perceived quality
StatusLossEffect	2.761*	−1.166**	−1.176**	−1.200	−1.509***	−46.264***	−0.002	−0.045
	(−1.529)	(−0.584)	(−0.598)	(−0.785)	(−0.53)	(−12.599)	(−0.134)	(−0.057)
StatusLossEffect	0.996	−0.344	−0.322	−1.017	0.383	−10.026	0.169	0.047
$\times$ YearsOnYelp	(−1.506)	(−0.576)	(−0.59)	(−0.773)	(−0.522)	(−12.411)	(−0.132)	(−0.057)
YearsOnYelp	−1.368***	0.507***	0.513***	0.596***	0.223***	−8.083***	−0.032*	−0.018**
	(−0.217)	(−0.083)	(−0.085)	(−0.112)	(−0.075)	(−1.79)	(−0.019)	(−0.008)
Average rating	4.663***	−1.900***	−1.916***	−2.348***	1.807***	−53.979***	0.068	−0.062***
	(−0.529)	(−0.202)	(−0.207)	(−0.272)	(−0.183)	(−4.359)	(−0.046)	(−0.02)
Constant	22.048	28.955***	31.836***	36.697***	77.641***	737.583***	1.048	0.999
	(−24.298)	(−9.288)	(−9.512)	(−12.48)	(−8.421)	(−200.246)	(−2.128)	(−0.913)
Time fixed effect	Yes	Yes	Yes	Yes	Yes	Yes
N	34,892	34,892	34,892	34,892	34,892	34,892	34,892	34,892

Table 5.

Heterogeneous effect on review quality of lost Elite compared to current Elite reviewers, based on the number of years a reviewer held Elite status.

	Intrinsic quality						Perceived quality
	Reading	Reading	Gunning	Dale Chall	Lexical	Review
	ease	grade	fog	readability score	density	length	Compliments	Useful
StatusLossEffect	2.947**	−1.225**	−1.223**	−0.134*	−1.510***	−55.555***	0.08	−0.033
	(−1.502)	(−0.574)	(−0.588)	(−0.074)	(−0.52)	(−12.367)	(−0.131)	(−0.056)
StatusLossEffect	0.537	−0.189	−0.189	−0.018	0.486	9.000	0.044	0.036
$\times$ YearsAsElite	(−1.468)	(−0.561)	(−0.575)	(−0.073)	(−0.509)	(−12.092)	(−0.129)	(−0.055)
YearsOnYelp	−0.649**	0.234**	0.235**	0.032**	−0.152*	−18.490***	−0.114***	−0.054***
	(−0.254)	(−0.097)	(−0.099)	(−0.013)	(−0.088)	(−2.093)	(−0.022)	(−0.01)
YearsAsElite	−1.560***	0.594***	0.607***	0.079***	0.856***	22.122***	0.198***	0.083***
	(−0.342)	(−0.131)	(−0.134)	(−0.017)	(−0.119)	(−2.82)	(−0.03)	(−0.013)
Average rating	4.683***	−1.907***	−1.923***	−0.168***	1.800***	−54.241***	0.066	−0.062***
	(−0.529)	(−0.202)	(−0.207)	(−0.026)	(−0.183)	(−4.354)	(−0.046)	(−0.02)
Constant	21.117	29.309***	32.197***	8.983***	78.140***	750.583***	1.164	1.048
	(−24.291)	(−9.286)	(−9.509)	(−1.202)	(−8.414)	(−200.04)	(−2.126)	(−0.912)
Time fixed effect	Yes	Yes	Yes	Yes	Yes	Yes
$R^{2}$	0.03	0.031	0.031	0.025	0.01	0.013	0.002	0.002
N	34,892	34,892	34,892	34,892	34,892	34,892	34,892	34,892

Note: * $p < 0.10$ ; ** $p < 0.05$ ; *** $p < 0.01$ ; HC1 robust standard errors are in parentheses.

First, we examine the effect that both of these moderators have on intrinsic quality in two sets of regressions presented in Tables 4 and 5, wherein the effects of the moderators are shown as interactions with StatusLossEffect (i.e., the regression specification in equation (2)). Interestingly, we do not find any statistically significant effect of either of those interactions (StatusLossEffect $\times$ YearsOnYelp and $StatusLossEffect \times YearsAsElite$ ), indicating that the two temporal variables do not affect the treatment effect of status loss. However, in both cases, the main effect of status loss is similar to our results in Table 3 (i.e., the intrinsic quality declines after a reviewer loses status).

Second, we analyze the effect of temporal moderators on how a status loss affects the perceived quality of reviews, also shown in Tables 4 and 5 for moderators YearsOnYelp and YearsAsElite, respectively. As before, we do not find any statistically significant effect on either of these variables on the treatment effect, as reflected by the coefficients on StatusLossEffect $\times$ YearsOnYelp and $StatusLossEffect \times YearsAsElite$ . Additionally, the treatment effect StatusLossEffects is statistically insignificant for perceived quality measures. Therefore, we find no conclusive evidence that, compared to reviewers who maintain their Elite status, the perceived quality of reviews written by a demoted reviewer is affected, even after we control for temporal moderators.

We note that the main effects of both the temporal moderators, captured by the coefficients of YearsOnYelp and YearsAsElite, are statistically significant and consistent with the literature. Namely, the experience of the reviewers positively affects both the intrinsic quality and perceived quality of the reviews they contribute.

5. Mechanism Analysis

Our empirical findings demonstrate that when reviewers experience a loss of status, their posted reviews are intrinsically lower quality. However, consumers of these reviews continue to perceive them as having relatively high quality. Additionally, the effects of the status loss do not seem to be influenced by reviewers’ experience with the platform. In this section, we examine the empirical evidence supporting two potential underlying mechanisms that explain these results: (1) the perception of inequity among demoted reviewers to explain the decline in the intrinsic quality of reviews and (2) the predominance of peripheral cues over central cues of quality among consumers, which results in the higher perceived quality of otherwise inferior reviews.

5.1. Why Do Demoted Reviewers Post Reviews With Lower Quality?

Inequity theory states that when individuals compare their own outcome-input ratios to those of relevant others, the former may perceive unfairness if their outcomes were lower than individuals with seemingly similar or lower inputs (Adams, 1963, 1965). We argue that in our context since the status-demotion criteria were endogenously determined by the platform (i.e., not publicly announced), a demotion could lead to perceived unfairness among demoted individuals. In accordance with inequity theory, demoted individuals who perceive such unfairness can react by (a) distorting either their own or others’ inputs or outcomes cognitively, (b) acting in a manner that causes others to change their own inputs or outcomes, (c) behaving in a way that changes their own inputs or outcomes (e.g., by working harder), (d) selecting a different comparison individual (presumably someone whose outcome/input ratio is considered equal to theirs), or (e) exiting the situation altogether, such as quitting the job (Pritchard et al., 1972). In our context, Yelp does not reveal the criteria it follows to make status decisions. Therefore, a demoted reviewer is likely to perceive unfairness. However, in such instances, a reviewer cannot directly change the effort level or the outcome of other reviewers. Instead, demoted reviewers can either increase their own effort to regain their lost status, or these reviewers can decrease their effort when they make future contributions. Previous studies have shown that individuals who feel undervalued often express decreased job satisfaction and demonstrate lower performance (Adams, 1963; Pritchard et al., 1972). Therefore, our findings, which show a decline in intrinsic quality measures following status loss, are consistent with the results. In the next step of our analysis, we examine two alternative explanations for the decrease in the intrinsic quality of reviews following a loss of status.

5.1.1. Do Demoted Reviewers Post Reviews With Lower Quality Because They Have Lost Interest in the Platform Even Before the Demotion?

We made an implicit assumption in using inequity theory to explain the decline in intrinsic quality: that the observed effect on demoted reviewers was solely due to their loss of status. However, it is plausible that reviewers who lost their status may have lost interest in contributing reviews even before the status loss occurred. This could have resulted in the production of inferior reviews and the subsequent loss of status. To ensure the robustness of our results against such self-selection issues, we present two arguments.

First, we recall that our identification strategy relies on a matching technique to ensure that the demoted reviewers (i.e., treatment group) and the current Elite reviewers (i.e., control group) have similar characteristics before the treatment (i.e., the loss of status). Our balance test in Table 2 ensures that both groups have reasonably similar characteristics. In other words, we found no evidence that demoted reviewers (treatment group) demonstrate a loss of interest differently from matched current Elite reviewers (control group) in contributing reviews before the treatment.

Second, we employ an alternative specification to be more conservative. We focus only on demoted reviewers who, in the pre-demotion period, wrote more reviews than the median number of reviews written by Elite reviewers. This way, we restrict the demoted reviewers to only those who did not show signs of losing interest in writing reviews on the platform before they lost status. We rerun the DiD regression specified in equation (1) on a subset of these demoted reviewers and their corresponding matched control group of current Elite reviewers. The results, shown in Table 6, demonstrate that even among this subset of demoted reviewers, the negative effect of status loss on intrinsic review quality persists.

Table 6.
Effect of status loss on the intrinsic quality of reviews by lost Elite compared to current Elite reviewers for reviewers who make significant contributions (in volume) in the pre-treatment period.

Reading Reading Gunning Dale Chall Lexical Review

ease grade fog readability score density length

StatusLossEffect 4.900* −2.182** −2.107* −0.307** −1.062 −47.490*

(2.882) (1.103) (1.130) (0.144) (0.985) (25.314)

YearsOnYelp −1.236* 0.447* 0.444* 0.066* 0.362* −9.848*

(0.372) (0.143) (0.146) (0.019) (0.127) (3.272)

Average rating 4.559* −1.876* −1.865* −0.152* 2.190* −71.353*

(0.821) (0.314) (0.322) (0.041) (0.281) (7.210)

Intercept −21.179 45.127* 48.204* 11.199* 86.259* 935.051***

(37.711) (14.429) (14.786) (1.884) (12.888) (331.240)

Time fixed effect Yes Yes Yes Yes Yes Yes

$R^{2}$ 0.05 0.05 0.05 0.04 0.01 0.02

N 9,695 9,695 9,695 9,695 9,695 9,695

Note: * $p <$ 0.10; $p <$ 0.05; * $p < 0.01$ ; HC1 robust standard errors are in parentheses.

	Reading	Reading	Gunning	Dale Chall	Lexical	Review
StatusLossEffect	4.900*	−2.182**	−2.107*	−0.307**	−1.062	−47.490*
	(2.882)	(1.103)	(1.130)	(0.144)	(0.985)	(25.314)
YearsOnYelp	−1.236***	0.447***	0.444***	0.066***	0.362***	−9.848***
	(0.372)	(0.143)	(0.146)	(0.019)	(0.127)	(3.272)
Average rating	4.559***	−1.876***	−1.865***	−0.152***	2.190***	−71.353***
	(0.821)	(0.314)	(0.322)	(0.041)	(0.281)	(7.210)
Intercept	−21.179	45.127***	48.204***	11.199***	86.259***	935.051***
	(37.711)	(14.429)	(14.786)	(1.884)	(12.888)	(331.240)
Time fixed effect	Yes	Yes	Yes	Yes	Yes	Yes
$R^{2}$	0.05	0.05	0.05	0.04	0.01	0.02
N	9,695	9,695	9,695	9,695	9,695	9,695

5.1.2. Do Demoted Reviewers Post Lower Quality Reviews Because They Write About Businesses That Are Different From the Ones From the Pre-Demotion Period?

In the previous section, we presented empirical evidence that even among demoted reviewers who post reviews at a higher level than the average contribution level of all Elite reviewers, a decrease in intrinsic review quality is observed after status loss. Hence, it is unlikely that the decline in intrinsic review quality is driven by reviewers who lost interest in the platform even before their loss in status. In this subsection, we explore an alternative explanation in which the decrease in review quality may be caused by a change in the review-writing strategies of demoted reviewers after losing their status. Specifically, after demotion, a reviewer may strategically write reviews for certain types of restaurants that differ from the ones they wrote about before demotion. As the set of focal restaurants changes, the intrinsic quality of the reviews written for this new set of restaurants may also change.

To investigate such a potential alternative explanation of our results, we rerun our DiD analysis specified in equation (1) on intrinsic review quality measures after incorporating more controls for restaurant-related and peer reviewer-related characteristics. Specifically, we have added the following covariates as additional control variables: the average ratings assigned to restaurants, the age of the restaurants, the review volume of the restaurants, the average menu price of the restaurants reviewed, the number of restaurants in the area, and the average ratings assigned to the businesses by other reviewers. As observed in Table 7, even after we control for these attributes, we still find that the intrinsic quality declines for demoted reviewers. Therefore, we posit that the decrease in the intrinsic quality of the reviews, which contributed to the loss of status, is unlikely to be driven by the change in the type of restaurants reviewed by the demoted reviewers.

Table 7.
Effect of status loss on the intrinsic quality of reviews by lost Elite reviewers compared to current Elite after controlling for additional restaurant and reviewer-related characteristics.

Reading Reading Gunning Dale Chall Lexical Review

ease grade fog readability score density length

StatusLossEffect 5.230 −1.970 −1.979 −0.261 −1.943* −68.894*

(2.125) (0.814) (0.833) (0.105) (0.602) (17.170)

YearsOnYelp −1.730* 0.632* 0.644* 0.094* 0.225* −17.232*

(0.439) (0.168) (0.172) (0.022) (0.124) (3.545)

Average rating 9.643* −3.708* −3.748* −0.431* 0.926 −59.233*

(1.288) (0.493) (0.505) (0.064) (0.365) (10.407)

Average menu price −0.265* 0.091 0.093 0.016*** 0.047 0.759

(0.102) (0.039) (0.040) (0.005) (0.029) (0.822)

Businesses around .0009 −.0004 −.0004 −.0001 −.0003 −.0036

(.0008) (.0003) (.0003) (.0001) (.0002) (.0066)

Businesses rating −2.009 0.665 0.67 0.133* 1.855* 24.698

(1.445) (0.553) (0.567) (0.071) (0.409) (11.675)

Review volume of businesses 0.270 −0.042 −0.041 −0.023 0.135 4.617

(0.648) (0.248) (0.254) (0.032) (0.184) (5.238)

Age of businesses −0.722 0.295 0.308 0.035 −0.779* −28.716*

(0.789) (0.302) (0.309) (0.039) (0.223) (6.372)

Constant −17.102 44.861* 48.206* 10.397* 71.894* 847.670***

(38.372) (14.69) (15.045) (1.893) (10.871) (310.036)

Time fixed effect Yes Yes Yes Yes Yes Yes

$R^{2}$ 0.04 0.04 0.04 0.03 0.02 0.02

N 7,474 7,474 7,474 7,474 7,474 7,474

Note: * $p < 0.10$ ; $p < 0.05$ ; * $p < 0.01$ ; HC1 robust standard errors are in parentheses.

	Reading	Reading	Gunning	Dale Chall	Lexical	Review
StatusLossEffect	5.230**	−1.970**	−1.979**	−0.261**	−1.943***	−68.894***
	(2.125)	(0.814)	(0.833)	(0.105)	(0.602)	(17.170)
YearsOnYelp	−1.730***	0.632***	0.644***	0.094***	0.225*	−17.232***
	(0.439)	(0.168)	(0.172)	(0.022)	(0.124)	(3.545)
Average rating	9.643***	−3.708***	−3.748***	−0.431***	0.926**	−59.233***
	(1.288)	(0.493)	(0.505)	(0.064)	(0.365)	(10.407)
Average menu price	−0.265***	0.091**	0.093**	0.016***	0.047	0.759
	(0.102)	(0.039)	(0.040)	(0.005)	(0.029)	(0.822)
Businesses around	.0009	−.0004	−.0004	−.0001	−.0003	−.0036
	(.0008)	(.0003)	(.0003)	(.0001)	(.0002)	(.0066)
Businesses rating	−2.009	0.665	0.67	0.133*	1.855***	24.698**
	(1.445)	(0.553)	(0.567)	(0.071)	(0.409)	(11.675)
Review volume of businesses	0.270	−0.042	−0.041	−0.023	0.135	4.617
	(0.648)	(0.248)	(0.254)	(0.032)	(0.184)	(5.238)
Age of businesses	−0.722	0.295	0.308	0.035	−0.779***	−28.716***
	(0.789)	(0.302)	(0.309)	(0.039)	(0.223)	(6.372)
Constant	−17.102	44.861***	48.206***	10.397***	71.894***	847.670***
	(38.372)	(14.69)	(15.045)	(1.893)	(10.871)	(310.036)
Time fixed effect	Yes	Yes	Yes	Yes	Yes	Yes
$R^{2}$	0.04	0.04	0.04	0.03	0.02	0.02
N	7,474	7,474	7,474	7,474	7,474	7,474

5.2. Why Do Consumers on the Platform Continue to Perceive Reviews Written by Demoted Reviewers as High-Quality Reviews?

Our main findings suggest that the intrinsic quality of reviews written by demoted individuals declines after their status loss. However, we also find that the perceived quality of these reviews does not decrease after the loss of status. To explain this result, we turn to the ELM, which posits that evaluations of information are guided by a combination of central and peripheral cues, with the former based on the intrinsic quality and the latter based on the source credibility (Petty et al., 1981, 1986; Sussman and Siegal, 2003). To evaluate the quality of reviews, consumers on the platform may depend on peripheral cues as they might not personally know the reviewers or their level of expertise. In such cases, platform recognition in the form of badges and awards can reinforce these peripheral cues and enhance the consumers’ confidence in the quality of reviews (Donath, 2007). In our context, even if a reviewer has been demoted, the platform still displays whether a reviewer previously held Elite status, which enhances weight of the peripheral cue of quality and increases consumer confidence in the credibility of reviews written by demoted reviewers. Correspondingly, the dominance of peripheral cues over central cues causes consumers to not recognize the decline in the intrinsic quality of reviews written by demoted reviewers. Furthermore, it is possible that consumers may not easily recognize the decrease in intrinsic quality, leading to a less significant effect on the perceived central cue. Thus, consumers perceive that the quality of reviews written by demoted reviewers is statistically similar to those written by current Elite reviewers, even though the intrinsic quality is lower in reviews written by the former compared to the latter.

5.2.1. Perceived Quality Comparison Between Demoted Reviewers and Reviewers With No Past Status

To further support the idea that the display of past statuses affects perceived quality, we conduct an additional comparison between demoted reviewers and reviewers who never held statuses, so we may determine how their review quality perceptions differ. If the display of past statuses truly has a disproportionate effect on perceived quality, we would then expect reviews written by demoted reviewers to be perceived as higher quality compared to reviews of similar intrinsic quality written by reviewers who never had statuses. To test this conjecture, we run the same DiD estimation stated in equation (1) and use demoted reviewers as our treatment group against a matched set of reviewers who never held statuses as our control group.⁵ Our results, presented in Table 8, indicate that, even after we control for intrinsic review quality, both compliments and useful are statistically significant and positive. Thus, content created by reviewers who previously held Elite status but lost it later (i.e., “with” Lost Elite badge displayed by Yelp) is perceived as higher quality than content created by reviewers who never held any status (i.e., “without” any badge), even when the intrinsic quality of the two groups of reviewers is similar.

Table 8.
Effect of status loss on perceived quality in reviews by lost-Elite reviewers compared to reviewers who never held status.

Compliments Useful

StatusLossEffect 2.729* (0.098) 0.891* (0.047)

YearsOnYelp −0.060* (0.014) −0.037* (0.007)

Average rating −0.223* (0.027) −0.218* (0.013)

Constant 1.3 (2.236) 1.338 (1.077)

Time fixed effect Yes Yes

$R^{2}$ 0.106 0.078

N 30,944 30,944

Note: * $p < 0.10$ ; $p < 0.05$ ; * $p < 0.01$ ; HC1 robust standard errors are in parentheses.

	Compliments	Useful
StatusLossEffect	2.729*** (0.098)	0.891*** (0.047)
YearsOnYelp	−0.060*** (0.014)	−0.037*** (0.007)
Average rating	−0.223*** (0.027)	−0.218*** (0.013)
Constant	1.3 (2.236)	1.338 (1.077)
Time fixed effect	Yes	Yes
$R^{2}$	0.106	0.078
N	30,944	30,944

However, one can still argue that our use of six distinct intrinsic quality measures may not fully capture the true meaning of “information” for a consumer. Thus, the information content of reviews written by demoted reviewers might remain unchanged, yet our intrinsic measures might fail to account for this. To address such concerns, we conducted a randomized experiment on Amazon Mechanical Turk to investigate the underlying mechanism that guides consumers’ perception of quality, showing that it is indeed peripheral cues, rather than central cues, that play a significant role in online review platforms.

5.2.2. Randomized Experiment

The objective of our randomized experiment is to cleanly capture the impact of displaying past and present statuses (or no status) on the perceived quality of reviews. In our context, we are interested in evaluating the impact of displaying the status of reviewers to platform consumers on how consumers’ perceptions of quality changes based on the status, itself.

We recruited 480 subjects in September 2022 using Amazon Mechanical Turk (MTurk) to conduct our experiment. Our subjects were required to rate the quality of the review text on a scale of 1–10. Each subject received seven reviews to rate and a few demographic questions to answer, including gender, age, and level of education. We randomly selected a subset of 50 reviews from our reviews dataset. Each review could appear to a subject with only one of the randomly chosen possible treatments: written by a reviewer who currently holds Elite status ( $t 1$ ), who held Elite status in the past ( $t 2$ ), and who never held status, in which case no status is shown (control group $c$ ). Each subject is shown a set of seven distinct and randomly selected reviews, each with one of the randomly chosen treatments, from the chosen subset of 50 reviews. Therefore, between 50 reviews and three possible treatments, we have 150 different review instances. An example of how a question would appear in each treatment group is displayed in Figure A4 in Section B of the E-Companion.

We did not collect any personally identifiable information from the subjects. At the end of the task, each subject was paid $0.5 for participation. It took an average of 4 min and 37 s for each subject to finish the task. To ensure the quality of our subjects, we only recruited subjects who had participated in at least a hundred similar approved tasks in the past. Our detailed instructions and the demographic questions given to subjects are included in Figure A5 in E-Companion Section B.2.

Out of the data collected from our experiment, we observed a few systematic inconsistencies in our data-generation process. First, we observed that some of the responses may be generated by bots. This can be determined by the time that a subject would take to log in and submit the Recaptcha for a form. We found 28 such anomalies and excluded their observations. Second, we noticed that some subjects chose to rate all the reviews at the highest quality of 10. It is clear that these subjects did not follow the instructions and were participating just for the reward; hence, we excluded observations from 14 such subjects. In total, we have 438 subjects who evaluated a total of 50 unique reviews. Among these, we have 1,064 observations for which reviews show no status ( $c$ ), 1,015 observations for which reviews show current Elite status ( $t 1$ ), and 987 observations for which reviews show past Elite status ( $t 2$ ). We analyze these data using the following regression model: $p e r c e i v e d Q u a l i t y_{i j} = γ (T r e a t m e n t_{i}) + β X_{i j} + δ_{i} + ϵ_{i t},$ (3)in which the perceived quality of review $j$ is assigned by subject $i$ in the experiment. $T r e a t m e n t_{i}$ is an indicator variable that takes the value one if subject $i$ receives the treatment and zero otherwise. In addition, $γ$ represents the effect of the treatment with respect to the baseline control group (c). Moreover, $X_{i j}$ represents the control variables, which include the subject’s demographic variables and the review’s actual quality. Specifically, we include the review length and Dale Chall readability scores as our control variables. Last, $δ_{i}$ represents the subject-fixed effects.

In Table 9, we find that both the treatments that display the reviewer status (current Elite status $t 1$ or past Elite status $t 2$ ) result in higher perceived quality. The coefficients of $t 1$ and $t 2$ are positive and significant, reflecting that displaying a status indeed increases the perceived quality of reviews. We further analyzed the difference in perceived quality between displaying the current status compared to the past status; for this analysis, we did not include reviews that included no reviewer status. Therefore, we ran the regression stated in equation (3) with reviews shown as being written by current Elite reviewers ( $t 1$ ) against the baseline, (i.e., reviews shown as being written by past Elite reviewers). We present our result in Table 10. Interestingly, these results do not provide any evidence that the perceived quality of a review changes between displaying a reviewer’s current status and displaying a past status. This result is qualitatively similar to our main result: displaying a status positively impacts the perceived quality, and there is no difference between showing the current status or the past status of a reviewer.⁶

Table 9.

Effect of display of different status on perceived review quality.

	Perceived quality
t1	0.398***
	(0.069)
t2	0.276***
	(0.069)
Review length	0.000
	(0.000)
Dale Chall readability score	0.007
	(0.008)
Intercept	7.400***
	(0.126)
Subject FE	Yes
$R^{2}$	0.015
N	3,066

Note: * $p <$ 0.10; ** $p < 0.05$ ; *** $p < 0.01$ ; HC1 robust standard errors are in parentheses.

Table 10.

Difference in the effect of Elite status display on perceived review quality w.r.t. to past-Elite status display.

	Perceived quality
t2	−0.126
	(0.116)
Review length	−0.000
	(0.000)
Dale Chall readability score	0.001
	(0.009)
Intercept	7.953***
	(0.148)
Subject FE	Yes
$R^{2}$	0.02
N	1,960

Note: * $p < 0.10$ ; ** $p < 0.05$ ; *** $p < 0.01$ ; HC1 robust standard errors are in parentheses.

In summary, we use this section to draw upon inequity theory and explain the decline in intrinsic quality of reviews contributed by demoted reviewers after their loss of status. We also extend our empirical analyses to rule out two alternative explanations that may explain the decline in intrinsic quality after status loss. In addition, we utilize the ELM to theoretically explain why the perceived quality of reviews does not change for the reviews posted after status loss, even when those reviews have significantly lower intrinsic quality. We also provide further support to our proposed mechanism by conducting a randomized experiment to cleanly identify the change in perceived quality (or the lack thereof) among consumers with respect to the change in reviewers’ statuses.

6. Robustness Checks

To ensure that our results are robust, we perform several additional analyses. Details of these analyses and corresponding results are reported in Section D of the E-Companion. The analyses are as follows:

To ensure that our results are not driven by the matching method employed in our empirical analysis, we consider an alternative matching algorithm based on an entropy balancing technique. We report our results in Section D.1 of the E-Companion.

In our main specification, a reviewer can act as both a treated individual and a control reviewer, depending on the timing of their demotion from the Elite status. Thus, we match on each instance of a reviewer for our analysis. As a robustness check, we match at the reviewer level instead of reviewer instance, excluding $Y e a r s O n Y e l p$ as a matching covariate, and incorporating the number of reviews as an additional matching covariate. Details of this robustness is provided in Section D.2 of the E-Companion.

We consider an alternative identification strategy to ensure that our results are not driven by our primary identification strategy: matching and DiD. Specifically, we employ a causal forest algorithm (Wager and Athey, 2018) and report our results in Section D.3 of the E-Companion.

We perform a falsification test using a placebo test in which we use a “fake” treatment date instead of the actual treatment to observe whether our results are simply spurious. The fake treatment date is the same date as the actual treatment date but one calendar year prior. We report our results in Section D.4 of the E-Companion.

We perform another falsification test using a randomized treatment test. Specifically, we randomly assign users in our dataset to either a treatment group or a control group and run our regression models for 10,000 iterations. We report our results in Section D.5 of the E-Companion.

In our main analysis, we excluded the number of reviews as a matching covariate because, in many cases, it is a perfect predictor of status loss. In this robustness check, to verify that our findings are not influenced by the exclusion of number of reviews, we incorporate it both as a variable in matching and as a control variable in our causal models. The detailed outcomes are presented in Section D.6 of the E-companion.

In our main analysis, we addressed self-selection bias by employing PSM. As a robustness check, we employ a Heckman two-stage estimation. In the first stage of the model, along with the review quality measures used in PSM, we incorporate three location-based variables. These variables, influencing Yelp managers’ demotion decisions, are less likely to have a direct impact on an individual’s review quality, making them suitable for fulfilling the exclusion restrictions. Detailed results can be found in Section D.7 of the E-Companion.

In our main analyses, we set up the treatment-effect model as a “canonical” version of DiD, that is, two groups in two time periods. In this robustness check, we implement a staggered DiD approach following Callaway and Sant’Anna (2021). We report our results in Section D.8 of the E-Companion.

We provide predictive analytics on the Yelp data, as a substitute to the Amazon Turk Experiment, so we may establish that peripheral cues indeed guide the perceived quality. We report our results in Section D.9 of the E-Companion.

All results are qualitatively and quantitatively similar to our main results. Notably, the treatment group in these robustness exercises is reviewers who lost Elite status, and the control group is reviewers who currently hold Elite status.

7. Discussion and Conclusion

Consumer reliance on peer information in the digital age has led to the dominance of UGC platforms. For these platforms, balancing content quality and user engagement is a key operational challenge. While the impact of status rewards for high-quality content have been studied, the impact of losing status remains unexplored. This paper addresses this gap by analyzing observational data from Yelp to examine the consequences of status loss on UGC platforms.

This study explores the impact of exclusivity in performance-based rewards on the quality of UGC. Particularly, we focus on the effect of losing status on review quality within an online review platform. We examine whether demoting reviewers leads to lower quality reviews and, consequently, harms the platform. Our findings reveal a two-fold effect. First, the intrinsic review quality of reviewers diminishes significantly upon demotion, potentially due to dissatisfaction with the platform stemming from the perceived inequity related to demotion. We rule out alternative explanations of a decline in intrinsic quality stemming from reviewers losing interest in the platform before demotion or from opting to review different types of businesses post-demotion. Second, the platform’s policy of displaying a reviewer’s past status disproportionately affects the perceived review quality. Even with the lower intrinsic quality, consumers continue to consider these reviews helpful due to the dominance of peripheral cues (i.e., a reviewer’s past status) over central cues (i.e., the intrinsic review quality) in their evaluations. This disparity between the intrinsic and perceived quality of a review undermines the value and sustainability of the platform in the long run. We utilize inequity theory and the ELM to explain these findings. The summary of our findings is provided in Table 11.

Table 11.
Results summary.

Findings Corresponding results

$∙$ Status loss significantly decreases the intrinsic quality of reviews Section 4.1, Table 3

written by users who lose the Elite status.

$\circ$ Demoted reviewers who were actively engaged with the platform before Section 5.1.1, Table 6

losing the status also tend to write lower-quality reviews after status loss.

$\circ$ The effect of status loss on intrinsic review quality persists after Section 5.1.2, Table 7

controlling for restaurant-related and reviewer-related characteristics.

$∙$ The effect of status loss on the perceived quality of reviews written Section 4.1, Table 3

by users who lose the Elite status is statistically insignificant.

$\circ$ The perceived quality of reviews written by demoted reviewers is higher Section 5.2.1, Table 8

than that of similar reviews written by users who have never had status.

$\circ$ The perceived quality of reviews written by Elite/past Elite reviewers is Section 5.2.2, Table 9

higher than that of exactly similar reviews written by users with no status.

$\circ$ The perceived quality of reviews written by Elite reviewers is not significantly Section 5.2.2, Table 10

different than similar reviews written by past Elite reviewers.

$∙$ Reviewer experience with the platform does not significantly moderate Section 4.2, Tables 4 and 5

the effects of status loss on intrinsic/perceived review quality.

Note: Results in Tables 3 to 8 are based on the observational data obtained from Yelp.

Results in Tables 9 and 10 are based on the randomized experiments conducted through Amazon Mechanical Turk.

Findings	Corresponding results
$∙$ Status loss significantly decreases the intrinsic quality of reviews	Section 4.1, Table 3
written by users who lose the Elite status.
$\circ$ Demoted reviewers who were actively engaged with the platform before	Section 5.1.1, Table 6
losing the status also tend to write lower-quality reviews after status loss.
$\circ$ The effect of status loss on intrinsic review quality persists after	Section 5.1.2, Table 7
controlling for restaurant-related and reviewer-related characteristics.
$∙$ The effect of status loss on the perceived quality of reviews written	Section 4.1, Table 3
by users who lose the Elite status is statistically insignificant.
$\circ$ The perceived quality of reviews written by demoted reviewers is higher	Section 5.2.1, Table 8
than that of similar reviews written by users who have never had status.
$\circ$ The perceived quality of reviews written by Elite/past Elite reviewers is	Section 5.2.2, Table 9
higher than that of exactly similar reviews written by users with no status.
$\circ$ The perceived quality of reviews written by Elite reviewers is not significantly	Section 5.2.2, Table 10
different than similar reviews written by past Elite reviewers.
$∙$ Reviewer experience with the platform does not significantly moderate	Section 4.2, Tables 4 and 5
the effects of status loss on intrinsic/perceived review quality.

7.1. Research Contributions

Our study offers significant research contributions. First, it provides valuable insights on a previously unexplored area: the effect of losing status on reviewer behavior and platform welfare. While previous studies have focused on the impact of status acquisition on review-generating behavior, the implications of status loss in these transient, status-based incentive systems have been less explored. This study addresses this gap by empirically examining how status loss affects the behavior of demoted reviewers.

Second, we go beyond user behavior to explore the operational implications for platforms. Review platforms rely on both reviewer effort (intrinsic quality) and consumer perception (perceived quality) to thrive. These distinct aspects, often conflated in the existing literature, influence platform value differently. Here, we analyze how status loss affects the interplay between intrinsic and perceived quality measures. Interestingly, our findings reveal a decline in intrinsic quality post-demotion, but no corresponding decline in perceived quality. This highlights a critical gap that future platform designs should address.

Finally, we introduce a novel theoretical framework involving inequity theory and the ELM. Specifically, we explain the decline in intrinsic quality with inequity theory, positing that demotion triggers the perception of unfairness, leading to lower effort. The gap between intrinsic and perceived quality is explained by the ELM. We argue that consumers rely more heavily on peripheral cues (past status badges) than central cues (review content) when evaluating reviews due to the platform’s practice of displaying past statuses. This integration of inequity theory and the ELM provides a unique lens to understand the dynamics of intrinsic and perceived quality in online reviews.

7.2. Managerial Implications

Our study presents several important and relevant insights for practitioners who develop and manage the quality of content for UGC platforms. First, our empirical evidence not only is statistically significant but also carries meaningful economic implications. For example, our main results demonstrate that demoted reviewers tend to write reviews with significantly lower quality. Such an impact has direct and negative implications for a platform’s welfare, as reported by Chan et al. (2022). Meanwhile, our results on the discrepancy between perceived review quality and intrinsic review quality point to potential long-term negative implications for customer satisfaction (Hong et al., 2018), which directly impacts platform economics (Amorim and Pratas, 2022).

Second, our empirical findings highlight the potential negative consequences of displaying past badges on profiles of demoted users within a transient, status-based, user recognition system. Such a system can result in consumers perceiving content generated by demoted users to be of better quality than it is, leading to reduced consumer satisfaction with the platform in the long term and impacting its welfare and sustainability. Thus, our results suggest that platform managers should reconsider policies that entail displaying the past statuses of demoted reviewers.

Third, our findings underscore the importance of potential design mechanisms and intervention strategies to platform managers who must address issues of status loss. For example, offering incentives such as participation in social events, exclusive invitations, or a dedicated forum to address perceptions of inequity (similar to the support forum in Meta Stack Overflow by Bhattacharyya et al., 2020) could motivate demoted reviewers to generate higher-quality reviews. Moreover, our findings demonstrate that even though demoted users tend to produce lower intrinsic quality reviews following the loss of their statuses, the perceived quality of their reviews remains unchanged. Therefore, consumers on UGC platforms should be made aware of this possible bias (e.g., offer them intrinsic quality metrics of reviews), so they can make well-informed decisions (Goes et al., 2014). Alternatively, platforms could display intrinsic quality metrics to assist users during their content consumption (e.g., Hou and Ma, 2022).

Fourth, insights from our study can be used by operation managers in UGC platforms to develop predictive analytics frameworks for identifying potential reviewers of interest. In Section E of the E-Companion, we demonstrate how a simple predictive analytics framework can be developed using a scalable, off-the-shelf, machine learning model that can consistently identify, more than a year before a reviewer loses status, whether the reviewer is likely to reduce the quality of his/her reviews after demotion. By proactively identifying such reviewers, appropriate actions can be taken in the form of incentives or retention mechanisms to alleviate potential adverse impacts that the platform may face in the future. Depending on the platform’s objective, a similar predictive tool can be developed by leveraging the insights provided in this study.

7.3. Limitations and Future Research

This article is subject to certain limitations, which, simultaneously, offer an excellent avenue for future research. First, our findings are based on observational data from Yelp, a platform that is known for online restaurant reviews. Future studies could examine whether other factors (e.g., culture and content type) moderate outcomes. For example, research could be conducted on platforms that operate in different cultural contexts (e.g., Asian platforms) or have different types of content (e.g., Q&A platforms). Second, it would be interesting to study how status loss can be designed in ways that can motivate the demoted reviewers to work harder to regain their statuses. Third, although we have framed this study as a quasi-experimental study to enhance its external validity, there are still opportunities to decipher more with respect to internal validity and underlying mechanisms. Furthermore, qualitative research methodologies such as focus group interviews or surveys could be employed in future studies to explore the relationship between intrinsic review quality and perceived review quality in the context of status loss. These approaches would provide valuable insights to extend the literature. Similarly, accessing review writers’ thought processes to obtain insights on the underlying psychological drivers of demoted users (e.g., whether demoted users learn to change their behavior post-demotion) would also be an excellent avenue for future qualitative research. Last, future research may examine additional variables, such as the characteristics of the business units receiving reviews, as potential moderators that influence the impact of status loss on review-generating behavior.

Supplemental Material

sj-pdf-1-pao-10.1177_10591478241279801 - Supplemental material for Status Downgrade: The Impact of Losing Status on a User-Generated Content Platform

Supplemental material, sj-pdf-1-pao-10.1177_10591478241279801 for Status Downgrade: The Impact of Losing Status on a User-Generated Content Platform by Vandith Pamuru, Wreetabrata Kar and Warut Khern-am-nuai in Production and Operations Management

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research,authorship,and/or publication of this article.

Funding

The author(s) received no financial support for the research,authorship and/or publication of this article.

ORCID iD

Warut Khern-am-nuai

Supplemental Material

Supplemental material for this article is available online (doi:

How to cite this article

Pamuru V,Kar W and Khern-am-nuai W (2024) Status Downgrade: The Impact of Losing Status on a User-Generated Content Platform. Production and Operations Management xx(x): 1–21.

References

Abrahams

Fan

Wang

Zhang

Jiao

(2015) An integrated text analytic framework for product defect discovery. Production and Operations Management 24(6): 975–990.

Adams

(1963) Towards an understanding of inequity. The Journal of Abnormal and Social Psychology 67(5): 422.

Adams

(1965) Inequity in social exchange. Advances in Experimental Social Psychology 2: 267–299. Elsevier.

Amorim

Pratas

(2022) Measuring the impact of risk perception, trust and satisfaction on loyalty in e-marketplaces. In: Marketing and Smart Technologies. Smart Innovation, Systems and Technologies, vol. 280. Singapore: Springer.

Anderson

Huttenlocher

Kleinberg

, et al. (2013) Steering user behavior with badges. In: 22nd international conference on World Wide Web (WWW '13) , Rio de Janeiro, Brazil, 13–17 May, pp. 95–106. New York, NY, USA: Association for Computing Machinery.

Anderson

Keith

Lopez

(2023) Opportunities for system dynamics research in operations management for public policy. Production and Operations Management 32(6): 1895–1920.

Bakhshi

Kanuparthy

Shamma

(2015) Understanding online reviews: Funny, cool or useful? In: 18th ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW '15) , Vancouver, BC, Canada, 14–18 March, pp. 1270–1276. New York, NY, USA: Association for Computing Machinery.

Bansal

Aggarwal

(2021) textstat. (accessed 1 September).

Bhattacharyya

Banerjee

Bose

Kankanhalli

(2020) Temporal effects of repeated recognition and lack of recognition on online community contributions. Journal of Management Information Systems 37(2): 536–562.

10.

Bradley

(2007) Job tenure as a moderator of stressor–strain relations: A comparison of experienced and new-start teachers. Work & Stress 21(1): 48–64.

11.

Burtch

Hong

Lee

(2022) How do peer awards motivate creative content? Experimental evidence from reddit. Management Science 68(5): 3488–3506.

12.

Burtch

Hong

Bapna

Griskevicius

(2018) Stimulating online reviews by combining financial incentives and social norms. Management Science 64(5): 2065–2082.

13.

Callaway

Sant’Anna

(2021) Difference-in-differences with multiple time periods. Journal of Econometrics 225(2): 200–230.

14.

Cao

Duan

Gan

(2011) Exploring determinants of voting for the “helpfulness” of online user reviews: A text mining approach. Decision Support Systems 50(2): 511–521.

15.

Carlson

Livermore

Rockmore

(2015) A quantitative analysis of writing style on the US supreme court. Washington University Law Review 93: 1461.

16.

Cavusoglu

Huang

K-W

(2015) Can gamification motivate voluntary contributions? The case of stackoverflow Q&A community. In: 18th ACM Conference Companion on Computer Supported Cooperative Work & Social Computing (CSCW'15 Companion) , Vancouver, BC, Canada, 14–18 March, pp. 171–174. New York, NY, USA: Association for Computing Machinery.

17.

Ceran

Singh

Mookerjee

(2016) Knowing what your customer wants: Improving inventory allocation decisions in online movie rental systems. Production and Operations Management 25(10): 1673–1688.

18.

Ceylan

Diehl

Proserpio

(2024) Words meet photos: When and why photos increase review helpfulness. Journal of Marketing Research 61(1): 5–26.

19.

Chan

Yang

Zeng

(2022) Bolstering ratings and reviews systems on multi-sided platforms: A co-creation perspective. Journal of Business Research 139: 208–217.

20.

Chen

Zheng

Ceran

(2016) De-biasing the reporting bias in social media analytics. Production and Operations Management 25(5): 849–865.

21.

Cheng

Fan

Guo

Huang

Qiu

(2020) Can “gold medal” online sellers earn gold? The impact of reputation badges on sales. Journal of Management Information Systems 37(4): 1099–1127.

22.

Chou

E-Y

Lin

C-Y

Huang

H-C

(2016) Fairness and devotion go far: Integrating online justice and value co-creation in virtual communities. International Journal of Information Management 36(1): 60–72.

23.

Cohen

(1988) Statistical Power Analysis for the Behavioral Sciences. New York, NY, USA: Routledge.

24.

Cui

Gallino

Moreno

Zhang

(2018) The operational value of social media information. Production and Operations Management 27(10): 1749–1769.

25.

Dellarocas

(2003) The digitization of word of mouth: Promise and challenges of online feedback mechanisms. Management Science 49(10): 1407–1424.

26.

Dellarocas

(2010) Online reputation systems: How to design one that does what you need. MIT Sloan Management Review 51(3): 33.

27.

Deodhar

Babar

Burtch

(2019) Falling from digital grace: Participation in online software contests following loss of status. In: 52nd Hawaii International Conference on System Sciences , Grand Wailea, Maui, Hawaii, USA, 8–11 January, pp. 4458–4465.

28.

Donath

(2007) Signals in social supernets. Journal of Computer-Mediated Communication 13(1): 231–251.

29.

Duan

Whinston

(2008) Do online reviews matter?—An empirical investigation of panel data. Decision Support Systems 45(4): 1007–1016.

30.

Duguid

Goncalo

(2015) Squeezed in the middle: The middle status trade creativity for focus. Journal of Personality and Social Psychology 109(4): 589.

31.

English

Morrison

Chalon

(2010) Moderator effects of organizational tenure on the relationship between psychological climate and affective commitment. Journal of Management Development 29(4): 394–408.

32.

Feng

(2016) Why do you return the favor in online knowledge communities? A study of the motivations of reciprocity. Computers in Human Behavior 63: 342–349.

33.

Festinger

(1957) A Theory of Cognitive Dissonance. Redwood City, CA, USA: Stanford University Press.

34.

Floyd

Freling

Alhoqail

Cho

Freling

(2014) How online product reviews affect retail sales: A meta-analysis. Journal of Retailing 90(2): 217–232.

35.

Fradkin

Holtz

(2023) Do incentives to review help the market? Evidence from a field experiment on airbnb. Marketing Science 42(5): 853–865.

36.

Gao

Wang

Ding

Guo

(2024) The pitfalls of review solicitation: Evidence from a natural experiment on tripadvisor. Management Science (Forthcoming).

37.

Garnefeld

Krah

Bohm

Gremler

(2021) Online reviews generated through product testing: Can more favorable reviews be enticed with free products? Journal of the Academy of Marketing Science 49: 733–722.

38.

Goes

Guo

Lin

(2016) Do incentive hierarchies induce user effort? Evidence from an online knowledge exchange. Information Systems Research 27(3): 497–516.

39.

Goes

Lin

Au Yeung

C-m

(2014) “Popularity effect” in user-generated content: Evidence from online product reviews. Information Systems Research 25(2): 222–238.

40.

Goodman

Friedman

(1971) An examination of Adams’ theory of inequity. Administrative Science Quarterly 16(3): 271–288.

41.

Hofer

Mrsic-Flogel

Bonhoeffer

Hübener

(2009) Experience leaves a lasting structural trace in cortical circuits. Nature 457(7227): 313–317.

42.

Hong

Kim

H-S

Lennon

(2018) The effects of perceived quality and usefulness of consumer reviews on review reading and purchase intention. The Journal of Consumer Satisfaction, Dissatisfaction and Complaining Behavior 31: 71–89.

43.

Hou

(2022) Space norms for constructing quality reviews on online consumer review sites. Information Systems Research 33(3): 1093–1112.

44.

Hovland

Janis

Kelley

(1953) Communication and Persuasion. New Haven, CT, USA: Yale University Press.

45.

Zhang

Pavlou

(2009) Overcoming the J-shaped distribution of product reviews. Communications of the ACM 52(10): 144–147.

46.

Joglekar

Davies

Anderson

(2016) The role of industry studies and public policies in production and operations management. Production and Operations Management 25(12): 1977–2001.

47.

Liu

Brass

(2020) Do online friends bring out the best in us? The effect of friend contributions on online review provision. Information Systems Research 31(4): 1322–1336.

48.

Khern-am-nuai

Hashim

Pinsonneault

Yang

(2023) Augmenting password strength meter design using the elaboration likelihood model: Evidence from randomized experiments. Information Systems Research 34(1): 157–177.

49.

Khern-am-nuai

Kannan

Ghasemkhani

(2018) Extrinsic versus intrinsic rewards for contributing reviews in an online platform. Information Systems Research 29(4): 871–892.

50.

Khern-am-nuai

Cohen

Adulyasak

(2024) Selecting cover images for restaurant reviews: AI vs. wisdom of the crowd. Manufacturing & Service Operations Management 26(1): 330–349.

51.

Khreiche

(2020) Responding to customer reviews: Do managerial responses enhance review quality? Master’s thesis, University of Twente.

52.

Kokkodis

Lappas

(2020) Your hometown matters: Popularity-difference bias in online reputation platforms. Information Systems Research 31(2): 412–430.

53.

Kumar

Qiu

Kumar

(2022) A hashtag is worth a thousand words: An empirical investigation of social media strategies in trademarking hashtags. Information Systems Research 33(4): 1403–1427.

54.

Kumar

Mookerjee

Shubham

(2018) Research in operations management and information systems interface. Production and Operations Management 27(11): 1893–1905.

55.

Lampel

Bhalla

(2007) The role of status seeking in online communities: Giving the gift of experience. Journal of Computer-Mediated Communication 12(2): 434–455.

56.

Lee

S-Y

Qiu

Whinston

(2018) Sentiment manipulation in online platforms: An analysis of movie tweets. Production and Operations Management 27(3): 393–416.

57.

Lefevere

(1983) Literature, comparative and translated. Babel 29(2): 70–75.

58.

Wang

Meng

Zhang

(2019) Making restaurant reviews useful and/or enjoyable? The impacts of temporal, explanatory, and sensory cues. International Journal of Hospitality Management 83: 257–265.

59.

Liang

Schuckert

Law

Chen

C-C

(2017) Be a “superhost”: The importance of badge systems for peer-to-peer rental accommodations. Tourism Management 60: 454–465.

60.

Manchaiah

Kelly-Campbell

Bellon-Harn

Beukes

(2020) Quality, readability, and suitability of hearing health-related materials: A descriptive review. American Journal of Audiology 29(3): 513–527.

61.

Marr

Thau

(2014) Falling from great (and not-so-great) heights: How initial status position influences performance after status loss. Academy of Management Journal 57(1): 223–248.

62.

Mayya

Viswanathan

(2024) Delaying informed consent: An empirical investigation of mobile apps’ upgrade decisions. Available at SSRN 3457018.

63.

Mayya

Viswanathan

Agarwal

(2021) Who forgoes screening in online markets and why? Evidence from airbnb. MIS Quarterly 45(4): 1745–1776.

64.

McCarthy

Jarvis

(2010) MTLD, vocd-D, and HD-D: A validation study of sophisticated approaches to lexical diversity assessment. Behavior Research Methods 42(2): 381–392.

65.

McCloskey

(2021) An examination of the data quality of online reviews: Who do consumers trust? Journal of Electronic Commerce in Organizations (JECO) 19(1): 24–42.

66.

McGuire William

(1985) Attitudes and attitude change. Handbook of Social Psychology 2: 233–346.

67.

Meehan

Wright

(2013) Power priorities in buyer–seller relationships: A comparative analysis. Industrial Marketing Management 42(8): 1245–1254.

68.

Mowrer

(1960) Learning Theory and Behavior. Hoboken, NJ, USA: John Wiley & Sons Inc.

69.

Mudambi

Schuff

(2010) Research note: What makes a helpful online review? A study of customer reviews on amazon.com. MIS Quarterly 34(1): 185–200.

70.

Murphy

McSweeney

Smith

McComas

(2003) Dynamic changes in reinforcer effectiveness: Theoretical, methodological, and practical implications for applied research. Journal of Applied Behavior Analysis 36(4): 421–438.

71.

Nilsson

Darr

Perry

(2018) Analysis of yelp ’elite’ reviewers in toronto. Available at: https://thomasnilsson.github.io/02805-social-graphs-2018/.

72.

Ohanian

(1990) Construction and validation of a scale to measure celebrity endorsers’ perceived expertise, trustworthiness, and attractiveness. Journal of Advertising 19(3): 39–52.

73.

Pan

Zhang

(2011) Born unequal: A study of the helpfulness of user-generated product reviews. Journal of Retailing 87(4): 598–612.

74.

Pavlou

Gefen

(2004) Building effective online marketplaces with institution-based trust. Information Systems Research 15(1): 37–59.

75.

Pettit

Yong

Spataro

(2010) Holding your place: Reactions to the prospect of status gains and losses. Journal of Experimental Social Psychology 46(2): 396–401.

76.

Petty

Cacioppo

(2012) Communication and Persuasion: Central and Peripheral Routes to Attitude Change. Berlin, Germany: Springer Science & Business Media.

77.

Petty

Cacioppo

Goldman

(1981) Personal involvement as a determinant of argument-based persuasion. Journal of personality and social psychology 41(5): 847.

78.

Petty

Cacioppo

Petty

, et al (1986) The Elaboration Likelihood Model of Persuasion. New York, NY, USA: Springer.

79.

Pritchard

(1969) Equity theory: A review and critique. Organizational Behavior and Human Performance 4(2): 176–211.

80.

Pritchard

Dunnette

Gorgenson

(1972) Effects of perceptions of equity and inequity on worker performance and satisfaction. Journal of Applied Psychology 56(1): 75.

81.

Chen

Qiu

Cheng

(2020) Does identity disclosure help or hurt user content generation? Social presence, inhibition, and displacement effects. Information Systems Research 31(2): 297–322.

82.

Liu

Chen

Qiu

Cheng

(2022) What questions are you inclined to answer? Effects of hierarchy in corporate Q&A communities. Information Systems Research 33(1): 244–264.

83.

Qiao

Lee

S-Y

Whinston

Wei

(2020) Financial incentives dampen altruism in online prosocial contributions: A study of online reviews. Information Systems Research 31(4): 1361–1375.

84.

Qiao

Rui

(2023) Text performance on the vine stage? The effect of incentive on product review text quality. Information Systems Research 34(2): 676–697.

85.

Rasool

Pathania

(2023) Word of mouse! What drives consumer voluntarism to write online reviews? Journal of Marketing Communications (Forthcoming).

86.

Rishika

Ramaprasad

(2019) The effects of asymmetric social ties, structural embeddedness, and tie strength on online content contribution behavior. Management Science 65(7): 3398–3422.

87.

Sahaym

Vithayathil

Sarker

Bjørn-Andersen

(2023) Value destruction in information technology ecosystems: A mixed-method investigation with interpretive case study and analytical modeling. Information Systems Research 34(2): 508–531.

88.

Sharma

Telang

Zentner

(2020) The impact of digital book readers on print sales: Analysis using genre exposure heterogeneity. Available at SSRN 3579521.

89.

Smith

(2021) Yelp statistics, user counts and facts. Available at: https://expandedramblings.com/index.php/yelp-statistics/ (accessed 15 June 2021).

90.

Srivastava

Kalro

(2019) Enhancing the helpfulness of online consumer reviews: The role of latent (content) factors. Journal of Interactive Marketing 48(1): 33–50.

91.

Sun

(2018) Online reviews and collaborative service provision: A signal-jamming model. Production and Operations Management 27(11): 1960–1977.

92.

Sun

Viswanathan

Huang

Zheleva

(2021) Designing promotional incentives to embrace social sharing: Evidence from field and online experiments. MIS Quarterly 45(2): 789–820.

93.

Sussman

Siegal

(2003) Informational influence in organizations: An integrated approach to knowledge adoption. Information Systems Research 14(1): 47–65.

94.

Wager

Athey

(2018) Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association 113(523): 1228–1242.

95.

Wagner

Hennig-Thurau

Rudolph

(2009) Does customer demotion jeopardize loyalty? Journal of Marketing 73(3): 69–85.

96.

Zhao

(2023) What consumer complaints should hoteliers prioritize? Analysis of online reviews under different market segments. Journal of Hospitality Marketing & Management 32(1): 1–28.

97.

Yan

Tan

Sun

(2019) Shared minds: How patients use collaborative information sharing via social media platforms. Production and Operations Management 28(1): 9–26.

98.

Yin

Bond

Zhang

(2014) Anxious or angry? Effects of discrete emotions on the perceived helpfulness of online reviews. MIS Quarterly 38(2): 539–560.

99.

Yin

Mitra

Zhang

(2016) Research note—When do consumers value positive vs. negative reviews? An empirical investigation of confirmation bias in online word of mouth. Information Systems Research 27(1): 131–144.

100.

Gutt

Khern-am-nuai

(2024) Review helpfulness score vs. review unhelpfulness score: Two sides of the same coin or different coins? IEEE Transactions on Engineering Management 71: 8031–8044.

101.

Khern-am-nuai

Pinsonneault

Wei

(2023) Impacts of social interactions and peer evaluations on online review platforms. Journal of Management Information Systems 40(4): 1271–1300.

102.

Zhang

Wei

Zeng

(2020) A matter of reevaluation: Incentivizing users to contribute reviews in online platforms. Decision Support Systems 128: 113158.

103.

Zhang

Yang

Qiao

Zhang

(2022) Responsive and responsible: Customizing management responses to online traveler reviews. Journal of Travel Research 61(1): 120–135.

104.

Zhu

Khern-am-nuai

(2024) Negative peer feedback and user content generation: Evidence from a restaurant review platform. Production and Operations Management (Forthcoming).

105.

Zhu

Yin

(2014) Is this opinion leader’s review useful? Peripheral cues for online review helpfulness. Journal of Electronic Commerce Research 15(4): 267.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

1.51 MB

		Pre-treatment				Post-treatment
Variable type	Variable	Mean	Std Dev	Min	Max	Mean	Std Dev	Min	Max
Intrinsic quality	Review length	885.4	428.0	19.0	4896.0	843.5	487.7	19.0	4974.0
	Reading ease	42.0	58.5	−2513.6	121.1	48.2	58.9	−2513.6	122.9
	Reading grade	20.1	21.9	−3.5	367.1	17.6	21.3	−3.5	367.1
	Gunning fog index	22.4	22.3	0.4	252.2	19.9	21.6	0.4	266.2
	Dale Chall readability score	8.2	2.8	0.1	36.2	7.9	2.7	0.1	38.2
	Lexical density	87.1	16.5	0.0	1575.0	85.9	22.6	0.0	1575.0
Perceived quality	Useful votes	1.1	6.2	0.0	1122.0	1.5	2.2	0.0	44.8
	Compliment votes	2.2	12.8	0.0	2255.0	3.3	5.1	0.0	125.8
Control	Number of reviews	50.4	50.6	1.0	776.0	14.4	18.3	1.0	350.0
	Average rating	3.8	0.4	1.0	5.0	3.9	0.7	1.0	5.0
	Years on Yelp	3.9	1.8	0.0	13.2	5.1	1.9	0.0	13.2
	Years as Elite	2.5	1.3	0.0	8.0	3.5	1.4	0.0	9.0