Abstract
INTRODUCTION
In service systems, allowing servers to take vacations when the congestion level is low can help reduce the system's operating cost and arouse servers' enthusiasm, see Tian and Zhang (2006). An efficient vacation mechanism in practice is to let the server resume service once the system's workload reaches a critical level. For example, in make‐to‐order production systems with high setup costs, the production line usually begins to operate only when the number of tasks reaches a critical level, see Guo and Hassin (2011) and Li et al. (2016) for more detailed discussions of make‐to‐order production systems.
This type of vacation termination rule is referred to as the
In this paper, we consider a new mechanism in a vacation queueing system, where each arriving customer, upon finding the server to be on vacation, is offered an opportunity to pay a fee to instantaneously end the server's vacation; we refer to this option as
PTAS enables customers to gain proactive control of their own service experience because, if they deem PTAS to be worthy, customers no longer need to wait for other (future) customers to help advance the service process. In some sense, PTAS can help address the fairness issue from the customers' perspective, it turns the control of the server's state from passive to active. In addition, the impact of PTAS is beyond the scope of an individual customer. A customer adopting PTAS may help improve the service experience of other customers, including (i)
In our vacation queueing model endowed with PTAS, customers are delay‐sensitive and strategic. They make the following one‐time decisions immediately upon their arrivals: (i) to join or to balk and (ii) if joining, to pay (for PTAS) or not to pay, in anticipation of their expected individual welfare. If no one utilizes PTAS and the server is on vacation, the service will automatically resume when the queue length reaches some designated threshold
Literature review
Our analysis has points of contact to four extant streams of research: (i) strategic behavior in vacation queues, (ii) strategic behavior in priority queues, (iii) two‐dimensional customer strategies, and (iv) information provision policies.
Strategic behavior in vacation queues
The research on strategic customers in queues was pioneered by Naor (1969), where arriving customers decide on whether to join an M/M/1 queue based on the available queue length. The case of unobservable queue for an M/M/1 model was developed by Edelson and Hilderbrand (1975). Following Naor (1969), strategic customer behavior in queueing systems has been widely studied in the literature, see Hassin and Haviv (2003), Stidham Jr (2009), and Hassin (2016) for comprehensive reviews. We hereby focus on reviewing works on vacation queues. The first work on vacation queues with
Strategic behavior in priority queues
Allowing customers to pay for priority has been proven an efficient way to increase service profit and social welfare in queueing systems. Adiri and Yechiali (1974) were the first to develop the pure equilibrium priority purchasing strategy in an observable queueing model. Their results were later extended by Hassin and Haviv (1997) to allow for mixed strategies in the same settings. Gavirneni and Kulkarni (2016) studied the equilibrium strategy in unobservable queues having heterogeneous customers. Wang et al. (2019) conducted a comparison analysis for the equilibrium performance of a priority queue under different information structures. The partial priority scheme is proposed by Yang et al. (2022) in Covid‐19 testing queues. Offering PTAS to customers looks in a way similar to allowing them to purchase a higher priority over others. Nevertheless, we draw a major distinction: In priority queues, the purchasing behavior is shown to be of pure follow‐the‐crowd (FTC) type, that is, a customer is more inclined to purchase priority when others do so as well, see, for example, Hassin and Haviv (1997, 2003). However, in our vacation queue model endowed with PTAS, as we will show later, FTC and avoid‐the‐crowd (ATC) often coexist when the threshold
Two‐dimensional customer strategies
In contrast to previous queueing game literature that focuses on either the customers' joining strategy or purchasing strategy (e.g., in priority queues), our framework allows customers' strategy to be a combination of both. At the heart of our equilibrium analysis is to establish the two‐dimensional joining‐and‐purchasing strategy. To our best knowledge, only a few papers in the queueing game literature have investigated this type of two‐dimensional equilibrium strategy. Hassin and Roet‐Green (2017) studied a queueing model where customers first decide on whether to inspect the queue length and then make a second join‐or‐balk decision. This work was later extended by Hassin and Roet‐Green (2018) to a two‐server queue with inspection costs. Wang et al. (2019) studied the joining and priority purchasing strategy in a priority queueing model. Besides the joining decisions, Cui et al. (2020) and Yang et al. (2021) considered queueing models where customers can choose to pay to improve their queueing positions. Other two‐dimensional settings can be found in referral priority models (Yang & Debo, 2019), online retailing queueing models (Wang et al., 2021a), multichannel service models with product exchange (Sun et al., 2022a), and restaurant models allowing orders to be placed ahead and then picked up later (Sun et al., 2022b). Motivated by cloud services, Dierks and Seuken (2021) solved the service provider's profit optimization problem and established a multidimensional equilibria. Abhishek et al. (2012) considered two pricing schemes for selling cloud services to two user classes. Gao et al. (2019) investigated a service system with two competing firms offering services under two different pricing and service rules, in which arriving customers need to decide on (i) whether to receive service; (ii) if yes, from which firm; and (iii) if choosing the bid‐based firm, the amount of her bid. We emphasize that the consideration of a two‐dimensional customer strategy adds significant complexity to the equilibrium analysis.
Information revelation policies
There is a stream of queueing literature that studies the impact of information provision on queueing outcomes. By studying social welfare under both full and no queue information, Hassin (1986) discovered that the revelation of real‐time queue length improves the social welfare because such information helps better match service capacity with customer demand. Chen and Frank (2004) investigated the system throughput under the two aforementioned information provision policies and discovered that delayed information may have both positive and negative effects on the system throughput. Simhon et al. (2016) considered the optimal information disclosure problem in an M/M/1 queue, and concluded that the commonly adopted threshold policy is never optimal. Hassin and Koshman (2017) proposed a new profit‐maximizing mechanism in that customers will be notified whether the queue length is below a certain threshold. Hu et al. (2018) found that throughput and social welfare can be unimodal in the fraction of informed customers; their findings infer that creating the “right” amount of information heterogeneity among customers may lead to improved outcomes. Similar results can be found in the retrial and priority queueing models, see Wang and Wang (2019) and Wang and Fang (2022). Anunrojwong et al. (2020) studied the effective design of information policies with the objective of reducing congestion in social services. Recently, Lingenbrink and Iyer (2019) solved a long‐standing open problem on the optimal signaling mechanism in unobservable queues; their illuminating findings suggested that such a signaling mechanism can be effective in achieving the optimal revenue in settings where state‐dependent pricing is infeasible.
Contributions and organization
In summary, we make the following contributions.
Organization of the paper
The rest of the paper is structured as follows. The model description is given in Section 2. In Section 3, we study an unobservable vacation queue model endowed with PTAS. We conduct equilibrium analysis in three steps: We first report the equilibrium joining strategy under a fixed PTAS purchasing probability (Propositions 2 and 3); we next develop the equilibrium PTAS purchasing strategy with exogenous arrival rates (Proposition 4); and finally, we integrate results in the previous two steps to establish the joint joining‐and‐purchasing strategy (Theorem 1). In Section 4, we study an observable vacation queue endowed with PTAS and characterize the SPE strategies (Theorems 2 and 3); we also provide the system performance in equilibrium. In Section 5 we compare the system performance under the two base information policies, investigate the revenue/pricing implications, and contrast our PTAS setting to other common mechanisms. We develop some extensions of our base models in the Supporting Information and draw concluding remarks in Section 6. All proofs are given in the Supporting Information.
MODEL DESCRIPTION
We consider a production system with
Customers are homogeneous and delay‐sensitive. They incur a delay cost at rate
We first study two main information policies: (1) unobservable queue (so customers' behavior will rely on their anticipation of the expected mean delay) and (2) observable queue (so that customers can make strategic decisions using the real‐time queue length).
4
We conduct equilibrium analysis in both cases and study which one provides more benefits from the service provider's perspective. To model the system dynamics as a
UNOBSERVABLE QUEUE
In this section, we establish customers' equilibrium strategy when the queue length is unobservable. In Section 3.1 we first study the steady‐state system performance under an arbitrary (mixed) strategy. In Sections 3.2–3.4, we fully describe the joint joining‐and‐purchasing equilibrium strategy.
Preliminaries
Because the server's state is observable, we let
Steady‐state performance
The state transition diagram of the CTMC

State transition diagram under strategy
Let
For any given strategy Consider an unobservable M/M/1 vacation queue with PTAS. Assume that all customers follow strategy The steady‐state probabilities are The expected queue length is The conditional expected waiting times of an arriving customer seeing an inactive server and an active server are The system throughput is The service provider's revenue collected by selling PTAS is
According to (8), the mean steady‐state queue length
To make contact with existing results in the literature, we advocate that our model is general and covers several previously studied queueing models. If no one adopts PTAS (i.e.,
Using results derived so far, we can investigate the equilibrium joining‐and‐purchasing strategies. When all customers adopt strategy
Although customer arrivals arise from a homogeneous Poisson stream, we assign one of two “labels” to all arrivals immediately upon their arrivals; we do so according to the specific server state (busy working or on vacation) they observe. In particular, customers seeing an active server (i.e., A strategy profile
Throughout the paper, we restrict our attention to symmetric Nash equilibrium. Similar definitions of state‐dependent symmetric equilibria can be found in (3.3) and (3.4) of Wang and Wang (2019).
As will soon become clear in subsequent analysis, there often exist multiple equilibria. To identify those that are most relevant, we resort to notion of utility dominance. Given two equilibrium strategies
In what follows, we will adopt the notion of Pareto dominance to identify the most efficient equilibrium strategy among multiple equilibria (whenever exist) that maximizes the ex ante expected utility of customers. We will first characterize the equilibrium strategy for
Equilibrium strategy for
‐customers
The following lemma guarantees the uniqueness of the equilibrium for For any given strategy of If If where
Proposition 2 indicates that
Equilibrium strategy for
‐customers
Because
It should be noted that in the subsequent analysis, the Pareto‐dominance criteria are no longer helpful in distinguishing two mixed equilibria because they can both induce a zero expected utility. In these cases, we will turn to the so‐called A two‐dimensional equilibrium strategy
ESS is useful in excluding the unstable mixed equilibria: If an equilibrium is stable, the system dynamics, when facing a small perturbation in customer behavior, is guaranteed to return to that equilibrium point. But this is not true for an unstable equilibrium. Following the steps to establish Proposition 2, we first obtain the best response of
Joining strategy
Consider an unobservable M/M/1 vacation queue with PTAS. For a given PTAS purchasing probability Equilibrium joining strategy with a fixed
According to Propositions 2 and 3, For a given Monotonicity in
PTAS purchasing strategy
In this subsection, we develop
Note that when
If the tagged customer adopts PTAS with probability
For any given When
Part Part When When When See Figure 2 for a graphical illustration of these three cases.

The function
Consider an unobservable M/M/1 vacation queue with PTAS. For a given joining probability If If If If where
First, it is the FTC behavior that gives rise to multiple equilibria (as in Cases
Joint equilibrium strategy
We are now ready to derive the joint equilibrium of
Consider the unobservable M/M/1 vacation queue with PTAS. Assume If
where all relevant parameters are given by (S33)–(S35) in the Supporting Information.
In Theorem 1, all nonzero equilibria can be classified into three categories:
Careful partition of the parameter space is less straightforward, and multiple equilibria may coexist due to the nonmonotonicity of
We next conduct a numerical example with

The unobservable case: equilibrium purchasing probability
OBSERVABLE QUEUE
In this section, we investigate the M/M/1 vacation queue with PTAS when the real‐time queue length is revealed to all arriving customers. Unlike the unobservable case where customers join the queue with a probability (independent with the queue length), their joining‐and‐purchasing decisions are now based on the real‐time system state, see Naor (1969). When a customer (if joining) is indifferent between accepting and rejecting PTAS upon arrival, we assume for simplicity that she will choose to pay for PTAS. A similar assumption can be found in observable priority queues, see Wang et al. (2021b). Next, we characterize the equilibrium strategy, and then compute the system performance in equilibrium.
Since state‐dependent decisions are made in the observable case, we consider this model with infinitely many decision makers (customers), each facing a state sampled from the state‐space A strategy
To characterize an SPE, one needs to consider all the system states in
Equilibrium analysis
Suppose a tagged customer arrives and finds an
Suppose the server is
If, in addition Consider an observable M/M/1 vacation queue with High reward: If Low reward: If
According to Theorem 2, when the PTAS fee is sufficiently small, any joining customer purchases PTAS (if seeing an inactive server) so the system will be activated by the first arriving customer in equilibrium. Besides, the system reduces to a standard work‐conservation queue in which customers join if and only if the queue length is below some threshold.
When the PTAS fee is higher than the cost of waiting for one future arrival, the tagged customer's best response has to take into account the behavior of future arrivals. Note that it may be worthwhile to wait for one arrival, but there is no guarantee that she will activate the server for sure. Let Consider an observable M/M/1 vacation queue with
According to Theorem 3, the SPE on the equilibrium path is of a threshold type, that is, when the server is on vacation, arriving customers will join without purchasing PTAS until the queue size reaches a threshold
System performance
Next, we derive the system performance under the equilibrium strategy in the observable case. In particular, we compute the system throughput and the PTAS revenue, which is the rate at which customers pay for PTAS, multiplied by the PTAS fee Consider an observable M/M/1 vacation queue with PTAS, the steady‐state probabilities, throughput, and revenue are given below. If If The system throughput and PTAS revenue are given by
The server, whenever on vacation, will be activated as soon as the queue length reaches a certain level. Unlike standard
We consider a numerical example to visualize results in Theorem 4. In Figure 4 we plot the throughput

Throughput and PTAS revenue in the observable vacation queue for different
COMPARISONS AND IMPLICATIONS
In this section, we first compare the system performance (e.g., throughput and PTAS revenue) and pricing implications under two information disclosure policies. Next, we benchmark the performance of our PTAS vacation model to that of a regular vacation queue without PTAS. Finally, we study how our PTAS model distinguishes from the pay‐for‐priority queues.
Impact of service reward
For any fixed PTAS fee
In the observable case, the expected customer utility depends on their queueing positions. So a smaller service reward
In Figure 5, we plot the PTAS revenue under two information policies as a function of the service reward. Consistent with Theorem 5, Figure 5 shows that, when

Comparison of revenue and system throughput under two information structures for different
Impact of congestion level
For any fixed PTAS fee
Results in Theorem 6 are consistent with the general consensus: When the potential arrival rate is sufficiently small, all customers in the unobservable model join the system; but in the observable case, balking can still happen when customers observe a longer queue upon arrival. Next, we proceed to compare the system performance measures under two information levels relative to the case without PTAS. Note that the server in a standard vacation queue can never be activated if Under both information policies, the PTAS revenue is nonmonotonic in the congestion level
At a quick look, the fact that the PTAS revenue is not monotonically increasing in the potential demand seems to counter the conventional wisdom. In fact, the market size
In the unobservable (observable) case, the revenue reaches its peak at some finite The optimal prices satisfy
When the demand volume is sufficiently low, the only way to activate the server is through purchasing PTAS (because the queue length almost never reaches
In Figure 6 we use a numerical example to illustrate the PTAS revenue (left panel) and optimal PTAS fee (right panel) under the two information policies. Consistent with results in Theorem 8, Figure 6 shows that a higher PTAS fee should be set in the observable case. In addition, the PTAS revenue has a unimodal form in the demand volume, and the optimal PTAS fee is weakly decreasing in

Comparisons of optimal PTAS revenue and corresponding PTAS fee under two information structures for different
Advantage of PTAS in vacation queues
In this subsection, we investigate how the PTAS mechanism benefits vacation queues. In particular, we provide a comparison of throughput in two models: an M/M/1 vacation queue with PTAS and an PTAS achieves improved system throughput for the M/M/1 vacation queue in both the observable and unobservable cases.
PTAS improves the system throughput by allowing customers to activate service immediately upon their arrivals rather than awaiting future arrivals to increment the queue length to level
In support of Theorem 9, we give a numerical example to compare the system throughput for vacation models with and without PTAS for different service reward

Comparing the system's throughput functions in unobservable vacation queues with and without PTAS, with
PTAS versus pay‐for‐priority
In Section 3, we have given a brief discussion on how the equilibrium strategy of our PTAS queue differs from that of the priority queue. To reiterate, a major distinction is that the priority queue exhibits a pure FTC behavior while PTAS shows a more sophisticated behavior that is the hybrid of both FTC and ATC. To further this discussion, we next compare the optimal revenue (i.e., revenue under the optimal fee) of these two models. We hereby restrict our attention to the unobservable setting (i.e., the server's state is observable but the queue length is not).
7
Denote by In the unobservable case, we have
When the demand volume is low, customers in a priority queue anticipate a smaller expected delay so they intend not to pay for priority service, whereas in our PTAS model, customers are more inclined to purchase PTAS, because otherwise the server's vacation may last for a longer time. When the demand volume is high, customers are incentivized to mitigate their delay via the purchase of priority, while PTAS becomes less necessary because the queue is already long enough to reach level

Comparison of
CONCLUSION
In this paper, we study the equilibrium performance of a vacation queueing model with strategic customers. Unlike standard vacation queues in the extant literature where the server's vacation is ended whenever the queue length reaches a critical level, we introduce a new mechanism in that a customer, upon finding the server to be on vacation, may choose to pay a fee to end the server's vacation. This mechanism is referred to as PTAS. The ingenuity of PTAS lies in its ability to allow the server (when on vacation) to be activated immediately by arriving customers, which gives customers more active controls on the server's state (so earlier customer arrivals no longer need to passively wait for future customers to reach the critical queue threshold).
In the present model, customers seeing an inactive server need to make two decisions: (i) whether to join the queue and (ii) if yes, whether to pay for PTAS. We investigate customers' equilibrium joining‐and‐purchasing strategies, and study their responses to this mechanism under three information cases: (i) observable queue and server state, (ii) unobservable queue and observable server state, and (iii) unobservable queue and server state. Our theoretical analysis reveals results that are seemingly contrary to the conventional wisdom. For example, due to the coexistence of FTC and ATC behavior, a higher service reward does not always guarantee a higher system throughput. In addition, the PTAS revenue is a nonmonotone function in the demand volume (a higher potential demand may even yield a lower revenue). These findings provide quantitative and qualitative insights into the system design of vacation queue systems. We also conduct a careful performance comparison of different information policies.
There are several avenues for future research. One interesting direction is to study customers' rational abandonment behavior in response to the new PTAS mechanism. Another potential topic is to allow the service provider to dynamically adjust the PTAS fee based on the real‐time queue length in order to further improve the system revenue.
