Preprint
Article

This version is not peer-reviewed.

The Robustness of the cm-Rule for an Unreliable Single-Server Two-Class Queueing System with Constant Retrial Rates

A peer-reviewed article of this preprint also exists.

Submitted:

10 August 2023

Posted:

11 August 2023

You are already at the latest version

Abstract
We study the robustness of the cμ-rule for the optimal allocation of a resource consisting of one unreliable server to parallel queues with two different classes of customers. The customers in queues can be served with respect to a FCFS retrial discipline, when the customers at the head of queues repeatedly tries to occupy the server in a random time. It is proved that for the scheduling problem in the system without arrivals the cμ-rule minimizes the total average cost. For the system with arrivals it is difficult directly to prove the optimality of the same policy with explicit relations. We derived for an infinite-buffer model a static control policy that also prescribes for the certain values of system parameters the service exclusively for the class-i customers if both of queues are not empty with the aim to minimize the average cost per unit of time. It is also shown that in a finite-buffer case the cμ-rule fails.
Keywords: 
;  ;  ;  ;  

1. Introduction

The modelling and analysis of telecommunications and computer systems is now inconceivable without the various tasks associated with resource allocation and formulated in the framework of the queueing theory. One of such classic problems is to allocate a server to multiple parallel queues with the mostly studied objective to minimize the average cost per unit of time. It was shown that for many systems with such kind of resource allocation problem the allocation policy in form of the c μ -rule is optimal. In literature this rule is also known as Smith’s rule or Weighted Shortest Processing Time. According to this rule the waiting class-i customer from a non-empty queue is allocated to the server if it has a minimal weight of the form c i μ i , where c i is a holding cost per unit of time the customer is in the queue or at server and μ i is a overall service rate of the class-i customer. The c μ -rule is a very attractive policy, since it is a static one and requires only the information whether a certain queue is empty or not. Obviously, to apply such a policy the values c i and μ i must be known, which unfortunately is not always the case, especially for the overall service intensity.
The optimality of the c μ -rule for ordinary multi-class single-server queues in different settings was proved, see e.g. [3,4]. In [16], the authors analyzed the c μ -rule for queueing models with non-linear costs. The optimality of this rule for discrete-time queueing models with a general distributed inter-arrival time and geometrically distributed service time was established in [2]. A concept of a generalized c μ -rule was proposed in [16], where it was shown that this rule is asymptotically optimal for non-decreasing convex delay costs. A classic scheduling problem with a single resource shared by two competing queues was considered in [10] in the context of the stochastic flow model where it was shown that the c μ -rule is optimal. The non-preemptive assignment of a single server to two infinite-capacity retrial queues was analyzed in [18] where c μ -rule was optimal for a scheduling problem in a system without arrivals. The authors in [11] considered the learning-based variants of the c μ -rule where the service rates μ are unknown.
The constant retrial policy was introduced in [8], where it was assumed that only the customer at the head of the queue can request for service after an exponentially distributed retrial time. The single class queueing systems with constant retrial rate and different options were analysed intensively, e.g. in [6,7,17] and other papers. The uncontrolled analogue of a two-class queueing system has also been investigated by a number of authors. For example, in [1] the authors studied a two-class system with a single exponential service requirements and constant retrial policy. The regenerative approach was used there to derive a set of necessary stability conditions of such a system. The generating function of the stationary distribution of the number of orbiting customers at service completion epochs was obtained in [5].
This paper deals with a controllable unreliable single-server two-class queueing system with constant retrial rates. The optimal allocation problem for this queueing model with retrial and reliability attributes is a new one. The emphases of the paper is on answering the question how robust is the c μ -rule as an optimal allocation policy in the queueing system under study. The queueing system is studied without and with arrivals. In first case explicit relations of the c μ -rule can be derived. In second case, the relations for the optimality of the c μ -rule were obtained for the model with a certain constraint on arrival process.
In Section 2 the queueing model is described. The main results including the analysis of the optimal allocation policy and some numerical experiments are presented in Section 3.

2. Model description

We analyze the markovian single-server queueing system servicing two classes of customers as illustrated in Figure 1. The customers of each class i = 1 , 2 arrive to the system according to a Poisson stream with a rate λ i . Independently of the state of server the class-i customers join upon arrival the corresponding waiting line or queue with infinite capacity N i = .
The service of customers from the queue occurs according to a FIFO retrial discipline, i.e. the customer waiting at the head of the queue retries to occupy the server in exponentially distributed time with a rate θ i . The server during a service process of the ith-class customer can fail in exponentially distributed time with a rate α i . In this case the customer leaves the service area and joins again the head of its queue. In failed state the server can be repaired in exponentially distributed time with a rate β . All type of time intervals are assumed to be mutually independent. At moments of retrial arrival the idle server may accept the customer of a certain class, who attempts to occupy the server, or can deny the service. The system performance is described by the steady state average cost which is of the form
g f = c 1 L 1 f + c 2 L 2 f ,
where L i f is the average number of the ith-class customers present in the system given the allocation policy is f and c i the holding cost per unit of time the ith-class customer spends in the system. The objective of the present analysis is to provide how robust is a static policy f defined as a c μ -rule for the system under study by minimizing the average cost per unit of time (1).
Denote by Q i ( t ) the number of ith-class customers in the system at time t and by D ( t ) the state of the server at time t which is defined as follows,
D ( t ) = 0 the server is idle , 1 the server services the 1 st class customer , 2 the server services the 2 nd class customer , 3 the server is failed .
Consider the three-dimensional continuous-time Markov chain
{ X ( t ) } t 0 = { Q 1 ( t ) , Q 2 ( t ) , D ( t ) } t 0 ,
with a state space
E = { x = ( q 1 , q 2 , d ) : q 1 , q 2 N 0 , d { 0 , 1 , 2 , 3 } }
and policy-dependent infinitesimal matrix Λ f = [ λ x y f ] with components for x y ,
λ x y f = λ i y = x + e i , μ i y = x i e 3 , d ( x ) = i , θ i y = x e i + i e 3 , d ( x ) = 0 , f ( x ) = i , α i y = x + e i i e 3 , d ( x ) = i , β y = x 3 e 3 , 0 otherwise ,
where e i stands for a vector of dimension 3 with 1 in the ith position, i = 1 , 2 , 3 , and 0 elsewhere. Here f is a control policy which prescribes at the moments of retrial arrival the allocation of the ith-class customer to the server. The set of admissible control actions is defined by A = { 1 , 2 } , where f ( x ) = i A means that upon retrial arrival in state x the ith-class customer must be accepted for service. The sets of admissible control actions in state x is denoted by A ( x ) A , where A ( x ) = A for x = ( q 1 , q 2 , 0 ) , q 1 , q 2 1 , A ( x ) = 1 for x = ( q 1 , 0 , 0 ) and A ( x ) = 2 for x = ( 0 , q 2 , 0 ) . It is assumed that the stability condition is fulfilled. According to a general result for the M / G / 1 system with parallel queues, see e.g. [4], the system is stable if the total load ρ = ρ 1 + ρ 2 < 1 , where ρ i the load of the class-i queue. The value ρ i can be obtained if we treat the class-i queue as independent single-server queueing system with parameters λ i , μ i , θ i , α i and β . The corresponding continuous-time Markov-chain is then a quasi-birth-and-death (QBD) process with a three-diagonal block infinitesimal matrix
Q = B 0 B 1 0 0 A 2 A 1 A 0 0 0 A 2 A 1 A 0 0 0 A 2 A 1 , where B 0 = λ i 0 0 μ i ( λ i + μ i + α i ) 0 0 0 1 , B 1 = λ i 0 0 0 λ i α i 0 0 0 , A 1 = ( λ i + θ i ) 0 0 μ i ( λ i + μ i + α i ) 0 β 0 ( λ i + β ) , , A 0 = λ i 0 0 0 λ i α i 0 0 λ i , A 2 = 0 θ i 0 0 0 0 0 0 0 .
According to a matrix-analytic approach for QBD-processes [12], the queue i is stable if the following inequality holds,
p A 2 e > p A 0 e , e = ( 1 , 1 , 1 ) .
where p = ( p 1 , p 2 , p 3 ) is a stationary distribution for the infinitesimal transition matrix
A = A 0 + A 1 + A 2 = θ i θ i 0 μ i ( α i + μ i ) α i β 0 β
which satisfies the system p A = 0 and p e = 1 . The solution of the system is given by
p 1 = ( α i + μ i ) β C , p 2 = β θ i C , p 3 = α θ i C ,
where C = α i ( β + θ i ) + β ( μ i + θ i ) . By substituting the solution (5) into (4) we get
ρ i = λ i C β i μ i θ i = λ i α i + μ i μ i θ i θ i β α i + β α i + μ i + 1 < 1 .
and the stability condition is then defined as
i = 1 2 λ i α i + μ i μ i θ i θ i β α i + β α i + μ i + 1 < 1 .
Below we present the main results of this paper obtained for a system with two classes of customers. However, these results can also be generalized to the case of an arbitrary number of classes.

3. Optimal allocation problem

Consider first a classic scheduling problem for the allocation of customers in the system without arrivals, i.e. when λ 1 = λ 2 = 0 , in which the customers are queued. The waiting customers must be served in such a way that the overall average cost is minimized. It is assumed that the allocation policy f ( q 1 , 0 , 0 ) = 1 and f ( 0 , q 2 , 0 ) = 2 for q 1 , q 2 1 . Here we have a classical scheduling problem
Proposition 1.
In state x = ( q 1 , q 2 , 0 ) the optimal allocation policy can be defined in form of a c μ -rule,
f ( x ) = 1 , if c 2 μ ˜ 1 c 1 μ ˜ 2 , 2 , if c 2 μ ˜ 1 c 1 μ ˜ 2 ,
where μ ˜ i = m i α i + β m i μ i β + 1 , m i = α i + μ i μ i θ i , i = 1 , 2 . In case of equality c 2 μ ˜ 1 = c 1 μ ˜ 2 , the control actions 1 and 2 are equivalent.
Proof. 
Denote by V ( x ) the total minimal average cost given the initial state is x E . This function is given by
V ( q 1 , q 2 , 0 ) = min 1 θ 1 ( q 1 c 1 + q 2 c 2 ) + V ( q 1 1 , q 2 , 1 ) , 1 θ 2 ( q 1 c 1 + q 2 c 2 ) + V ( q 1 , q 2 1 , 2 ) ,
q 1 , q 2 1 , V ( q 1 , q 2 , 1 ) = 1 μ 1 + α 1 ( ( q 1 + 1 ) c 1 + q 2 c 2 ) + μ 1 μ 1 + α 1 V ( q 1 , q 2 , 0 ) + α 1 μ 1 + α 1 V ( q 1 + 1 , q 2 , 3 ) ,
q 1 , q 2 0 , V ( q 1 , q 2 , 2 ) = 1 μ 2 + α 2 ( q 1 c 1 + ( q 2 + 1 ) c 2 ) + μ 2 μ 2 + α 2 V ( q 1 , q 2 , 0 ) + α 2 μ 2 + α 2 V ( q 1 , q 2 + 1 , 3 ) ,
q 1 , q 2 0 ,
V ( q 1 , q 2 , 3 ) = 1 β ( q 1 c 1 + q 2 c 2 ) + V ( q 1 , q 2 , 0 ) , q 1 , q 2 0 .
We need to prove that f ( q 1 , q 2 , 0 ) = 1 if c 2 μ ˜ 1 < c 1 μ ˜ 2 as defined in a proposition. First we show that f ( q 1 , q 2 1 , 0 ) = 1 implies f ( q 1 , q 2 , 0 ) = 1 . In case f ( q 1 , q 2 , 0 ) = 2 and f ( q 1 , q 2 1 , 0 ) = 1 we get from (8), () and () after some simple algebra the relation for V ( q 1 , q 2 , 0 ) ,
V ( q 1 , q 2 , 0 ) = 1 μ 2 + α 2 β μ 2 + α 2 + μ 2 μ 2 θ 2 ( q 1 c 1 + q 2 c 2 ) + 1 μ 1 + α 1 β μ 1 + α 1 + μ 1 μ 1 θ 1 ( q 1 c 1 + ( q 2 1 ) c 2 ) + V ( q 1 1 , q 2 1 , 0 ) .
In case f ( q 1 , q 2 , 0 ) = 1 , if in state ( q 1 , q 2 1 , 0 ) the action 2 is chosen instead of the action 1, we get the following inequality,
V ( q 1 , q 2 , 0 ) 1 μ 1 + α 1 β μ 1 + α 1 + μ 1 μ 1 θ 1 ( q 1 c 1 + q 2 c 2 ) + 1 μ 2 + α 2 μ 2 β + α 2 + μ 2 μ 2 θ 2 ( ( q 1 1 ) c 1 + q 2 c 2 ) + V ( q 1 1 , q 2 1 , 0 ) .
Now, if f ( q 1 , q 2 , 0 ) = 1 is optimal, then the difference of expressions (12) and (13) must be positive, i.e.
c 1 1 μ 2 + α 2 β μ 2 + α 2 + μ 2 μ 2 θ 2 c 2 1 μ 1 + α 1 β μ 1 + α 1 + μ 1 μ 1 θ 1 = c 1 α 2 + μ 2 μ 2 θ 2 θ 2 β α 2 + β α 2 + μ 2 + 1 c 2 α 1 + μ 1 μ 1 θ 1 θ 1 β α 1 + β α 1 + μ 1 + 1 = c 1 m 2 α 2 + β m 2 μ 2 β + 1 c 2 m 1 α 1 + β m 1 μ 1 β + 1 0 .
what is coincide with a statement.
In case of equality c 2 μ ˜ 1 = c 1 μ ˜ 2 the control actions 1 and 2 are equivalent, i.e. f ( x ) = 1 2 for x = ( q 1 , q 2 , 0 ) . We show that the optimal policy f ( q 1 1 , q 2 , 0 ) = 1 2 and f ( q 1 , q 2 1 , 0 ) = 1 2 implies that f ( q 1 , q 2 , 0 ) = 1 2 . Note that f ( 0 , q 2 , 0 ) = 2 and f ( q 1 , 0 , 0 ) = 1 for q 1 , q 2 1 . For the minimum in (8) in case f ( q 1 , q 2 1 , 0 ) = 1 2 we get then for the control action f ( q 1 , q 2 , 0 ) = 2 the relation
1 θ 2 ( q 1 c 1 + q 2 c 2 ) + V ( q 1 , q 2 1 , 2 )
in form (12). For the control action f ( q 1 , q 2 , 0 ) = 1 the relation
1 θ 1 ( q 1 c 1 + q 2 c 2 ) + V ( q 1 1 , q 2 , 1 )
is equal to the right hand side of the inequality (13). The difference of the total average cost given by the relation in (14) for equivalent actions must be equal to zero. Therefore, if c 2 μ ˜ 1 = c 1 μ ˜ 2 , the control action 1 and 2 are equivalent. □
It is assumed now that new customers can arrive to the system, i.e. λ 1 > 0 , λ 2 > 0 . We expect that the same c μ -rule defined in (7) will be optimal for the system with arrivals but from technical reasons it is quite difficult to derive expressions for the mean overall service times of the ith-class customers. Therefore, to analyse the properties of a an optimal control policy we have to introduce a queueing model with a constraint on possible arrivals. This model differs from original one, since we assume that a new arrival can occur only in state x = ( q 1 , q 2 , 0 ) , q 1 , q 2 0 , where the server is empty. The dynamic-programming approach, see e.g. [9], [14,15], is used to establish the properties of the optimal control policy in the following Proposition. Note that the state space of the corresponding Markov decision process (MDP) is infinite and the costs are unbounded. The existence of an average cost optimal stationary policy and convergence of the value-iteration algorithm can be verified in the same way as it was done in [13], where the authors generalized the existence of the optimal policy for the discounted expected total cost minimization to the average cost criterion.
Proposition 2.
In state x = ( q 1 , q 2 , 0 ) the optimal allocation policy for the system with a constraint on arrivals can be defined in form
f ( x ) = 1 , if c 2 m 1 c 1 m 2 , α 1 + β m 1 μ 1 α 2 + β m 2 μ 2 , 2 if c 2 m 1 c 1 m 2 , α 1 + β m 1 μ 1 α 2 + β m 2 μ 2 .
In case of equalities c 2 m 1 = c 1 m 2 and α 1 + β m 1 μ 1 = α 2 + β m 2 μ 2 , the control actions 1 and 2 are equivalent.
Proof. 
For the introduced cost structure the average cost stationary optimal policy exists. This policy can be found as a solution of the system of optimality equations for the dynamic-programming relative value function v : E R and gain g,
v ( q 1 , q 2 , 0 ) = q 1 c 1 + q 2 c 2 g + λ 1 v ( q 1 + 1 , q 2 , 0 ) + λ 2 v ( q 1 , q 2 + 1 , 0 ) + min { θ 1 v ( q 1 1 , q 2 , 1 ) + ( 1 θ 1 λ 1 λ 2 ) v ( q 1 , q 2 , 0 ) , θ 2 v ( q 1 , q 2 1 , 2 ) + ( 1 θ 2 λ 1 λ 2 ) v ( q 1 , q 2 , 0 ) } ,
where q 1 , q 2 1 and c ( q 1 , q 2 , 0 ) = q 1 c 1 + q 2 c 2 is an immediate cost of the corresponding MDP. In states with the only one nonempty queue the optimal policy consists in service of the corresponding customer, the relative value functions are of the form,
v ( q 1 , 0 , 0 ) = q 1 c 1 g + λ 1 v ( q 1 + 1 , 0 , 0 ) + λ 2 v ( q 1 , 1 , 0 ) + θ 1 v ( q 1 1 , 0 , 1 ) + ( 1 θ 1 λ 1 λ 2 ) v ( q 1 , 0 , 0 ) ,
v ( 0 , q 2 , 0 ) = q 2 c 2 g + λ 1 v ( 1 , q 2 , 0 ) + λ 2 v ( 0 , q 2 + 1 , 0 ) + θ 2 v ( 0 , q 2 1 , 2 ) + ( 1 θ 2 λ 1 λ 2 ) v ( 0 , q 2 , 0 ) ,
where q 1 , q 2 1 . The value v ( 0 , 0 , 0 ) = 0 , then g = λ 1 v ( 1 , 0 , 0 ) + λ 2 v ( 0 , 1 , 0 ) . The equations for states where the server is busy or failed are
v ( q 1 , q 2 , 1 ) = ( q 1 + 1 ) c 1 + q 2 c 2 g + μ 1 v ( q 1 , q 2 , 0 ) + α 1 v ( q 1 + 1 , q 2 , 3 )
+ ( 1 μ 1 α 1 ) v ( q 1 , q 2 , 1 ) , v ( q 1 , q 2 , 2 ) = q 1 c 1 + ( q 2 + 1 ) c 2 g + μ 2 v ( q 1 , q 2 , 0 ) + α 2 v ( q 1 , q 2 + 1 , 3 )
+ ( 1 μ 2 α 2 ) v ( q 1 , q 2 , 2 ) ,
v ( q 1 , q 2 , 3 ) = q 1 c 1 + q 2 c 2 + β v ( q 1 , q 2 , 0 ) + ( 1 β ) v ( q 1 , q 2 , 3 ) .
The equation (16) can be expressed using (19)–(21) in the following way,
v ( q 1 , q 2 , 0 ) = q 1 c 1 + q 2 c 2 g + λ 1 v ( q 1 + 1 , q 2 , 0 ) + λ 2 v ( q 1 , q 2 + 1 , 0 ) + min { θ 1 β α 1 + β α 1 + μ 1 ( q 1 c 1 + q 2 c 2 g ) + μ 1 θ 1 α 1 + μ 1 v ( q 1 1 , q 2 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v ( q 1 , q 2 , 0 ) , θ 2 β α 2 + β α 2 + μ 2 ( q 1 c 1 + q 2 c 2 g ) + μ 2 θ 2 α 2 + μ 2 v ( q 1 , q 2 1 , 0 ) + α 2 θ 2 α 2 + μ 2 + ( 1 λ 1 λ 2 θ 2 ) v ( q 1 , q 2 , 0 ) } .
We rewrite equations (17) and (18) in the same way,
v ( q 1 , 0 , 0 ) = q 1 c 1 g + λ 1 v ( q 1 + 1 , 0 , 0 ) + λ 2 v ( q 1 , 1 , 0 ) + θ 1 β α 1 + β α 1 + μ 1 ( c 1 q 1 g )
+ μ 1 θ 1 α 1 + μ 1 v ( q 1 1 , 0 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v ( q 1 , 0 , 0 ) , v ( 0 , q 2 , 0 ) = q 2 c 2 g + λ 1 v ( 1 , q 2 , 0 ) + λ 2 v ( 0 , q 2 + 1 , 0 ) + θ 2 β α 2 + β α 2 + μ 2 ( c 2 q 2 g )
+ μ 2 θ 2 α 2 + μ 2 v ( 0 , q 2 1 , 0 ) + α 2 θ 2 α 2 + μ 2 + ( 1 λ 1 λ 2 θ 2 ) v ( 0 , q 2 , 0 ) .
The solution for the system of optimality equations can be calculated recursively using equivalent discrete-time model on a finite horizon obtained by an uniformization procedure. The corresponding recursive relations have almost the same structure. Namely, the equations (22)–(24) can be rewritten in such a way that on the left-hand side the function v ( x ) is replaced by the n + 1 -stage cost function v n + 1 ( x ) and on the right-hand side we put g = 0 and function v ( x ) is replaced by the n-stage cost function v n ( x ) . It is assumed that the initial condition is v 0 ( x ) = 0 , x E . Let the inequalities in the first raw of (15) hold. Consider the term for action selection in obtained recursive relations. In case
θ 1 β α 1 + β α 1 + μ 1 ( q 1 c 1 + q 2 c 2 ) + μ 1 θ 1 α 1 + μ 1 v n ( q 1 1 , q 2 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v n ( q 1 , q 2 , 0 ) θ 2 β α 2 + β α 2 + μ 2 ( q 1 c 1 + q 2 c 2 ) + μ 2 θ 2 α 2 + μ 2 v n ( q 1 , q 2 1 , 0 ) + α 2 θ 2 α 2 + μ 2 + ( 1 λ 1 λ 2 θ 2 ) v n ( q 1 , q 2 , 0 )
for any n N 0 , the optimal control action is f ( q 1 , q 2 , 0 ) = 1 for an arbitrary n. The statement is proved by induction on n. If n = 0 , the inequality (25) obviously holds. Assume the validity of this inequality for some n. Then it must be proved that (25) holds for n n + 1 . Expressions for v n + 1 ( q 1 1 , q 2 , 0 ) , v n + 1 ( q 1 , q 2 1 , 0 ) and v n + 1 ( q 1 , q 2 , 0 ) can be obtained from (22)–(24). The first terms by q 1 c 1 + q 2 c 2 in inequality (25) are multiplied by the factor ν = 1 defined as
ν = λ 1 + λ 2 + μ 1 θ 1 α 1 + μ 1 + α 1 θ 1 α 1 + μ 1 + 1 λ 1 λ 2 θ 1 .
In case q 1 2 , q 2 1 , we obtain from (25) the following inequality,
ν θ 1 β α 1 + β α 1 + μ 1 ( q 1 c 1 + q 2 c 2 ) + μ 1 θ 1 α 1 + μ 1 [ θ 1 β α 1 + β α 1 + μ 1 + 1 ( ( q 2 1 ) c 1 + q 2 c 2 ) + λ 1 v n ( q 1 , q 2 , 0 ) + λ 2 v n ( q 1 1 , q 2 + 1 , 0 ) + μ 1 θ 1 α 1 + μ 1 v n ( q 1 2 , q 2 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v n ( q 1 1 , q 2 , 0 ) ] + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) [ θ 1 β α 1 + β α 1 + μ 1 + 1 × × ( q 1 c 1 + q 2 c 2 ) + λ 1 v n ( q 1 + 1 , q 2 , 0 ) + λ 2 v n ( q 1 , q 2 + 1 , 0 ) + μ 1 θ 1 α 1 + μ 1 v n ( q 1 1 , q 2 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v n ( q 1 , q 2 , 0 ) ] ν θ 2 β α 2 + β α 2 + μ 2 ( q 1 c 1 + q 2 c 2 ) + μ 2 θ 2 α 2 + μ 2 [ θ 1 β α 1 + β α 1 + μ 1 + 1 ( q 1 c 1 + ( q 2 1 ) c 2 ) + λ 1 v n ( q 1 + 1 , q 2 1 , 0 ) + λ 2 v n ( q 1 , q 2 , 0 ) + μ 1 θ 1 α 1 + μ 1 v n ( q 1 1 , q 2 1 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v n ( q 1 , q 2 1 , 0 ) ] + α 2 θ 2 α 2 + μ 2 + ( 1 λ 1 λ 2 θ 2 ) [ θ 1 β α 1 + β α 1 + μ 1 + 1 × × ( q 1 c 1 + q 2 c 2 ) + λ 1 v n ( q 1 + 1 , q 2 , 0 ) + λ 2 v n ( q 1 , q 2 + 1 , 0 ) + μ 1 θ 1 α 1 + μ 1 v n ( q 1 1 , q 2 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v n ( q 1 , q 2 , 0 ) ] ,
where we multiply the first term in the left- and right-hand side by the factor ν = 1 . The inequality (26) which is valid due to the following reasons. For the terms with the factor λ 1 by adding to the left-hand side the item c 1 θ 1 β α 1 + β α 1 + μ 1 and to the right-hand side the item c 1 θ 2 β α 2 + β α 2 + μ 2 we get,
θ 1 β α 1 + β α 1 + μ 1 ( ( q 1 + 1 ) c 1 + q 2 c 2 ) + μ 1 θ 1 α 1 + μ 1 v n ( q 1 , q 2 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v n ( q 1 + 1 , q 2 , 0 ) θ 2 β α 2 + β α 2 + μ 2 ( ( q 1 + 1 ) c 1 + q 2 c 2 ) + μ 2 θ 2 α 2 + μ 2 v n ( q 1 + 1 , q 2 1 , 0 ) + α 2 θ 2 α 2 + μ 2 + ( 1 λ 1 λ 2 θ 2 ) v n ( q 1 + 1 , q 2 , 0 ) .
The inequality (27) holds due to the induction assumption (25) in state x = ( q 1 + 1 , q 2 , 0 ) . In the same way prove the inequality for the terms with the factor λ 2 . In this case we add to the left-hand side and right-hand side of the corresponding inequality respectively the elements c 2 θ 1 β α 1 + β α 1 + μ 1 and to the right-hand side c 2 θ 2 β α 2 + β α 2 + μ 2 . The the induction assumption (25) in state x = ( q 1 , q 2 + 1 , 0 ) can be applied. The inequality obtained for the terms with a factor μ 1 θ 1 α 1 + μ 1 by subtracting c 1 θ 1 β α 1 + β α 1 + μ 1 and c 1 θ 2 β α 2 + β α 2 + μ 2 respectively from the left-hand side and right-hand side holds as well due to the induction assumption in state x = ( q 1 1 , q 2 , 0 ) . The inequalities for the terms with factors α 1 θ 1 α 1 + μ 1 and ( 1 λ 1 λ 2 θ ) satisfy directly the induction assumption (25). The rest of terms builds the inequality of the form
θ 1 β α 1 + β α 1 + μ 1 λ 1 c 1 λ 2 c 2 + c 1 μ 1 θ 1 α 1 + μ 1 + μ 1 θ 1 α 1 + μ 1 θ 1 β α 1 + β α 1 + μ 1 + 1 ( ( q 1 1 ) c 1 + q 2 c 2 ) + α 1 θ 1 α 1 + μ 1 θ 1 β α 1 + β α 1 + μ 1 + 1 ( q 1 c 1 + q 2 c 2 ) + ( 1 λ 1 λ 2 θ 1 ) θ 1 β α 1 + β α 1 + μ 1 + 1 ( q 1 c 1 + q 2 c 2 ) θ 2 β α 2 + β α 2 + μ 2 λ 1 c 1 λ 2 c 2 + c 1 μ 1 θ 1 α 1 + μ 1 + μ 2 θ 2 α 2 + μ 2 θ 1 β α 1 + β α 1 + μ 1 + 1 ( q 1 c 1 + ( q 2 1 ) c 2 ) + α 2 θ 2 α 2 + μ 2 θ 1 β α 1 + β α 1 + μ 1 + 1 ( q 1 c 1 + q 2 c 2 ) + ( 1 λ 1 λ 2 θ 2 ) θ 1 β α 1 + β α 1 + μ 1 + 1 ( q 1 c 1 + q 2 c 2 ) .
After some simple algebra we get from (28) the inequality
θ 1 β α 1 + β α 1 + μ 1 λ 1 c 1 λ 2 c 2 + c 2 μ 2 θ 2 α 2 + μ 2 c 1 μ 1 θ 1 α 1 + μ 1 θ 2 β α 2 + β α 2 + μ 2 λ 1 c 1 λ 2 c 2 + c 1 μ 1 θ 1 α 1 + μ 1 c 2 μ 2 θ 2 α 2 + μ 2
or by means of the variable m i it is of the form
α 1 + β m 1 μ 1 β λ 1 c 1 λ 2 c 2 + c 2 m 2 c 1 m 1 α 2 + β m 2 μ 2 β λ 1 c 1 λ 2 c 2 + c 1 m 1 c 2 m 2 .
The inequality (29) holds due to the assumptions for the control action 1, i.e. c 2 m 2 c 1 m 1 and α 1 + β m 1 μ 1 α 2 + β m 2 μ 2 and the inequality
λ 1 m 1 + λ 2 m 2 = λ 1 α 1 + μ 1 μ 1 θ 1 + λ 2 α 2 + μ 2 μ 2 θ 2 < 1
obtained directly from the stability condition (6). In fact, the expressions in brackets of the inequality (29) can be rewritten respectively as
c 2 m 2 ( 1 λ 2 m 2 ) c 1 m 1 λ 1 m 1 and c 1 m 1 ( 1 λ 1 m 1 ) c 2 m 2 λ 2 m 2 0 .
The second expression in (30) is obviously non-negative, due to conditions c 1 m 1 c 2 m 2 and 1 λ 1 m 1 > λ 2 m 2 . If
c 2 m 2 ( 1 λ 2 m 2 ) c 1 m 1 λ 1 m 1 0 ,
then (29) is true. If
c 2 m 2 ( 1 λ 2 m 2 ) c 1 m 1 λ 1 m 1 0 ,
then
c 2 m 2 ( 1 λ 2 m 2 ) c 1 m 1 λ 1 m 1 c 1 m 1 ( 1 λ 1 m 1 ) + c 2 m 2 λ 2 m 2 = c 2 m 2 c 1 m 1 0
that confirms the validity of the inequality (29).
The inequality (25) for q 1 = 1 can be proved using the same technique as before. Indeed, the inequality (26) is converted to
ν θ 1 β α 1 + β α 1 + μ 1 ( c 1 + q 2 c 2 ) + μ 1 θ 1 α 1 + μ 1 [ θ 2 β α 2 + β α 2 + μ 2 + 1 q 2 c 2 + λ 1 v n ( 1 , q 2 , 0 ) + λ 2 v n ( 0 , q 2 + 1 , 0 ) + μ 2 θ 2 α 2 + μ 2 v n ( 0 , q 2 1 , 0 ) + α 2 θ 2 α 2 + μ 2 + ( 1 λ 1 λ 2 θ 2 ) v n ( 0 , q 2 , 0 ) ] + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) [ θ 1 β α 1 + β α 1 + μ 1 + 1 × × ( c 1 + q 2 c 2 ) + λ 1 v n ( 2 , q 2 , 0 ) + λ 2 v n ( 1 , q 2 + 1 , 0 ) + μ 1 θ 1 α 1 + μ 1 v n ( 0 , q 2 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v n ( 1 , q 2 , 0 ) ] ν θ 2 β α 2 + β α 2 + μ 2 ( c 1 + q 2 c 2 ) + μ 2 θ 2 α 2 + μ 2 [ θ 1 β α 1 + β α 1 + μ 1 + 1 × × ( c 1 + ( q 2 1 ) c 2 ) + λ 1 v n ( 2 , q 2 1 , 0 ) + λ 2 v n ( 1 , q 2 , 0 ) + μ 1 θ 1 α 1 + μ 1 v n ( 0 , q 2 1 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v n ( 1 , q 2 1 , 0 ) ] + α 2 θ 2 α 2 + μ 2 + ( 1 λ 1 λ 2 θ 2 ) [ θ 2 β α 2 + β α 2 + μ 2 + 1 × × ( c 1 + q 2 c 2 ) + λ 1 v n ( 2 , q 2 , 0 ) + λ 2 v n ( 1 , q 2 + 1 , 0 ) + μ 1 θ 1 α 1 + μ 1 v n ( 0 , q 2 , 0 ) + α 1 θ 1 α 1 + μ 1 + ( 1 λ 1 λ 2 θ 1 ) v n ( 1 , q 2 , 0 ) ] .
By further comparing the terms of the corresponding inequality for the parameters of the system, we obtain inequality (29) using the proof by induction as for the case q 2 2 . And finally we note that the inequalities in (15) automatically lead to the c μ -rule defined in (7), but this doesn’t always hold true vice - versa. □
Conjecture 1.
We expect that the policy defined by (15) is also optimal for the system where a constraint on arrival is omitted. This can be explained by the fact that the proportion of the class-i customers arrived to the system during the time when the server was idle will also be maintained when customers arrive in states where the server is busy or in a failed state. Therefore, the incentive to service the customer of a certain class remains the same and hence the policy (15) seems to be valid for the original queueing system with arrivals.
Example 1.
Consider the system with fixed parameters ( λ 1 , α 1 , β , c 1 ) = ( 0.13 , 0.20 , 5 , 1.00 ) and six cases of varied parameters given in Table 1. The last two columns of the table represents the values of the average cost g evaluated using a simulation technique for the policy with f ( q 1 , q 2 , 0 ) = 1 and f ( q 1 , q 2 , 0 ) = 2 , q 1 , q 2 1 .
The inequalities from (15) are:
Case 1 : c 2 μ ˜ 1 = 2.098 < 2.407 = c 1 μ ˜ 2 f = 1 , c 2 m 1 = 1.890 < 2.067 = c 1 m 2 , α 1 + β m 1 μ 1 = 0.550 < 0.823 = α 2 + β m 2 μ 2 ; Case 2 : c 2 μ ˜ 1 = 3.387 > 2.923 = c 1 μ ˜ 2 f = 2 , c 2 m 1 = 2.971 > 2.583 = c 1 m 2 , α 1 + β m 1 μ 1 = 0.700 > 0.658 = α 2 + β m 2 μ 2 ; Case 3 : c 2 μ ˜ 1 = 2.779 = 2.779 = c 1 μ ˜ 2 f = 1 2 , c 2 m 1 = 2.438 = 2.438 = c 1 m 2 , α 1 + β m 1 μ 1 = 0.700 = 0.700 = α 2 + β m 2 μ 2 ; Case 4 : c 2 μ ˜ 1 = 2.438 > 2.244 = c 1 μ ˜ 2 f = 2 , c 2 m 1 = 2.091 > 2.040 = c 1 m 2 , α 1 + β m 1 μ 1 = 0.828 > 0.500 = α 2 + β m 2 μ 2 ; Case 5 : c 2 μ ˜ 1 = 2.194 < 2.244 = c 1 μ ˜ 2 f = 1 , c 2 m 1 = 1.882 < 2.040 = c 1 m 2 , α 1 + β m 1 μ 1 = 0.829 > 0.500 = α 2 + β m 2 μ 2 ; Case 6 : c 2 μ ˜ 1 = 2.519 > 2.407 = c 1 μ ˜ 2 f = 2 , c 2 m 1 = 2.269 > 2.067 = c 1 m 2 , α 1 + β m 1 μ 1 = 0.550 < 0.823 = α 2 + β m 2 μ 2 .
In Case 3 we get equivalent policies, the simulation results for the average cost are very similar. In Cases 5 and 6 the relations are converse, but the optimal policy, as expected, still follows the rule ( ) .
For the system with finite buffer capacities the optimal allocation policy can have in general another structure then the c μ -rule due to the influence of boundary states. In the next example we illustrate such a result.
Example 2.
Consider the system with parameters of Case 4 from Example 1 and finite buffer capacity for both of queues N 1 = N 2 = 20 . The state-dependent optimal control actions f ( q 1 , q 2 , 0 ) evaluated by a dynamic-programming approach are summarized as a matrix represented in Table 2. The columns describe the number of customers in a queue 1 and the rows – in a queue 2. It can be seen that the optimal policy is not a static any more.
The optimal average cost here is g = 9.980 . The average cost for the policy f ( q 1 , q 2 , 0 ) = 2 , q 1 , q 2 1 , is equal to g = 11.591 and for the policy f ( q 1 , q 2 , 0 ) = 1 , q 1 , q 2 1 , the average cost takes lower value g = 10.077 . This results for the optimal policy differ from those obtained for higher buffer capacities. As N 1 and N 2 increase, the boundary between areas 1 and 2 in a control matrix shifts right. In infinite buffer case, the optimal policy is defined exclusively by actions f ( q 1 , q 2 , 0 ) = 2 , q 1 , q 2 1 , with the average cost g = 22.955 , while the alternative policy f ( q 1 , q 2 , 0 ) = 1 , q 1 , q 2 1 , leads now to the higher average cost g = 25.430 .

4. Conclusion

In this paper we have analyzed the optimal routing problem for the unreliable single-server two-class queueing system with constant retrial rates. We derived conditions for the optimality of a static policy to serve the customers from a certain queue. The system without new arrivals can be treated as a ordinary multi-class system with a generally distributed service time. For the system with new arrivals a c μ -rule cannot be used directly. We have provided a dynamic programming approach to find explicit conditions when static control policies are guaranteed to be optimal.

References

  1. Avrachenkov, K.; Morozov, E.; Nekrasova, R.; Steyaert, B. Stability analysis and simulation of n-class retrial system with constant retrial rates and poisson inputs. Asia-Pacific J. Oper. Res. 2014, 31. [Google Scholar] [CrossRef]
  2. Baras, J.S.; Dorsey, A.J.; Makowski, A.M. Two competing queues with linear costs and geometric service requirements: The μc-rule is often optimal. Advances in Applied Probability 1985, 17, 186–209. [Google Scholar] [CrossRef]
  3. Buyukkoc, C.; Varaiya, P.; Walrand, J. The cμ rule revisited. Advances in Applied Probability 1985, 17(1), 237–238. [Google Scholar] [CrossRef]
  4. Cox, D.R.; Smith, W.L. Queues; Methuen: London, 1961. [Google Scholar]
  5. Dimitriou, I. A two-class queueing system with constant retrial policy and general class dependent service times. Eur. J. Oper. Res. 2018, 270, 1063–1073. [Google Scholar] [CrossRef]
  6. Economou, A.; Kanta, S. Equilibrium customer strategies and social-profit maximization in the single-server constant retrial queue. Nav. Res. Logist. (NRL) 2011, 58, 107–122. [Google Scholar] [CrossRef]
  7. Efrosinin, D.; Winkler, A. Queueing system with a constant retrial rate, non-reliable server and threshold-based recovery. Eur. J. Oper. Res. 2011, 210, 594–605. [Google Scholar] [CrossRef]
  8. Fayolle, G. A simple telephone exchange with delayed feedbacks. In Proceedings of the international seminar on Teletraffic analysis and computer performance evaluation, Amsterdam, The Netherlands; 1986; pp. 245–253. [Google Scholar]
  9. Howard, R.A. Dynamic programming and Markov processes (technology press research monographs) (), 1st ed.; MIT Press: Cambridge, 1960. [Google Scholar]
  10. Kebarighotbi, A.; Cassandras, C.G. Revisiting the optimality of the cμ-rule with Stochastic Flow Models. In Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference; 2009. [Google Scholar]
  11. Krishnasamy, S.; Arapostathis, A.; Johari, R.; Shakkottai, S. On Learning the cμ Rule in Single and Parallel Server Networks. 2018. [Google Scholar] [CrossRef]
  12. Heyde, C.C.; Neuts, M.F. Matrix-Geometric Solutions in Stochastic Models. An Algorithmic Approach. J. Am. Stat. Assoc. 1982, 77, 690. [Google Scholar] [CrossRef]
  13. Özkan, E.; Kharoufeh, J. Optimal control of a two-server queueing system with failures. Probability in the Engineering and Informational Sciences 2014, 28, 489–527. [Google Scholar] [CrossRef]
  14. Puterman, M.L. Markov decision processes: discrete stochastic dynamic programming; Wiley-Interscience: New York, 1994. [Google Scholar]
  15. Sennott, L.T. Stochastic dynamic programming and the control of queueing systems. Wiley series in probability and statistics: applied probability and statistics; Wiley: Wiley.
  16. Van Mieghem, J.A. Dynamic scheduling with convex delay costs: The generalized cμ-rule. The Annals of Applied Probability 1995, 5, 809–833. [Google Scholar] [CrossRef]
  17. Wang, J.; Zhang, X.; Huang, P. Strategic behavior and social optimization in a constant retrial queue with the N -policy. Eur. J. Oper. Res. 2016, 256, 841–849. [Google Scholar] [CrossRef]
  18. Winkler, A. Dynamic scheduling of a single-server two-class queue with constant retrial policy. Ann. Oper. Res. 2011, 202, 197–210. [Google Scholar] [CrossRef]
Figure 1. Schema of the single server two-class controllable queueing system
Figure 1. Schema of the single server two-class controllable queueing system
Preprints 82087 g001
Table 1. Simulation results
Table 1. Simulation results
λ 2 μ 1 μ 2 θ 1 θ 2 α 2 c 2 g f = 1 g f = 2
0.30 5 3 0.55 0.5 0.10 1.00 25.657 29.403
0.26 5 3 0.70 0.40 0.10 2.00 36.909 34.238
0.26 5 3 0.70 0.43 0.12 1.64 18.223 18.291
0.30 3 5 0.51 0.5 0.10 1.00 25.430 22.955
0.30 3 5 0.51 0.5 0.10 0.90 23.421 24.709
0.30 5 3 0.55 0.5 0.10 1.20 33.348 31.610
Table 2. Matrix of optimal control actions f ( q 1 , q 2 , 0 )
Table 2. Matrix of optimal control actions f ( q 1 , q 2 , 0 )
q 2 \ q 1 0 1 2 3 4 5 6 7 8 9 10
0 0 2 2 2 2 2 2 2 2 2 2 ...
1 1 2 2 2 2 2 2 2 2 2 2 ...
2 1 2 2 2 2 2 2 2 2 2 2 ...
3 1 2 2 2 2 2 2 2 2 2 1 ...
4 1 2 2 2 2 2 2 2 2 1 1 ...
5 1 2 2 2 2 2 2 1 1 1 1 ...
6 1 2 2 2 2 1 1 1 1 1 1 ...
7 1 2 2 2 1 1 1 1 1 1 1 ...
8 1 2 1 1 1 1 1 1 1 1 1 ...
9 1 1 1 1 1 1 1 1 1 1 1 ...
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated