This section presents the basic foundations of the cooperative cruise control theory and DMPC as a bargaining game.
2.1. Cooperative Cruise Control
The cooperative cruise control problem has been extensively studied in the latest years. Recent developments have focused on wireless Vehicle-to-Vehicle (V2V) communication that has grown commercially. The wireless communication led to the definition of the Grand Cooperative Driving Challenge (GCDC) to manage a platoon of vehicles that have this technology. The main objective of cooperative cruise control is synchronizing the vehicles on the road with the traffic profile considered by an established reference. The traffic profile is commonly constituted by the inter-vehicle distance and the speed on a highway, reducing the time to transit in a highway and the fuel consumption. Communication is usually considered by the predecessor vehicle as a string stability case.
A simple description of the CACC setting (for longitudinal dynamics) is considered. In this case, each vehicle is modeled through its physical and mechanical parameters. The dynamics proposed in this case are linear as
where variables are speed
v and position
p, respectively. The parameters
and
are transmission parameters,
b is related to transmission efficiency, and
has the dimension of acceleration, or the force when it is multiplied by the vehicle mass. The term
is associated with an input uncertainty. These parameters can be included in the dynamics due to the approximation by an invertible steady-stable time-invariant model without the presence of uncertainties [
29]. This approach considerably reduces the complexity of the model without losing performance, and it has been used as an approximation in previous theories to validate this type of problem [
30].
The controller must be able to regulate the speed of each vehicle and maintain a distance from its neighbors. The graphic representation of the vehicle’s platoon is shown in
Figure 1, where the distance between each vehicle is defined as
, that is, the difference of the position of vehicles 1 and 2 with its respective subscripts
and
.
Let the states of the system be
, where the distance is position minus a predefined distance, and the control input
with a possibly input matched uncertainty
. It is possible to write the system as a linear affine continuous model of the form
with
This model considers the acceleration of neighboring vehicles in a
instant. For implementation control prediction, it is necessary to use the discrete dynamics of the system. Therefore, (
3) is modified as
with
and
.
Note that onboard sensors for controller action measure distance and speed, and the parameters of each vehicle can be different for a heterogeneous case. In the same way, it is important to consider that for the managed approximations of the cooperative cruise control theory, the considered model relates the position, inter-vehicular distance, and the speed of the vehicles in the network. In this particular application, a constant acceleration in an instant of time for each agent is the input for the developments.
2.2. Distributed Model Predictive Control with Bargaining Games
This section introduces the basic concepts of MPC from a bargaining game perspective. The control problem is contextualized as the negotiation method for solving a distributed optimization problem. The block diagram representing the framework is presented in
Figure 2, where it is observed that each vehicle, through its dynamics, enters a bargaining algorithm along a prediction horizon. This algorithm generates the optimal control action
and the disagreement point
. This block diagram is used for non-symmetric bargaining cases. For symmetric cases, the characteristics of each vehicle are the same. All cases are regulated by regulatory aspects or physical restrictions of the vehicles to be considered.
Definition 1. Bargaining Game A bargaining game is mathematically defined as the tuple .
In this case,
N is the number of vehicles involved in the process,
is the decision space of the control law, and
is defined as the local cost function of each vehicle. It is assumed that the vehicles are in a negotiating position to achieve a common objective, such as Nash’s notions [
31]. In the game, if it is impossible to reach an agreement, the term disagreement point is used for the bargain between vehicles [
32].
Assuming the dynamics of each vehicle as in (
3) and with its discrete representation (
5), the particular objective is to achieve energy-level optimization in each vehicle’s operation. For this optimization problem, a distributed locally cost function is defined as
with
as the representation of vehicle
i states built along the prediction horizon
, likewise
considering the control horizon
with
. Each cost function
is defined as
which is positive defined, convex, and where
and
are weighted positive definite matrices, i.e.,
. This cost function, by taking a conventional quadratic form, does not become the main contribution of the paper, which focuses on the application of this application in the predictive control algorithm.
For the control problem formulation, defining a decision space
for the whole system according to the physical operating conditions is important. An MPC problem with communication between agents is interpreted as a bargain so that it can be a bargaining game. For the analysis and solution of this type of game, Nash proposes an axiomatic methodology [
33], which was used in continuous and static systems [
32].
A continuous representation for a bargaining game is with the tuple , where S is the game decision space, which is a non-empty closed subset of and is the interaction disagreement point. For implementation purposes, it is important to consider the discrete dynamics of the game, so it is then defined as with , a closed non-empty subset that contains the profit functions values of each vehicle. The values of the states , the set , and the point of disagreement vary dynamically.
The evolution value of the disagreement point varies as
with
as an adjustment constant according to the definition of the axioms of the negotiation processes raised from the work of John Nash [
34]. In this case, if a vehicle decides to cooperate on the road, the disagreement point is reduced with a
factor, otherwise, it is increased by a
factor.
Definition 2. Utopia point is defined as the utopia point available for the vehicle i as ,
In this case,
is defined as the union of the cost functions
of the game, where then the discrete game can be interpreted as
. Notice that the analysis of a bargaining game can be carried out symmetrical for a game with similar characteristics between its players or non-symmetrical for a game where these characteristics differ, i.e., synchronization of oscillator systems with homogeneous characteristics or control of mechanical systems with heterogeneous physical characteristics. For the solution of a bargaining game, a non-symmetrical centralized scenario is proposed based on [
35] as
with
as a weight variable usually defined as
, with
N as the number of vehicles involved in the process. However, for a distributed control analysis, the solution to the optimization problem is proposed as
with
a distributed cost function usually defined quadratic and
the set of the remaining vehicles control actions except for the agent
i.
The optimization problem (
12) differs from problem (
9) in the sense that it considers
fixed and only optimizes as a function of
, this means that optimization does not involve the decisions of the entire network cooperatively. The solution to this problem then arises as a negotiation model that depends on the context given by the cooperative cruise control theory. This methodology does not use iterative solutions as others commonly used in distributed optimization problems [
36], which reduces the computational cost in operation with great benefits in high-impact applications such as vehicle platoon. Bargaining methodology allows the solution to a distributed control problem by solving only one local optimization with the information collected by its neighbors and achieving an agreement based on the Nash equilibrium concept of bargaining through the defined disagreement point. In summary, the objective is to apply a distributed control methodology for vehicles network on a highway. Based on the communication of their states, a negotiation can be interpreted as the solution to an optimization problem (
12). It is important to note that the communication and the parameters sent need to be available at any time. The lack of information can hinder the ability to reach an optimal or efficient agreement.