Optimal Control of Discrete Time-varying System with Multiple Delays and Multiplicative Noises

Preprint

Article

Optimal Control of Discrete Time-varying System with Multiple Delays and Multiplicative Noises

Altmetrics

Downloads

121

Views

Comments

This version is not peer-reviewed

Submitted:

16 May 2023

Posted:

18 May 2023

You are already at the latest version

Alerts

Abstract

This paper is concerned with the optimal linear quadratic Gaussian (LQG) control problem for discrete time-varying system with multiple input delays and multiplicative noises. The main contributions are two-fold. Firstly, when the state variables can be observed exactly, we obtain a necessary and suffcient condition for the multiple-delays system in terms of the non-homogeneous relationship between the state and costate, which is the solution to the coupled forward and backward stochastic difference equations. Secondly, when the state variables are partially observed, we derive a suboptimal linear output feedback controller for the discrete-time system based on the obtained results of the optimal LQG control. Numerical examples are shown to illustrate the proposed algorithm.

Keywords:

Subject: Computer Science and Mathematics - Applied Mathematics

1. Introduction

Linear quadratic Gaussian (LQG) control problem stems from the optimal stochastic control theory of the systems with additive Gaussian white noises and state/control-dependent, which combines the concept of linear quadratic regulators for full state feedback and Kalman filters for state estimation [1,2,3]. Recently, the optimal LQG control has been applied in various fields, such as the robots of power substation, all-electric vehicles, electrical safety engineering networked control systems (NCSs) [5,6,7,9]. Specificly, for mobile monitoring robot in a ultrahigh-voltage power substation, the LQG conotrol is proposed to minimize the difference between the actual SNIR and its expectation and the change in transmitting power[4]. These motivate us to study the more complicated LQG control systems with multiple input delays and multiplicative noises.

It is generally known that random time delay and packet dropout always occur in the data transmission of NCSs. Many literatures have been investigated on LQG control problems with input delays and packet loss [8,10,11,12,13,14]. Basin [8] presented an optimal linear regulator (LQR) with input delay by using the duality principle. Cacace[10] studied the LQG problems for linear system with single input delay. Matni[11] presented an explicit solution to a two-player distributed LQG problem in which communication between controllers occurs across a communication link with varying delay. Basin[12] further established a necessary and sufficient condition of the optimal LQR control for the linear system with multiple input delays. Zhang[14] studied the classical LQR problem with multiple input delays for both continuous-time and discrete-time cases.

On the other hand, packet dropout is generally described as the multiplicative noises. Many references have focused on the LQG system with multiplicative noises [15,16,17]. Gupta[15] solved the optimal LQG problem with packet-dropping links by decomposing the problem into a standard LQR state-feedback controller designing. Liang[16] studied the optimal control and stabilization problems for NCSs with remote controller and local controller subject to packet dropout. For systems with both input delay and packet dropout, Liang[18] presented the optimal LQR controller, and derived the necessary and sufficient condition for the mean-square stabilization. Liang[19] considered the discrete-time LQG system with input delay and multiplicative noises, and obtained both optimal state feedback controller and suboptimal output feedback controller.

The aforementioned literatures are mainly focused on single delay and packet dropout. To our best knowledge, little progress has been made on the optimal LQG control for time-varying systems with multiple input delays and multiplicative noises.

Motivated by the work of [18,19,20], this paper studies the optimal LQG control for discrete time-varying system involving with multiple delays and multiplicative noises. The main contributions of this paper are summarized as follows: 1) When the state variables can be observed exactly, by introducing the stochastic maximum principle for system with multiple delays and multiplicative noises, a solution to the forward backward differential equations (FBSDEs) is obtained based on the coupled Riccati equations. 2) In terms of the solution to the FBSDEs, a necessary and sufficient condition is given for the optimal LQG control. 3) When the state variables are partially observed, we derive a suboptimal linear output feedback controller by linearizing the optimal estimator and neglecting higher order terms.

The rest of the paper is organized as follows. In Section 2, we give the results of optimal state feedback control problem. In Section 3, we derive a suboptimal linear output feedback controller for the LQG systems involving multiple input delays and multiplicative noises. Numerical examples are provided in Section 4. Conclusions are given in Section 5.

Notation:

R^{n}

denotes the n-dimensional real Euclidean space. I presents the unit matrix of appropriate dimension. The superscript

^{'}

denotes the transpose of the matrix.

{Ω, F, P, {F_{k}}_{k \geq 0}}

denotes a complete probability space on which random variable

w_{k}

are defined such that

{F_{k}}_{k \geq 0}

is the natural filtration generated by

w_{k}

and

ν_{k}

, i.e.,

F_{k} = σ {w_{0}, \dots, w_{k}, ν_{0}, \dots, ν_{k}}

, augmented by all the

P -

null sets in

F

. A symmetric

A > 0 (\geq 0)

means that it is a positive definite (positive semi-definite) matrix.

T r (A)

represents the trace of matrix A.

2. State Feedback Controller

When the state variable

x_{k}

can be observed exactly, we consider the following discrete time-varying LQG system with multiple input delays and multiplicative noises

\begin{matrix} x_{k + 1} & = [C (k) + ν_{k} \bar{C} (k)] x_{k} + [D_{0} (k) + ν_{k} {\bar{D}}_{0} (k)] u_{k} \\ + [D_{d} (k) + ν_{k} {\bar{D}}_{d} (k)] u_{k - d} + w_{k}, \end{matrix}

(1)

where

x_{k} \in R^{n}

is the state,

u_{k} \in R^{m}

is the input control with the delay

d > 0

ν_{k}

is the scalar white noise with zero mean and variance

ϕ^{2}

w_{k} \in R^{n}

is the random variables satisfying

E [w_{k} | F_{k - 1}] = {\bar{w}}_{k}

and

E [w_{k} w_{k}^{'}] = Q_{w_{k}}

C (k), \bar{C} (k), D_{i} (k)

and

{\bar{D}}_{i} (k)

with

i = 0, d

are coefficient matrices with compatible dimensions.

ν_{k}

and

w_{k}

are correlated with

E [v_{k} w_{k}^{'} | F_{k - 1}] = ρ

E [v_{k} w_{l}^{'}] = 0

k \neq l

. The initial state

x_{0}

u_{i}

for

i = - d, \dots, - 1

are known.

The associated cost function for system (1) is given by

\begin{matrix} J_{N} = E \{\sum_{k = 0}^{N} {x_{k}}^{'} Q_{k} x_{k} + {u_{k}}^{'} R_{k} u_{k} + {x_{N + 1}}^{'} P_{N + 1} x_{N + 1}\}, \end{matrix}

(2)

where

Q_{k}

and

P_{N + 1}

are positive semi-definite constant matrices with appropriate dimensions, control cost matrix

R_{k}

should be positive definite matrix, and N is the horizon length.

Problem 1.

Find the unique

F_{k - 1}

-measurable state feedback controller

u_{k}

, for

k = 0, \dots, N

, to minimize (2) subject to (1).

For simplicity, we make the following definitions

\begin{matrix} C_{k} (k) = C (k) + ν_{k} \bar{C} (k), D_{k}^{i} (k) = D_{i} (k) + ν_{k} {\bar{D}}_{i} (k), \end{matrix}

for

i = 0, d

. Then the system (1) becomes

\begin{matrix} x_{k + 1} = C_{k} (k) x_{k} + D_{k}^{0} (k) u_{k} + D_{k}^{d} (k) u_{k - d} + w_{k} . \end{matrix}

(3)

Following the similar discussion of [19], in virtue of the Pontryagin’s maximum principle for (3) and (2), we have

\begin{matrix} ζ_{N} = & P_{N + 1} x_{N + 1}, \end{matrix}

(4)

\begin{matrix} ζ_{k - 1} = & E [C_{k}^{'} (k) ζ_{k} | F_{k - 1}] + Q_{k} x_{k}, \end{matrix}

(5)

\begin{matrix} 0 = & E [{(D_{k}^{0} (k))}^{'} ζ_{k} + {(D_{k + d}^{d} (k + d))}^{'} ζ_{k + d} | F_{k - 1}] + R_{k} u_{k}, \end{matrix}

(6)

for

k = 0, \dots, N

, where

ζ_{k}

is the costate with

ζ_{k} = 0

for

k > N

For further study, the following coupled Riccati difference equations are given:

\begin{matrix} P_{k} = & C^{'} (k) P_{k + 1} C (k) + ϕ^{2} {\bar{C}}^{'} (k) P_{k + 1} \bar{C} (k) \\ - M_{k}^{'} Ω_{k}^{- 1} M_{k} + Q_{k}, \end{matrix}

(7)

where

\begin{matrix} Ω_{k} & = R_{k} + D_{0}^{'} (k) P_{k + 1} D_{0} (k) + ϕ^{2} {\bar{D}}_{0}^{'} (k) P_{k + 1} {\bar{D}}_{0} (k) \\ + D_{d}^{'} (k + d) P_{k + d + 1} D_{d} (k + d) + ϕ^{2} {\bar{D}}_{d}^{'} (k + d) \\ \times P_{k + d + 1} {\bar{D}}_{d} (k + d) + D_{0}^{'} (k) P_{k + 1}^{d - 1} + {(P_{k + 1}^{d - 1})}^{'} \\ \times D_{0} (k) - \sum_{i = 1}^{d} {(M_{k + i}^{d - i})}^{'} Ω_{k + i}^{- 1} M_{k + i}^{d - i}, \end{matrix}

(8)

\begin{matrix} M_{k} & = D_{0}^{'} (k) P_{k + 1} C (k) + ϕ^{2} {\bar{D}}_{0}^{'} (k) P_{k + 1} \bar{C} (k) \\ + {(P_{k + 1}^{d - 1})}^{'} C (k), \end{matrix}

(9)

with

\begin{matrix} M_{k}^{0} & = D_{0}^{'} (k) P_{k + 1} D_{d} (k) + ϕ^{2} {\bar{D}}_{0}^{'} (k) P_{k + 1} {\bar{D}}_{d} (k) \\ + {(P_{k + 1}^{d - 1})}^{'} D_{d} (k), \end{matrix}

(10)

\begin{matrix} M_{k}^{j} & = D_{0}^{'} (k) P_{k + 1}^{j - 1} + {(P_{k + j + 1}^{d - j - 1})}^{'} D_{d} (k + j) \\ - \sum_{i = 1}^{j} {(M_{k + i}^{d - i})}^{'} Ω_{k + i}^{- 1} M_{k + i}^{j - i}, \end{matrix}

(11)

\begin{matrix} P_{k}^{0} & = C^{'} (k) P_{k + 1} D_{d} (k) + ϕ^{2} {\bar{C}}^{'} (k) P_{k + 1} {\bar{D}}_{d} (k) \\ - M_{k}^{'} Ω_{k}^{- 1} M_{k}^{0}, \end{matrix}

(12)

\begin{matrix} P_{k}^{j} & = C^{'} (k) P_{k + 1}^{j - 1} - M_{k}^{'} Ω_{k}^{- 1} M_{k}^{j}, j = 1, \dots, d - 1 . \end{matrix}

(13)

The terminal values are given by

\begin{matrix} P_{N + 1}, P_{N + i + 1} = 0, P_{N + i}^{j} = 0, \\ M_{N + i}^{j} = 0, Ω_{N + i} = I, i \geq 1, j = 0, \dots, d - 1 . \end{matrix}

(14)

Remark 1.

As can be seen that the costate equations (4)-(6) are quite different from those of Liang [19] and Zhang [20]. What’s more, the coupled Riccati equations (7)-(13) are more complicated than those in Liang [19] and Zhang [20].

It is stressed that the key to solve the optimal LQG control problem is to obtain the solution to the FBSDEs (3) and (4)-(6). We now show the solution to the FBSDEs in the following lemma.

Lemma 1.

Supposing that

Ω_{k}

are positive definite for

k = 0, \dots, N

, the following equation

\begin{matrix} ζ_{k - 1} = P_{k} x_{k} + \sum_{j = 0}^{d - 1} P_{k}^{j} u_{j + k - d} + Φ_{k}, \end{matrix}

(15)

is the solution to FBSDEs (3) and (4)-(6), with

\begin{matrix} Φ_{k} = & C^{'} (k) (P_{k + 1} {\bar{w}}_{k} - M_{k}^{'} Ω_{k}^{- 1} Σ_{k} + Φ_{k + 1}) + {\bar{C}}^{'} (k) P_{k + 1} ρ, \\ Σ_{k} = & D_{0}^{'} (k) (P_{k + 1} {\bar{w}}_{k} + Φ_{k + 1}) + {\bar{D}}_{0}^{'} (k) P_{k + 1} ρ + D_{d}^{'} (k) \\ \times (P_{k + d + 1} {\bar{w}}_{k + d} + Φ_{k + d + 1}) + {\bar{D}}_{d}^{'} (k) P_{k + d + 1} ρ \end{matrix}

(16)

\begin{matrix} + \sum_{j = 0}^{d - 1} {(P_{k + j + 1}^{d - j - 1})}^{'} {\bar{w}}_{k + j} - \sum_{i = 0}^{d} {(M_{k + i}^{d - i})}^{'} Ω_{k + i}^{- 1} Σ_{k + i}, \end{matrix}

(17)

where

Φ_{k + 1} = 0

and

Σ_{k + 1} = 0

for

k \geq N

. Besides,

P_{k}

P_{k}^{j}

satisfy the coupled equations (7), (12), (13).

Proof.

The proof of Lemma 1 is put into Appendix A. □

Now we are ready to present the solution to Problem 1.

Theorem 1.

There exists the unique

F_{k - 1}

-measurable

u_{k}

for Problem 1 if and only if

Ω_{k}

, for

k = 0, \dots, N

, are positive definite. In this case, the optimal controller

u_{k}

is given by

\begin{matrix} u_{k} = - Ω_{k}^{- 1} M_{k} x_{k} - Ω_{k}^{- 1} \sum_{j = 0}^{d - 1} M_{k}^{j} u_{j + k - d} - Ω_{k}^{- 1} Σ_{k} . \end{matrix}

(18)

The associated optimal performance index is as

\begin{matrix} J_{N}^{*} = & x_{0}^{'} P_{0} x_{0} + 2 x_{0}^{'} \sum_{j = 0}^{d - 1} P_{0}^{j} u_{j - d} + \sum_{j = 0}^{d - 1} u_{j - d}^{'} (D_{d}^{'} (j) P_{j + 1} D_{d} (j) \\ + ϕ^{2} {\bar{D}}_{d}^{'} (j) P_{j + 1} {\bar{D}}_{d} (j)) u_{j - d} + 2 \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} u_{j - d}^{'} D_{d}^{'} (j) \\ \times P_{j + 1}^{i - j - 1} u_{i - d} - \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} \sum_{m = 0}^{d - 1} u_{j - d}^{'} {(M_{m}^{j - m})}^{'} Ω_{m}^{- 1} M_{m}^{i - m} \\ \times u_{i - d} + 2 x_{0}^{'} Φ_{0} + 2 \sum_{k = 0}^{N} {\bar{w}}_{k}^{'} Φ_{k + 1} - \sum_{k = 0}^{N} Σ_{k}^{'} Ω_{k}^{- 1} Σ_{k} \\ + \sum_{k = 0}^{N} T r [P_{k + 1} Q_{w_{k}}], \end{matrix}

(19)

where

Ω_{k}

M_{k}

M_{k}^{j}

P_{k}

P_{k}^{j}

Φ_{k}

Σ_{k}

satisfy the coupled equations (7)-(13),(16),(17) and

P_{k}^{j} = 0

M_{k}^{j} = 0

for

j < 0

Proof.

The proof of Theorem 1 is put into Appendix B. □

Remark 2.

We make the coefficients of the system (3) and the cost function (2) to be time-invariant. When there is no time delay in system (3), i.e.,

d = 0

, we have that

D_{d} = {\bar{D}}_{d} = 0

. Considering the noise-uncorrelated case with

{\bar{w}}_{k} = 0

, it is obviously obtained that the coupled equations (10) and (12) can be rewritten as

\begin{matrix} M_{k}^{0} = 0, P_{k}^{0} = - M_{k}^{'} Ω_{k}^{- 1} M_{k}^{0} = 0 . \end{matrix}

Substituting

M_{k}^{0}

and

P_{k}^{0}

into (11) and (13), it can be derived that

M_{k}^{j} = 0, P_{k}^{j} = 0

for

j = 0, \dots, d - 1

. Then the difference equations (8) and (9) yield to

\begin{matrix} Ω_{k} & = R + D_{0}^{'} P_{k + 1} D_{0} + ϕ^{2} {\bar{D}}_{0}^{'} P_{k + 1} {\bar{D}}_{0}, \\ M_{k} & = D_{0}^{'} P_{k + 1} C + ϕ^{2} {\bar{D}}_{0}^{'} P_{k + 1} \bar{C} . \end{matrix}

The optimal controller reduces to

\begin{matrix} u_{k} = - Ω_{k}^{- 1} M_{k} x_{k}, \end{matrix}

which is exactly the result of Moore[2].

Remark 3.

When the system (3) is a time-invariant system, (3) can be rewritten as

\begin{matrix} x_{k + 1} = C (k) x_{k} + D_{0} (k) u_{k} + D_{d} (k) u_{k - d} + w_{k} \end{matrix}

with

C (k) = C + ν_{k} \bar{C}

D_{0} (k) = D_{0} + ν_{k} \bar{D_{0}}

D_{d} (k) = D_{d} + ν_{k} \bar{D_{d}}

. The performance index becomes

\begin{matrix} J_{N} = E \{\sum_{k = 0}^{N} {x_{k}}^{'} Q x_{k} + {u_{k}}^{'} R u_{k} + {x_{N + 1}}^{'} P_{N + 1} x_{N + 1}\} . \end{matrix}

By using the results of Theorem 1, the optimal time-invariant LQG controller yields that

\begin{matrix} u_{k} = - Ω_{k}^{- 1} M_{k} x_{k} - Ω_{k}^{- 1} \sum_{j = 0}^{d - 1} M_{k}^{j} u_{j + k - d} - Ω_{k}^{- 1} Σ_{k}, \end{matrix}

and the minimal cost function is as (19) where the coefficient matrices in

Ω_{k}

M_{k}

M_{k}^{j}

P_{k}

P_{k}^{j}

are time-invariant.

In view of obtaining the special case of optimal LQG control for system (3), now we shall show the results for the general system with multiple delays and multiplicative noises.

Consider the following general discrete time-varying system

\begin{matrix} x_{k + 1} & = C_{k} (k) x_{k} + \sum_{i = 0}^{d} D_{k}^{i} (k) u_{k - i} + w_{k}, \end{matrix}

(20)

and the cost function is as (2).

Problem 2.

Find the unique

F_{k - 1}

-measurable state feedback controller

u_{k}

, for

k = 0, \dots, N

, to minimize the cost function (2) subject to the system (20).

Combining the system (20) and the cost function (2), we apply the Pontryagin’s maximum principle to yield the following costate equations:

\begin{matrix} ζ_{N} = & P_{N + 1} x_{N + 1}, \end{matrix}

(21)

\begin{matrix} ζ_{k - 1} = & E [C_{k}^{'} (k) ζ_{k} | F_{k - 1}] + Q_{k} x_{k}, \end{matrix}

(22)

\begin{matrix} 0 = & E [\sum_{i = 0}^{d} {(D_{k + i}^{i} (k + i))}^{'} ζ_{k + i} | F_{k - 1}] + R_{k} u_{k}, \end{matrix}

(23)

with

i = 0, \dots, d

for

k = 0, \dots, N

, and

ζ_{k} = 0

for

k > N

We introduce the following coupled Riccati equations subject to the system with multiple deleys:

\begin{matrix} Ω_{k} = & R_{k} + \sum_{i = 0}^{d} (D_{i}^{'} (k + i) P_{k + i + 1} D_{i} (k + i) + ϕ^{2} {\bar{D}}_{i}^{'} (k + i) \\ \times P_{k + i + 1} {\bar{D}}_{i} (k + i)) + \sum_{i = 0}^{d - 1} D_{i}^{'} (k + i) P_{k + i + 1}^{d - i - 1} \\ + \sum_{i = 0}^{d - 1} {(P_{k + i + 1}^{d - i - 1})}^{'} D_{i} (k + i) - \sum_{i = 1}^{d} {(M_{k + i}^{d - i})}^{'} Ω_{k + i}^{- 1} M_{k + i}^{d - i}, \end{matrix}

(24)

\begin{matrix} M_{k}^{j} = & \sum_{i = 0}^{j} (D_{i}^{'} (k + i) P_{k + i + 1} D_{i - j + d} (k + i) + ϕ^{2} {\bar{D}}_{i}^{'} (k + i) \\ \times P_{k + i + 1} {\bar{D}}_{i - j + d} (k + i)) + \sum_{i = 0}^{j - 1} D_{i}^{'} (k + i) P_{k + i + 1}^{j - i - 1} \\ + \sum_{i = 0}^{j} {(P_{k + i + 1}^{d - i - 1})}^{'} D_{i - j + d} (k + i) - \sum_{i = 1}^{j} {(M_{k + i}^{d - i})}^{'} Ω_{k + i}^{- 1} M_{k + i}^{j - i}, \end{matrix}

(25)

\begin{matrix} P_{k}^{j} = & C^{'} (k) P_{k + 1} D_{d - j} (k) + ϕ^{2} {\bar{C}}^{'} (k) P_{k + 1} {\bar{D}}_{d - j} (k) \\ + C^{'} (k) P_{k + 1}^{j - 1} - M_{k}^{'} Ω_{k}^{- 1} M_{k}^{j}, \end{matrix}

(26)

for

j = 0, \dots, d - 1

, where the terminal value is as (14).

Now we give the main results for Problem 2 in the following theorem.

Theorem 2.

There exists the unique

F_{k - 1}

-measurable

u_{k}

for Problem 2 if and only if

Ω_{k}

, for

k = 0, \dots, N

, are positive definite. In this case, the optimal controller

u_{k}

is calculated by

\begin{matrix} u_{k} = - Ω_{k}^{- 1} M_{k} x_{k} - Ω_{k}^{- 1} \sum_{j = 0}^{d - 1} M_{k}^{j} u_{j + k - d} - Ω_{k}^{- 1} Σ_{k}, \end{matrix}

(27)

where

\begin{matrix} Σ_{k} = & \sum_{i = 0}^{d} [D_{i}^{'} (k + i) (P_{k + i + 1} {\bar{w}}_{k + i} + Φ_{k + i + 1}) + {\bar{D}}_{i}^{'} (k + i) P_{k + i + 1} \\ \times ρ] + \sum_{j = 0}^{d - 1} {(P_{k + j + 1}^{d - j - 1})}^{'} {\bar{w}}_{k + j} - \sum_{i = 1}^{d} {(M_{k + i}^{d - i})}^{'} Ω_{k + i}^{- 1} Σ_{k + i} \end{matrix}

and the optimal cost is as

\begin{matrix} J_{N}^{*} = & x_{0}^{'} P_{0} x_{0} + 2 x_{0}^{'} \sum_{j = 0}^{d - 1} P_{0}^{j} u_{j - d} + \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} \sum_{m = 0}^{d - 1} u_{j - d}^{'} [D_{m + d - j}^{'} (m) \\ \times P_{m + 1} D_{m + d - i} (m) + ϕ^{2} {\bar{D}}_{m + d - j}^{'} (m) P_{m + 1} {\bar{D}}_{m + d - i} (m) \\ + D_{m + d - j}^{'} (m) P_{m + 1}^{i - m - 1} + {(P_{m + 1}^{i - m - 1})}^{'} D_{m + d - i} (m) - {(M_{m}^{j - m})}^{'} \\ \times Ω_{m}^{- 1} M_{m}^{i - m}] u_{i - d} + \sum_{k = 0}^{d} T r [P_{k + 1} Q_{w_{k}}] 2 x_{0}^{'} Φ_{0} \\ + 2 \sum_{k = 0}^{N} {\bar{w}}_{k}^{'} Φ_{k + 1} - \sum_{k = 0}^{N} Σ_{k}^{'} Ω_{k}^{- 1} Σ_{k} \end{matrix}

(28)

where

D_{i} = 0

for

i > d

In addition, the relationship of the optimal costate

ζ_{k - 1}

and state

x_{k}

is as (16) in Lemma 1.

Proof.

The proof is similar to that of Theorem 1, and to save the space of the paper, we omit it here. □

3. Output Feedback Controller

When the state variable

x_{k}

are partially observed, we study the following discrete-time stochastic system:

\begin{matrix} x_{k + 1} & = [C (k) + ν_{k} \bar{C} (k)] x_{k} + [D_{0} (k) + ν_{k} {\bar{D}}_{0} (k)] u_{k} + [D_{d} (k) \\ + ν_{k} {\bar{D}}_{d} (k)] u_{k - d} + w_{k} \\ z_{k} & = [H (k) + g_{k} \bar{H} (k)] x_{k} + e_{k} \end{matrix}

(29)

where

z_{k} \in R^{q}

is the measurement,

g_{k}

is the scalar white noise with zero mean and variance

Q_{g_{k}}

w_{k}

and

e_{k}

are Gaussian zero-mean white noises with covariance

Q_{w_{k}}

and

Q_{e_{k}}

H (k)

and

\bar{H} (k)

are deterministic matrices with compatible dimensions. In this case, the initial value

x_{0}

is known,

ν_{k}

w_{k}

g_{k}

and

e_{k}

are independent of each other.

Obviously, there exist multiplicative noises

ν_{k}

and

g_{k}

in system (29). As we can not obtain the exact information of the state by (29), we introduce the state estimation to design the controller instead. We first obtain the linear optimal state estimator for by applying standard filtering results in [3]. Then, we will derive the suboptimal linear state estimate feedback controller through the following linearizations.

The aim of this section is to find the suboptimal linear state estimate feedback controller for system (29) in order to minimize the cost function (2).

First, we introduce the linear optimal state estimator in Lemma 2.

Lemma 2.

Based on the system (29) with input delays and multiplicative noises, the linear optimal estimator is given by

\begin{matrix} {\hat{x}}_{k + 1 | k} = & E [x_{k + 1} | z_{0}, \dots, z_{k}] \\ = & C (k) {\hat{x}}_{k | k - 1} + D_{0} (k) u_{k} + D_{d} (k) u_{k - d} + K_{k} {\tilde{z}}_{k}, \end{matrix}

(30)

where

\begin{matrix} {\tilde{z}}_{k} = & z_{k} - C (k) {\hat{x}}_{k | k - 1}, \\ K_{k} = & C (k) Σ_{k | k - 1} H^{'} (k) (H (k) Σ_{k | k - 1} H^{'} (k) + Q_{g_{k}} \bar{H} (k) \\ \times ({\hat{x}}_{k | k - 1} {\hat{x}}_{k | k - 1}^{'} + Σ_{k | k - 1}) {\bar{H}}^{'} (k) + Q_{e_{k}})^{- 1} \end{matrix}

Besides, the estimator error covariance matrix is

\begin{matrix} Σ_{k + 1 | k} = & E [(x_{k + 1} - {\hat{x}}_{k + 1 | k}) {(x_{k + 1} - {\hat{x}}_{k + 1 | k})}^{'} | z_{0}, \dots, z_{k}] \\ = & C (k) Σ_{k | k - 1} C^{'} (k) - K_{k} (H (k) Σ_{k | k - 1}^{'} C^{'} (k)) + ϕ^{2} \\ \times [\bar{C} (k) (Σ_{k | k - 1} + {\hat{x}}_{k | k - 1} {\hat{x}}_{k | k - 1}^{'}) {\bar{C}}^{'} (k) + {\bar{D}}_{0} (k) u_{k} \\ \times u_{k}^{'} {\bar{D}}_{0}^{'} (k) + {\bar{D}}_{d} (k) u_{k - d} u_{k - d}^{'} {\bar{D}}_{d}^{'} (k)] + Q_{w_{k}} . \end{matrix}

The initial values

{\hat{x}}_{0 | - 1} = {\bar{x}}_{0}

and

Σ_{0 | - 1} = {\bar{P}}_{0}^{e}

Proof.

The proof of Lemma 2 is put into Appendix C. □

Now, the state estimation is obtained, and we can consider (30) as the state instead of the unavailable exact state information. Observing (3) and (30), we know that the filter gain

K_{k}

{\hat{x}}_{k | k - 1}

should be affine, so that we can apply the results of Theorem 1 in this section. Then, we will linearize the filter gain

K_{k}

Applying first order of Taylor expansion on

K_{k}

through the fixed point

{\hat{x}}_{k | k - 1} = {\bar{x}}_{0}

, the linearization of

K_{k}

yields

\begin{matrix} K_{k} = K_{k}^{0} + K_{k}^{1} ({\hat{x}}_{k | k - 1} - {\bar{x}}_{0}) + o (∥{\hat{x}}_{k | k - 1}∥) . \end{matrix}

(31)

Ignoring the quadratic and higher order terms in (31), and plug (30) into it, the approximation of

{\hat{x}}_{k + 1 | k}

becomes

\begin{matrix} {\hat{x}}_{k + 1 | k} \approx & C (k) {\hat{x}}_{k | k - 1} + D_{0} (k) u_{k} + D_{d} (k) u_{k - d} + (K_{k}^{0} \\ + K_{k}^{1} ({\hat{x}}_{k | k - 1} - {\bar{x}}_{0})) {\tilde{z}}_{k} \\ = & (C (k) + K_{k}^{1} {\tilde{z}}_{k}) {\hat{x}}_{k | k - 1} + D_{0} (k) u_{k} + D_{d} (k) u_{k - d} \\ + (K_{k}^{0} - K_{k}^{1} {\bar{x}}_{0}) {\tilde{z}}_{k} . \end{matrix}

(32)

With (30)-(32), the cost function (2) can be reorganized as

\begin{matrix} J_{N}^{e} \approx & E [\sum_{k = 0}^{N} ({\hat{x}}_{k | k - 1}^{'} Q_{k} {\hat{x}}_{k | k - 1} + u_{k}^{'} R_{k} u_{k} + T r [Q_{k} Σ_{k | k - 1}]) \\ + {\hat{x}}_{N + 1 | N}^{'} P_{N + 1}^{e} {\hat{x}}_{N + 1 | N} + T r [P_{N + 1}^{e} Σ_{N + 1 | N}]] . \end{matrix}

(33)

In this case, the coupled Riccati equations can be derived as

\begin{matrix} P_{k}^{e} = C^{'} (k) P_{k + 1}^{e} C (k) + η_{k} {(K_{k}^{1})}^{'} P_{k + 1}^{e} (K_{k}^{1}) - {(M_{k}^{e})}^{'} {(Ω_{k}^{e})}^{- 1} M_{k}^{e} + Q_{k}, \end{matrix}

where

\begin{matrix} Ω_{k}^{e} & = R_{k} + D_{0}^{'} (k) P_{k + 1}^{e} D_{0} (k) + D_{d}^{'} (k + d) P_{k + d + 1}^{e} D_{d} (k + d) \\ + D_{0}^{'} (k) P_{k + 1}^{d - 1} + {(P_{k + 1}^{d - 1})}^{'} D_{0} (k) - \sum_{i = 1}^{d} {(M_{k + i}^{d - i})}^{'} Ω_{k + i}^{- 1} M_{k + i}^{d - i}, \\ M_{k}^{e} & = D_{0}^{'} (k) P_{k + 1}^{e} C (k) + {(P_{k + 1}^{d - 1})}^{'} C (k), \end{matrix}

with

\begin{matrix} {(M_{k}^{0})}^{e} & = D_{0}^{'} (k) P_{k + 1}^{e} D_{d} (k) + {({(P_{k + 1}^{d - 1})}^{e})}^{'} D_{d} (k), \\ {(M_{k}^{j})}^{e} & = D_{0}^{'} (k) {(P_{k + 1}^{j - 1})}^{e} + {({(P_{k + j + 1}^{d - j - 1})}^{e})}^{'} D_{d} (k + j) \\ - \sum_{i = 1}^{j} {({(M_{k + i}^{d - i})}^{e})}^{'} {(Ω_{k + i}^{e})}^{- 1} {(M_{k + i}^{j - i})}^{e}, \\ {(P_{k}^{0})}^{e} & = C^{'} (k) P_{k + 1}^{e} D_{d} (k) - {(M_{k}^{e})}^{'} {(Ω_{k}^{e})}^{- 1} {(M_{k}^{0})}^{e}, \\ {(P_{k}^{j})}^{e} & = C^{'} (k) {(P_{k + 1}^{j - 1})}^{e} - {(M_{k}^{e})}^{'} {(Ω_{k}^{e})}^{- 1} {(M_{k}^{j})}^{e}, \end{matrix}

j = 1, \dots, d - 1

, with the terminal values

\begin{matrix} P_{N + 1}^{e}, P_{N + i + 1}^{e} = 0, {(P_{N + i}^{j})}^{e} = 0, \\ {(M_{N + i}^{j})}^{e} = 0, Ω_{N + i}^{e} = I, i \geq 1, j = 0, \dots, d - 1, \end{matrix}

and

\begin{matrix} η_{k} = & E [{\tilde{z}}_{k} {\tilde{z}}_{k}^{'} | z_{0}, \dots, z_{k - 1}] \\ = & H (k) Σ_{k | k - 1} H^{'} (k) + Q_{g_{k}} {\bar{H}}^{'} (k) E [x_{k} x_{k}^{'} | z_{0}, \dots, z_{k - 1}] \\ \times {\bar{H}}^{'} (k) + Q_{e_{k}}, \end{matrix}

where

\begin{matrix} E [x_{k + 1} x_{k + 1}^{'} | z_{0}, \dots, z_{k}] \\ = & C (k) E [x_{k} x_{k}^{'} | z_{0}, \dots, z_{k - 1}] C^{'} (k) + ϕ^{2} {\bar{C}}^{'} (k) E [x_{k} x_{k}^{'} | z_{0}, \dots, z_{k - 1}] \\ \times \bar{C} (k) + D_{0} (k) u_{k} u_{k}^{'} D_{0}^{'} (k) + {\bar{D}}_{0} (k) u_{k} u_{k}^{'} {\bar{D}}_{0}^{'} (k) + D_{d} (k) u_{k - d} \\ \times u_{k - d}^{'} D_{d}^{'} (k) + {\bar{D}}_{d} (k) u_{k - d} u_{k - d}^{'} {\bar{D}}_{d}^{'} (k) Q_{w_{k}} + C (k) {\hat{x}}_{k | k - 1} u_{k}^{'} D_{0}^{'} (k) \\ + ϕ^{2} \bar{C} (k) {\hat{x}}_{k | k - 1} u_{k}^{'} {\bar{D}}_{0}^{'} (k) + C (k) {\hat{x}}_{k | k - 1} u_{k - d}^{'} D_{d}^{'} (k) + ϕ^{2} \bar{C} (k) \\ \times {\hat{x}}_{k | k - 1} u_{k - d}^{'} {\bar{D}}_{d}^{'} (k) + D_{0} (k) u_{k} {\hat{x}}_{k | k - 1}^{'} C^{'} (k) + ϕ^{2} {\bar{D}}_{0} (k) u_{k} {\hat{x}}_{k | k - 1}^{'} \\ \times {\bar{C}}^{'} (k) + D_{0} (k) u_{k} u_{k - d}^{'} D_{d}^{'} (k) + ϕ^{2} {\bar{D}}_{0} (k) u_{k} u_{k - d}^{'} {\bar{D}}_{d}^{'} (k) + D_{d} (k) \\ \times u_{k - d} {\hat{x}}_{k | k - 1}^{'} C^{'} (k) + ϕ^{2} {\bar{D}}_{d} (k) u_{k - d} {\hat{x}}_{k | k - 1}^{'} {\bar{C}}^{'} (k) + D_{d} (k) u_{k - d} \\ \times u_{k}^{'} D_{0}^{'} (k) + ϕ^{2} {\bar{D}}_{d} (k) u_{k - d} u_{k}^{'} {\bar{D}}_{0}^{'} (k) . \end{matrix}

with the initial value

E [x_{0} x_{0}^{'}] = {\bar{x}}_{0} {\bar{x}}_{0}^{'} + {\bar{P}}_{0}^{e}

Now, we can find the suboptimal controller to minimize the cost function (33) subject to (29), by the results of Theorem 1.

Theorem 3.

The suboptimal linear state estimate feedback controller for system (29) that minimizes the cost function (33) is given by

\begin{matrix} u_{k}^{e} = - {(Ω_{k}^{e})}^{- 1} M_{k}^{e} {\hat{x}}_{k | k - 1} - {(Ω_{k}^{e})}^{- 1} \sum_{j = 0}^{d - 1} {(M_{k}^{j})}^{e} u_{j + k - d}^{e} . \end{matrix}

(34)

The minimized cost function is given by

\begin{matrix} {(J_{N}^{e})}^{*} = & {\bar{x}}_{0}^{'} P_{0}^{e} {\bar{x}}_{0} + 2 {\bar{x}}_{0}^{'} \sum_{j = 0}^{d - 1} {(P_{0}^{j})}^{e} u_{j - d} + \sum_{j = 0}^{d - 1} u_{j - d}^{'} D_{d}^{'} (j) P_{j + 1}^{e} \\ \times D_{d} (j) u_{j - d} + 2 \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} u_{j - d}^{'} D_{d}^{'} (j) {(P_{j + 1}^{i - j - 1})}^{e} u_{i - d}^{e} \\ - \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} \sum_{m = 0}^{d - 1} {(u_{j - d}^{e})}^{'} {({(M_{m}^{j - m})}^{e})}^{'} {(Ω_{m}^{e})}^{- 1} {(M_{m}^{i - m})}^{e} \\ \times u_{i - d}^{e} + \sum_{k = 0}^{N} T r [P_{k + 1}^{e} Q_{w_{k}}], \end{matrix}

(35)

4. Numerical examples

Example 1 Consider the scalar case of time-invariant LQG control system (3) in Remark 3. We consider the case that the additive noise

w_{k}

is the zero-mean white noise. The associate parameters are as

\begin{matrix} C = 2, \bar{C} = 1, D_{0} = 3, {\bar{D}}_{0} = 1, D_{d} = 2, {\bar{D}}_{d} = 2, \\ d = 5, ϕ = 1, Q_{w} = 1, ρ = 0.6, \bar{w} = 0.2, \end{matrix}

with the initial values

\begin{matrix} x_{0} = 1, u_{- 5} = - 0.5, u_{- 4} = 0.8, u_{- 3} = - 1.2, \\ u_{- 2} = - 1, u_{- 1} = - 0.6, \end{matrix}

and the cost function (2) with

Q = 1, R = 1, P_{N + 1} = 1

. When the delay

d = 5

, and

N = 30

, by applying Theorem 1 and the equations (7)-(13), direct calculations yield that

P_{k}, Ω_{k}, M_{k}, P_{k}^{j}, M_{k}^{j}

for

k = 0, \dots, N

. It can be obviously known that

Ω_{k}

is positive definite for

k = 0, \dots, N

. Thus, there exists a unique

u_{k}

from Theorem 1, and the optimal controller can be calculated with (18), which is shown in Figure 1.

Accordingly, the associated optimal value of (17) is

J_{N}^{*}

=107.5150.

In order to illustrate that the proposed LQG controller can minimize performance index, let us consider the time-invariant standard state feedback controller

u_{k} = - Ω_{k}^{- 1} M_{k} x_{k}

. Based on the above parameters and by substituting into cost function, the controller

u_{k}

are shown in Figure 2, and the associated value is

J_{N}^{*}

=255.0603, which confirmed the effectiveness of the algorithm.

Example 2 Consider the discrete time-varying LQG control system with multiple delays and multiplicative noises with

x_{k} \in R^{2}

u_{k} \in R^{2}

, and the cost function (2). The associate coefficients are:

\begin{matrix} C (1) = [\begin{matrix} - 0.2 & - 0.1 \\ - 0.8 & 1.1 \end{matrix}], C (2) = [\begin{matrix} - 1.9 & 1.1 \\ - 1.4 & - 1.9 \end{matrix}], \\ C (3) = [\begin{matrix} - 1.4 & - 0.3 \\ 0 & 0.4 \end{matrix}], C (4) = [\begin{matrix} - 1.6 & 0.6 \\ 1.9 & - 0.8 \end{matrix}], \\ \bar{C} (1) = [\begin{matrix} - 1.3 & 0.8 \\ - 1.9 & 0.4 \end{matrix}], \bar{C} (2) = [\begin{matrix} - 1.4 & 1.6 \\ - 0.7 & 0.4 \end{matrix}], \\ \bar{C} (3) = [\begin{matrix} - 0.9 & 0.3 \\ 0.3 & - 1.5 \end{matrix}], \bar{C} (4) = [\begin{matrix} 1.4 & 1 \\ - 1.8 & 1.5 \end{matrix}], \\ D_{0} (1) = [\begin{matrix} - 0.3 & 0 \\ 1.6 & 1.6 \end{matrix}], D_{0} (2) = [\begin{matrix} - 1.9 & - 1.2 \\ - 0.7 & - 0.9 \end{matrix}], \\ D_{0} (3) = [\begin{matrix} - 1.7 & - 1.3 \\ - 1.8 & 0.1 \end{matrix}], D_{0} (4) = [\begin{matrix} 0.5 & - 0.2 \\ 0.8 & 1 \end{matrix}], \\ {\bar{D}}_{0} (1) = [\begin{matrix} 0 & 0.8 \\ 1.7 & 0.7 \end{matrix}], {\bar{D}}_{0} (2) = [\begin{matrix} - 1.9 & 1 \\ 1.3 & 0.8 \end{matrix}], \\ {\bar{D}}_{0} (3) = [\begin{matrix} 1.3 & - 1.2 \\ - 0.8 & - 1.7 \end{matrix}], {\bar{D}}_{0} (4) = [\begin{matrix} - 1.9 & - 1.6 \\ - 1.2 & - 1.3 \end{matrix}], \\ D_{d} (1) = [\begin{matrix} - 1.9 & - 0.3 \\ 0.1 & 0.5 \end{matrix}], D_{d} (2) = [\begin{matrix} 1.3 & - 1.5 \\ 1 & - 0.5 \end{matrix}], \\ D_{d} (3) = [\begin{matrix} 1.7 & 0.5 \\ - 1 & - 1.1 \end{matrix}], D_{d} (4) = [\begin{matrix} 1.7 & - 1.5 \\ 1.4 & 0.6 \end{matrix}], \\ D_{d} (5) = [\begin{matrix} 1.8 & - 1.9 \\ 0 & - 0.3 \end{matrix}], D_{d} (6) = [\begin{matrix} - 0.3 & - 0.2 \\ 0.4 & - 1.9 \end{matrix}], \\ D_{d} (7) = [\begin{matrix} - 0.9 & - 1 \\ 0.3 & 1.8 \end{matrix}], {\bar{D}}_{d} (1) = [\begin{matrix} 0.2 & 0.2 \\ - 1.1 & 1.4 \end{matrix}], \\ {\bar{D}}_{d} (2) = [\begin{matrix} - 1.9 & - 2 \\ - 2 & - 1.3 \end{matrix}], {\bar{D}}_{d} (3) = [\begin{matrix} - 1 & - 1.4 \\ 1 & 1.3 \end{matrix}], \\ {\bar{D}}_{d} (4) = [\begin{matrix} - 1.8 & - 1.2 \\ - 1.4 & 0.6 \end{matrix}], {\bar{D}}_{d} (5) = [\begin{matrix} - 1.5 & - 0.9 \\ 1.8 & 1.5 \end{matrix}], \\ {\bar{D}}_{d} (6) = [\begin{matrix} - 1.8 & 1.9 \\ - 0.4 & 0.1 \end{matrix}], {\bar{D}}_{d} (7) = [\begin{matrix} - 1.7 & 0.5 \\ - 1.2 & - 0.8 \end{matrix}], \\ P_{N + 1} = [\begin{matrix} 0 & 0 \\ 0 & 0 \end{matrix}], Q_{1} = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}], Q_{2} = [\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}], \\ Q_{3} = [\begin{matrix} 0.8 & 1 \\ 1 & 0.8 \end{matrix}], Q_{4} = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}], R_{1} = [\begin{matrix} 2 & 0 \\ 0 & 2 \end{matrix}], \\ R_{2} = [\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}], R_{3} = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}], R_{4} = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}], \end{matrix}

When the delay time is

d = 3

and the final time is

N = 4

, given the initial value

\begin{matrix} x_{0} = [\begin{matrix} 1 \\ 1 \end{matrix}], u_{- 3} = [\begin{matrix} 1.7 \\ - 1.9 \end{matrix}], u_{- 2} = [\begin{matrix} 1 \\ 1 \end{matrix}], u_{- 1} = [\begin{matrix} 0 \\ 1.6 \end{matrix}], \\ C (0) = [\begin{matrix} 2 & 0 \\ 1 & 2 \end{matrix}], \bar{C} (0) = [\begin{matrix} 1 & 1 \\ 2 & 1 \end{matrix}], D_{0} (0) = [\begin{matrix} 1 & 2 \\ 1 & 0 \end{matrix}], \\ {\bar{D}}_{0} (0) = [\begin{matrix} 0 & 1 \\ 2 & 1 \end{matrix}], D_{d} (0) = [\begin{matrix} 2 & 1 \\ 1 & 2 \end{matrix}], {\bar{D}}_{d} (0) = [\begin{matrix} 1 & 2 \\ 1 & 0 \end{matrix}], \end{matrix}

by applying Theorem 1 and (8)-(13), it yields that

\begin{matrix} P_{1} = [\begin{matrix} 25.33 & - 2.03 \\ - 2.03 & 2.60 \end{matrix}], P_{2} = [\begin{matrix} 9.10 & 1.45 \\ 1.45 & 12.47 \end{matrix}], \\ P_{3} = [\begin{matrix} 2.83 & - 0.76 \\ - 0.76 & 2.90 \end{matrix}], P_{4} = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}], \\ Ω_{1} = [\begin{matrix} 69.41 & 48.05 \\ 48.05 & 47.49 \end{matrix}], Ω_{2} = [\begin{matrix} 28.53 & 5.16 \\ 5.16 & 8.26 \end{matrix}], \\ Ω_{3} = [\begin{matrix} 9.46 & 1.83 \\ 1.83 & 7.03 \end{matrix}], Ω_{4} = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}], \\ M_{1} = [\begin{matrix} - 59.03 & 31.96 \\ - 46.01 & 32.31 \end{matrix}], M_{2} = [\begin{matrix} 15.30 & - 12.32 \\ 3.33 & 4.43 \end{matrix}], \\ M_{3} = [\begin{matrix} 0.97 & 0.36 \\ 2.39 & 1.84 \end{matrix}], M_{4} = [\begin{matrix} 0 & 0 \\ 0 & 0 \end{matrix}] . \end{matrix}

For

i = 1, 2, 3, 4

Ω_{i} > 0

, thus, there is an optimal solution to the LQG system with multiple delays and state/control noises from Theorem 1. Based on the above data, the optimal controller can be calculated as

\begin{matrix} u_{0} = [\begin{matrix} 0.59 \\ - 1.35 \end{matrix}], u_{1} = [\begin{matrix} - 0.96 \\ - 0.46 \end{matrix}], u_{2} = [\begin{matrix} 1.38 \\ - 2.34 \end{matrix}], \\ u_{3} = [\begin{matrix} - 0.10 \\ 1.34 \end{matrix}], u_{4} = [\begin{matrix} 0 \\ 0 \end{matrix}] . \end{matrix}

According to (17), the optimal performance index of system (3) is

J_{N}^{*} = 51.1915

5. Conclusions

In this paper, the discrete time-varying optimal linear quadratic Gaussian (LQG) control problem involving multiple delays and state/control- dependent noises has been studied. A necessary and sufficient condition for the existence of unique optimal controller to the problem is given, which is based on the obtained maximum principle and the relationship between the state and costate. Under this context, the optimal controller and the minimized performance index are represented. What’s more, as the state variables observed partially, the suboptimal linear state estimate feedback controllers for the LQG models with input delays and multiplicative noises are derived.

Author Contributions

Conceptualization, Qiyan Zhang, Chunyang Sheng, Xiao Lu and Haixia Wang; methodology, Qiyan Zhang, Chunyang Sheng, Xiao Lu and Haixia Wang; validation, Qiyan Zhang, Chunyang Sheng, Xiao Lu and Haixia Wang; formal analysis,Qiyan Zhang, Chunyang Sheng, Xiao Lu and Haixia Wang; investigation, Qiyan Zhang, Chunyang Sheng, Xiao Lu and Haixia Wang; resources, Qiyan Zhang, Chunyang Sheng, Xiao Lu and Haixia Wang; data curation, Qiyan Zhang, Chunyang Sheng, Xiao Lu and Haixia Wang; writing—original draft preparation, Qiyan Zhang and Chunyang Sheng; writing—review and editing, Qiyan Zhang, Chunyang Sheng, Xiao Lu and Haixia Wang; funding acquisition, Chunyang Sheng, Xiao Lu and Haixia Wang. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (62273213, 62073199, 62103245, 62203280), Natural Science Foundation of Shandong Province for Innovation and Development Joint Funds (ZR2022LZH001), Natural Science Foundation of Shandong Province (ZR2020MF095, ZR2022MF341), Taishan Scholarship Construction Engineering.

Data Availability Statement

The data of this study is included within the article.

Conflicts of Interest

The authors declare no conflict of interest with respect to the research, authorship, and/or publication of this article.

Appendix A The proof of Lemma 1

Utilizing the maximum principle (4)-(6) to system (3) with cost function (2). We can obtain for

k = N

\begin{matrix} 0 = & (D_{0} {(N)}^{'} P_{N + 1} C (N) + ϕ^{2} {\bar{D}}_{0}^{'} (N) P_{N + 1} \bar{C} (N)) x_{N} + (D_{0}^{'} (N) \\ \times P_{N + 1} D_{0} (N) + ϕ^{2} {\bar{D}}_{0}^{'} (N) P_{N + 1} {\bar{D}}_{0} (N) + R_{N}) u_{N} \\ + (D_{0}^{'} (N) P_{N + 1} D_{d} (N) + ϕ^{2} {\bar{D}}_{0}^{'} (N) P_{N + 1} {\bar{D}}_{d} (N)) u_{N - d} \\ + D_{0}^{'} (N) P_{N + 1} {\bar{w}}_{N} + {\bar{D}}_{0}^{'} (N) P_{N + 1} ρ . \end{matrix}

With (9)-(11), the optimal controller

u_{N}

is as

\begin{matrix} u_{N} = & - Ω_{N}^{- 1} M_{N} x_{N} - Ω_{N}^{- 1} \sum_{j = 0}^{d - 1} M_{N}^{j} u_{j + N - d} - Ω_{N}^{- 1} Σ_{N} . \end{matrix}

From (4)-(6), we also have

\begin{matrix} ζ_{N - 1} = & (C^{'} (N) P_{N + 1} C (N) + ϕ^{2} {\bar{C}}^{'} (N) P_{N + 1} C (N) + Q_{N}) x_{N} \\ + (C^{'} (N) P_{N + 1} D_{0} (N) + ϕ^{2} \bar{C} (N) P_{N + 1} {\bar{D}}_{0} (N)) u_{N} \\ + (C^{'} (N) P_{N + 1} D_{d} (N) + ϕ^{2} \bar{C} (N) P_{N + 1} {\bar{D}}_{d} (N)) u_{N - d} \\ + C^{'} (N) P_{N + 1} {\bar{w}}_{N} + {\bar{C}}^{'} (N) P_{N + 1} ρ \end{matrix}

Substituting (7), (12) and (13),

ζ_{N - 1}

yields

\begin{matrix} ζ_{N - 1} = & P_{N} x_{N} + \sum_{j = 0}^{d - 1} P_{N}^{j} u_{j + N - d} + Φ_{N} . \end{matrix}

We have verified (15) for

k = N

. Assuming that

ζ_{k - 1}

are as (15) for all

k \geq n + 1

with

n > N - d

, then we will show that (15) also holds for

k = n

. Set

u_{k}

to be optimal for all

k \geq n + 1

, with equations (3) and (15),

ζ_{n}

can be calculated as

\begin{matrix} ζ_{n} = & P_{n + 1} (C_{n} (n) x_{n} + D_{n}^{0} (n) u_{n} + D_{n}^{d} (n) u_{n - d} + w_{n}) \\ + \sum_{j = 0}^{d - 1} P_{n + 1}^{j} u_{j + n + 1 - d} + Φ_{n + 1} . \end{matrix}

(A1)

Insert

ζ_{n}

to (5), (5) will become

\begin{matrix} 0 & = M_{n} x_{n} + Ω_{n} u_{n} + \sum_{j = 0}^{d - 1} M_{n}^{j} u_{j + n - d} + Σ_{n} \end{matrix}

Thus, the optimal controller is given by

\begin{matrix} u_{n} = - Ω_{n}^{- 1} M_{n} x_{n} - Ω_{n}^{- 1} \sum_{j = 0}^{d - 1} M_{n}^{j} u_{j + n - d} - Ω_{n}^{- 1} Σ_{n}, \end{matrix}

for

n = N, \dots, N - d + 1

. Using the equations (3),(5) and (A1),

ζ_{n - 1}

yields that

\begin{matrix} ζ_{n - 1} \\ = & (C^{'} (n) P_{n + 1} C (n) + ϕ^{2} {\bar{C}}^{'} (n) P_{n + 1} \bar{C} (n) + Q_{n} - M_{n}^{'} Ω_{n}^{- 1} M_{n}) \\ \times x_{n} + (C^{'} (n) P_{n + 1} D_{d} (n) + ϕ^{2} {\bar{C}}^{'} (n) P_{n + 1} {\bar{D}}_{d} (n) - M_{n}^{'} \\ \times Ω_{n}^{- 1} M_{n}^{0}) u_{n - d} + \sum_{j = 1}^{d - 1} (C^{'} (n) P_{n}^{j - 1} - M_{n}^{'} Ω_{n}^{- 1} M_{n}^{j}) u_{j + n - d} \\ - M_{n}^{'} Ω_{n}^{- 1} Σ_{n} + C^{'} (n) (P_{n + 1} {\bar{w}}_{n} + Φ_{n + 1}) + {\bar{C}}^{'} (n) P_{n + 1} ρ \\ = & P_{n} x_{n} + \sum_{j = 0}^{d - 1} P_{n}^{j} u_{j + n - d} + Φ_{n}, \end{matrix}

which implies that (15) holds for

k = n, N - d < n \leq N

Then we obtained

\begin{matrix} ζ_{N - d} & = P_{N - d + 1} x_{N - d + 1} + \sum_{j = 0}^{d - 1} P_{N - d + 1}^{j} u_{j + N - 2 d + 1} + Φ_{N - d + 1}, \\ u_{N - d + 1} & = - Ω_{N - d + 1}^{- 1} M_{N - d + 1} x_{N - d + 1} - Ω_{N - d + 1}^{- 1} \sum_{j = 0}^{d - 1} M_{N - d + 1}^{j} \\ \times u_{j + k - d} - Ω_{N - d + 1}^{- 1} Σ_{N - d + 1} . \end{matrix}

(A2)

Analogy with the method, assuming that

ζ_{k - 1}

are as (15) for all

k \geq n + 1, n = 0, \dots, N - d

, and we will verify that (15) also holds for

k = n

. As

ζ_{n}

is calculated as (A1), then for

n = 0, \dots, N - d

, (6) will be obtained

\begin{matrix} 0 & = Ψ + (D_{d}^{'} (n) P_{n + d + 1} C (n) + ϕ^{2} {\bar{D}}_{d}^{'} (n) P_{n + d + 1} \bar{C} (n)) x_{n + d} \\ + {(M_{n + d}^{0})}^{'} (- Ω_{n + d}^{- 1} x_{n + d} - Ω_{n + d}^{- 1} \sum_{j = 0}^{d - 1} M_{n + d}^{j} u_{j + n} - Ω_{n + d}^{- 1} \\ \times Σ_{n + d}) + D_{0}^{'} (n) \sum_{j = 0}^{d - 2} P_{n + 1}^{j} u_{j + n - d + 1} + D_{d}^{'} (n + d) \sum_{j = 0}^{d - 2} P_{n + d + 1}^{j} u_{j + n + 1} \\ = Ψ + ({(P_{n + d - 1}^{1})}^{'} C (n + d) - {(M_{n + d - 2}^{2})}^{'} Ω_{n + d - 2}^{- 1} M_{n + d - 2}) \\ \times x_{n + d - 2} - {(M_{n + d - 2}^{2})}^{'} Ω_{n + d - 2}^{- 1} \sum_{j = 0}^{d - 1} M_{n + d - 2}^{j} u_{j + n - 2} \\ + ({(P_{n + d - 1}^{1})}^{'} D_{d} (n + d - 2) + D_{0}^{'} (n) P_{n + 1}^{d - 3}) u_{n - 2} \\ + ({(P_{n + d}^{0})}^{'} D_{d} (n + d - 1) + D_{0}^{'} (n) P_{n + 1}^{d - 2} - {(M_{n + d - 1}^{1})}^{'} \\ \times Ω_{n + d - 1}^{- 1} M_{n + d - 1}^{0}) u_{n - 1} - {(M_{n + d}^{0})}^{'} Ω_{n + d}^{- 1} \sum_{j = 0}^{d - 3} M_{n + d}^{j} u_{j + n} \\ - {(M_{n + d - 1}^{1})}^{'} Ω_{n + d - 1}^{- 1} \sum_{j = 1}^{d - 2} M_{n + d - 1}^{j} u_{j + n - 1} + D_{0}^{'} (n) \sum_{j = 0}^{d - 4} P_{n + 1}^{j} \\ \times u_{j + n - d + 1} + D_{d}^{'} (n + d) \sum_{j = 0}^{d - 4} P_{n + d + 1}^{j} u_{j + n + 1} \\ - \sum_{i = d - 1}^{d} {(M_{n + i}^{d - i})}^{'} Ω_{n + i}^{- 1} Σ_{n + i} + \sum_{i = d - 2}^{d - 1} P_{n + 1 + i}^{d - 1 - i} {\bar{w}}_{n + i} \end{matrix}

where

\begin{matrix} Ψ & = (D_{0}^{'} (n) P_{n + 1} C (n) + ϕ^{2} {\bar{D}}_{0}^{'} (n) P_{n + 1} \bar{C} (n)) x_{n} + (D_{0}^{'} (n) \\ \times P_{n + 1} D_{0} (n) + ϕ^{2} {\bar{D}}_{0}^{'} (n) P_{n + 1} {\bar{D}}_{0} (n) + R_{n} + D_{0}^{'} (n) P_{n + 1}^{d - 1} \\ + D_{d}^{'} (n + d) P_{n + d + 1} D_{d} (n + d) + ϕ^{2} {\bar{D}}_{d}^{'} (n + d) P_{n + d + 1} \\ \times {\bar{D}}_{d} (n + d)) u_{n} + (D_{0}^{'} (n) P_{n + 1} D_{d} (n) + ϕ^{2} {\bar{D}}_{0}^{'} (n) P_{n + 1} \\ \times {\bar{D}}_{d} (n)) u_{n - d} + D_{0}^{'} (n) (P_{n + 1} {\bar{w}}_{n} + Φ_{n + 1}) + {\bar{D}}_{0}^{'} (n) P_{n + 1} ρ \\ + D_{d}^{'} (n + d) (P_{n + d + 1} {\bar{w}}_{n + d} + Φ_{n + 1}) + {\bar{D}}_{d}^{'} (n + d) P_{n + d + 1} ρ \end{matrix}

After inserting (3) and (A2), and combing like terms, we can summarize that

\begin{matrix} 0 & = Ψ + {(P_{n + 1}^{d - 1})}^{'} x_{n + 1} + ({(P_{n + 2}^{d - 2})}^{'} D_{d} (n + 1) + D_{0}^{'} (n) P_{n + 1}^{0} \\ - {(M_{n + 1}^{d - 1})}^{'} Ω_{n + 1}^{- 1} M_{n + 1}^{0}) u_{n - d + 1} + ({(P_{n + 3}^{d - 3})}^{'} D_{d} (n + 2) \\ + D_{0} (n) P_{n + 1}^{1} - \sum_{i = 1}^{2} {(M_{n + i}^{d - i})}^{'} Ω_{n + i}^{- 1} M_{n + i}^{2 - i}) u_{n - d + 2} + \dots \\ + ({(P_{n + d - 1}^{1})}^{'} D_{d} (n + d - 2) + D_{0}^{'} (n) P_{n + 1}^{d - 3} - \sum_{i = 1}^{d - 2} {(M_{n + i}^{d - i})}^{'} \\ \times Ω_{n + i}^{- 1} M_{n + i}^{d - 2 - i}) u_{n - 2} + ({(P_{n + d}^{0})}^{'} D_{d} (n + d - 1) + D_{0}^{'} P_{n + 1}^{d - 2} \\ - \sum_{i = 1}^{d - 1} {(M_{n + i}^{d - i})}^{'} Ω_{n + i}^{- 1} M_{n + i}^{d - 1 - i}) u_{n - 1} + \sum_{i = 1}^{d} {(M_{n + i}^{d - i})}^{'} Ω_{n + i}^{- 1} \\ \times M_{n + i}^{d - i} u_{n} + \sum_{i = 1}^{d - 1} P_{n + 1 + i}^{d - 1 - i} {\bar{w}}_{n + i} - \sum_{i = 1}^{d} {(M_{n + i}^{d - i})}^{'} Ω_{n + i}^{- 1} Σ_{n + i} \\ = M_{n} x_{n} + Ω_{n} u_{n} + \sum_{j = 0}^{d - 1} M_{n}^{j} u_{j + n - d} + Σ_{n} . \end{matrix}

Now, the optimal controller for

n = 0, \dots, N - d

is obtained as

\begin{matrix} u_{n} = - Ω_{n}^{- 1} M_{n} x_{n} - Ω_{n}^{- 1} \sum_{j = 0}^{d - 1} M_{n}^{j} u_{j + n - d} - Ω_{n}^{- 1} Σ_{n} . \end{matrix}

In the same way, substituting

u_{n}

into (5), we can also prove that

\begin{matrix} ζ_{n - 1} = P_{n} x_{n} + \sum_{j = 0}^{d - 1} P_{n}^{j} u_{j + n - d} + Φ_{n}, n = 0, \dots, N - d . \end{matrix}

This completes the proof of the lemma.

Appendix B The proof of Theorem 1

"Necessity": Suppose there exists the unique

F_{k - 1}

-measurable

u_{k}

to make the cost function (2) minimized. We will show by induction that

Ω_{k}, k = d, \dots, N

are positive definite and the optimal controller can be designed as (15). Define

\begin{matrix} J (k) = \sum_{i = k}^{N} E [x_{i}^{'} Q_{i} x_{i} + u_{i}^{'} R_{i} u_{i} + x_{N + 1}^{'} P_{N + 1} x_{N + 1}], \end{matrix}

for

k = 0, \dots, N

, and when

k = N

the above equation becomes

\begin{matrix} J (N) & = E [x_{N}^{'} Q_{N} x_{N} + u_{N}^{'} R_{N} u_{N} + (C_{N} (N) x_{N} + D_{N}^{0} (N) u_{N} \\ + D_{N}^{d} (N) u_{N - d} + w_{N})^{'} P_{N + 1} (C_{N} (N) x_{N} + D_{N}^{0} (N) \\ \times u_{N} + D_{N}^{d} (N) u_{N - d} + w_{N})] . \end{matrix}

Using (3), we can obviously know that the uniqueness of the optimal controller only depends on whether

u_{N} > 0

. Then setting

x_{N} = 0

, and

u_{N - d} = 0

J (N)

can be presented as

\begin{matrix} J (N) & = u_{N}^{'} Ω_{N} u_{N} + 2 u_{N}^{'} (D_{0}^{'} (N) P_{N + 1} {\bar{w}}_{N} + {\bar{D}}_{0}^{'} (N) P_{N + 1} ρ) \\ + T r [P_{N + 1} Q_{w_{N}}] . \end{matrix}

(A3)

We know that

J (N)

is expressed as a quadratic function of

u_{N}

, and as there is a unique solution for system (3), then

J (N) > 0

, it follows that

Ω_{N} > 0

, i.e.

Ω_{k}

is positive definite for

k = N

. In order to accomplish the proof, we assume

Ω_{k} > 0

for all

k \geq n + 1

. Then we will prove that

Ω_{n} > 0

. With (3), (5) and (6), for

k \geq n + 1

, we construct that

\begin{matrix} E [x_{k}^{'} ζ_{k - 1} - x_{k + 1}^{'} ζ_{k}] \\ = & E [x_{k}^{'} Q_{k} x_{k} + u_{k}^{'} R_{k} u_{k}] + E [u_{k}^{'} {(D_{k + d}^{d} (k + d))}^{'} ζ_{k + d} \\ - u_{k - d}^{'} {(D_{k}^{d} (k))}^{'} ζ_{k}] - E [w_{k}^{'} ζ_{k}] . \end{matrix}

Adding from

k = n + 1

k = N

on both sides of the above equation in order to get the form of

J (N)

, we have

\begin{matrix} E [x_{n + 1}^{'} ζ_{n} - x_{N + 1}^{'} ζ_{N}] = \sum_{k = n + 1}^{N} E [x_{k}^{'} Q_{k} x_{k} + u_{k}^{'} R_{k} u_{k}] + \sum_{k = n + 1}^{N} \\ E [u_{k}^{'} {(D_{k + d}^{d} (k + d))}^{'} ζ_{k + d} - u_{k - d}^{'} {(D_{k}^{d} (k))}^{'} ζ_{k}] - \sum_{k = n + 1}^{N} [w_{k}^{'} ζ_{k}] . \end{matrix}

Then

\begin{matrix} E [\sum_{k = n + 1}^{N} (x_{k}^{'} Q_{k} x_{k} + u_{k}^{'} R_{k} u_{k}) + x_{N + 1}^{'} P_{N + 1} x_{N + 1}] \\ = & E [x_{n + 1}^{'} ζ_{n} + \sum_{k = n + 1}^{n + d} u_{k - d}^{'} {(D_{k}^{d} (k))}^{'} ζ_{k} + \sum_{k = n + 1}^{N} w_{k}^{'} ζ_{k}] . \end{matrix}

Using (2), it yields that

\begin{matrix} J (n) = & E [x_{n + 1}^{'} ζ_{n} + x_{n}^{'} Q_{n} x_{n} + u_{n}^{'} R_{n} u_{n} + \sum_{k = n + 1}^{n + d} u_{k - d}^{'} {(D_{k}^{d} (k))}^{'} ζ_{k} \\ + \sum_{k = n + 1}^{N} w_{k}^{'} ζ_{k}] . \end{matrix}

(A4)

Setting

x_{n} = 0

u_{n - i} = 0

as same as the condition

k = N

. And plugging (16) into (A4), we obtain

\begin{matrix} J (n) & = u_{n}^{'} (D_{0}^{'} (n) P_{n + 1} D_{0} (n) + ϕ^{2} {\bar{D}}_{0}^{'} (n) P_{n + 1} {\bar{D}}_{0} (n) + R_{n} \\ + D_{0}^{'} (n) P_{n + 1}^{d - 1} + D_{d}^{'} (n + d) P_{n + d + 1} D_{d} (n + d) + ϕ^{2} \\ \times {\bar{D}}_{d}^{'} (n + d) P_{n + d + 1} {\bar{D}}_{d} (n + d) + {(P_{n + 1}^{d - 1})}^{'} D_{0} (n) \\ - \sum_{i = 1}^{d} {(M_{n + i}^{d - i})}^{'} Ω_{n + i}^{- 1} M_{n + i}^{d - i}) u_{n} + \sum_{k = n}^{N} w_{k}^{'} ζ_{k} + u_{n}^{'} (D_{0}^{'} (n) \\ \times P_{n + 1} {\bar{w}}_{n} + {\bar{D}}_{0}^{'} (n) P_{n + 1} ρ) + u_{n}^{'} (D_{d}^{'} (n + d) P_{n + d + 1} \\ \times {\bar{w}}_{n} + {\bar{D}}_{d}^{'} (n + d) P_{n + d + 1} ρ) + u_{n}^{'} \sum_{i = 0}^{d - 1} {(P_{n + d - i}^{i})}^{'} \\ \times {\bar{w}}_{n + d - 1 - i} + D_{0}^{'} (n) Φ_{n + 1} + D_{d}^{'} (n + d) Φ_{n + d + 1} . \end{matrix}

Similarly to the case

Ω_{N} > 0

above, we obviously get

Ω_{n} > 0

for all

k = 0, \dots, N

. This ends the proof of necessity.

"Sufficiency": Suppose

Ω_{k} > 0

for

k \geq 0

is true, we will show the uniqueness of the

F_{k - 1}

-measurable

u_{k}

to minimize (2). Denoted by

\begin{matrix} V_{k} (x_{k}) \\ = & E [x_{k}^{'} P_{k} x_{k} + 2 x_{k}^{'} \sum_{j = 0}^{d - 1} P_{k}^{j} u_{j - d + k} + \sum_{j = 0}^{d - 1} u_{j - d + k}^{'} ({(D_{k + j}^{d} (k + j))}^{'} \\ \times P_{k + j + 1} D_{k + j}^{d} (k + j)) u_{j - d + k} + 2 \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} u_{j - d + k}^{'} \\ \times {(D_{k + j}^{d} (k + j))}^{'} P_{k + j + 1}^{i - j - 1} u_{i - d + k} - \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} \sum_{m = 0}^{d - 1} u_{j - d + k}^{'} \\ \times {(M_{k + m}^{j - m})}^{'} Ω_{k + m}^{- 1} M_{k + m}^{i - m} u_{i - d + k}] + 2 x_{k}^{'} Φ_{k} . \end{matrix}

First, for

V_{k + 1} (x_{k + 1})

, using the equivalent substitution

j = j + 1

i = i + 1

, and

m = m + 1

, we calculate as follows

\begin{matrix} V_{k + 1} (x_{k + 1}) \\ = & E [x_{k + 1}^{'} P_{k + 1} x_{k + 1} + 2 x_{k + 1}^{'} \sum_{j = 0}^{d - 1} P_{k + 1}^{j - 1} u_{j - d + k} + \sum_{j = 0}^{d - 1} u_{j - d + k}^{'} \\ \times {(D_{k + j}^{d} (k + j))}^{'} P_{k + j + 1} D_{k + j}^{d} (k + j) u_{j - d + k} + 2 \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} \\ u_{j - d + k}^{'} {(D_{k + j}^{d} (k + j))}^{'} P_{k + j + 1}^{i - j - 1} u_{i - d + k} - \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} \sum_{m = 0}^{d - 1} u_{j - d + k}^{'} \\ \times {(M_{k + m}^{j - m})}^{'} Ω_{k + m}^{- 1} M_{k + m}^{i - m} u_{i - d + k}] + E [2 x_{k + 1}^{'} P_{k + 1}^{d - 1} u_{k} + u_{k}^{'} \\ \times {(D_{k + d}^{d} (k + d))}^{'} P_{k + d + 1} D_{k + d}^{d} (k + d) u_{k} - u_{k - d}^{'} {(D_{k}^{d} (k))}^{'} \\ \times P_{k + 1} D_{k}^{d} (k) u_{k - d} + 2 \sum_{j = 0}^{d - 1} u_{j - d + k}^{'} {(D_{k + j}^{d} (k + j))}^{'} P_{k + 1 + j}^{d - 1 - j} u_{k} \\ - 2 \sum_{i = 0}^{d - 1} u_{k - d}^{'} {(D_{k}^{d} (k))}^{'} P_{k + 1}^{i - 1} u_{i - d + k} - 2 u_{k - d} {(D_{k}^{d} (k))}^{'} P_{k + 1}^{d - 1} u_{k} \\ + \sum_{j = 0}^{d - 1} \sum_{i = 0}^{d - 1} u_{j - d + k}^{'} {(M_{k}^{j})}^{'} Ω_{k}^{- 1} M_{k}^{i} u_{k - d + i} - \sum_{m = 0}^{d - 1} u_{k}^{'} {(M_{k + m}^{d - m})}^{'} \\ \times Ω_{k + m}^{- 1} M_{k + m}^{d - m} u_{k} - u_{k}^{'} {(M_{k + d}^{0})}^{'} Ω_{k + d}^{- 1} M_{k + d}^{0} u_{k} + u_{k}^{'} {(M_{k}^{d})}^{'} \\ \times Ω_{k}^{- 1} M_{k}^{d} u_{k} - \sum_{j = 0}^{d - 1} \sum_{m = 0}^{d - 1} u_{j - d + k}^{'} {(M_{k + m}^{j - m})}^{'} Ω_{k + m}^{- 1} M_{k + m}^{d - m} u_{k} \\ - \sum_{i = 0}^{d - 1} \sum_{m = 0}^{d - 1} u_{k}^{'} {(M_{k + m}^{d - m})}^{'} Ω_{k + m}^{- 1} M_{k + m}^{i - m} u_{i - d + k}] + 2 x_{k + 1}^{'} Φ_{k + 1} . \end{matrix}

Construct the equation

V_{k} (x_{k}) - V_{k + 1} (x_{k + 1})

, then we have

\begin{matrix} V_{k} (x_{k}) - V_{k + 1} (x_{k + 1}) \\ = & x_{k}^{'} Q_{k} x_{k} + u_{k}^{'} R_{k} u_{k} + (u_{k} + Ω_{k}^{- 1} M_{k} x_{k} + Ω_{k}^{- 1} \sum_{j = 0}^{d - 1} M_{k}^{j} u_{j + k - d} \\ + Ω_{k}^{- 1} Σ)^{'} Ω_{k} (u_{k} + Ω_{k}^{- 1} M_{k} x_{k} + Ω_{k}^{- 1} \sum_{j = 0}^{d - 1} M_{k}^{j} u_{j + k - d} Ω_{k}^{- 1} \\ \times Σ_{k}) + Σ_{k}^{'} Ω_{k}^{- 1} Σ_{k} - T r [P_{k + 1} Q_{w_{k}}] - 2 {\bar{w}}_{k}^{'} Φ_{k + 1} . \end{matrix}

(A5)

Denote

\begin{matrix} Δ_{k} = u_{k} + Ω_{k}^{- 1} M_{k} x_{k} + Ω_{k}^{- 1} \sum_{j = 0}^{d - 1} M_{k}^{j} u_{j + k - d} + Ω_{k}^{- 1} Σ_{k}, \end{matrix}

(A6)

and by virtue of (A6) and (7)-(13), and adding from

k = 0

k = N

on both sides of (A5), then we get

\begin{matrix} V_{0} (x_{0}) - V_{N + 1} (x_{N + 1}) \\ = & \sum_{k = 0}^{N} [x_{k}^{'} Q_{k} x_{k} + u_{k}^{'} R_{k} u_{k} - Δ_{k}^{'} Ω_{k} Δ_{k} + Σ_{k}^{'} Ω_{k}^{- 1} Σ_{k} \\ - 2 {\bar{w}}_{k}^{'} Φ_{k + 1} - T r [P_{k + 1} Q_{w_{k}}]] \end{matrix}

Then the cost function (2) becomes

\begin{matrix} J_{N} & = V_{0} (x_{0}) + \sum_{k = 0}^{N} (Δ_{k}^{'} Ω_{k} Δ_{k} - Σ_{k}^{'} Ω_{k}^{- 1} Σ_{k} + 2 {\bar{w}}_{k}^{'} Φ_{k + 1}) \\ + \sum_{k = 0}^{N} T r [P_{k + 1} Q_{w_{k}}] . \end{matrix}

Ω_{k} > 0

, the unique optimal controller must match the condition

Δ_{k} = 0

. In this case, the cost function (2) will be the minimum, i.e., the optimal controller is

\begin{matrix} u_{k}^{*} = - Ω_{k}^{- 1} M_{k} x_{k} - Ω_{k}^{- 1} \sum_{j = 0}^{d - 1} M_{k}^{j} u_{j + k - d} - Ω_{k}^{- 1} Σ_{k} . \end{matrix}

(A7)

and the optimal cost is as (19).

Above all, the proof of sufficiency is completed.

Appendix C The proof of Lemma 2

By applying standard filtering results in [? ], we can obtain the linear optimal estimator for system (29) as follows.

\begin{matrix} {\hat{x}}_{k + 1 | k} = & E [x_{k + 1} | z_{0}, \dots, z_{k}] = E [x_{k + 1} | {\tilde{z}}_{0}, \dots, {\tilde{z}}_{k}] \\ = & E [x_{k + 1} | {\tilde{z}}_{k}] + E [x_{k + 1} | {\tilde{z}}_{0}, \dots, {\tilde{z}}_{k - 1}] - E [x_{k + 1}] \end{matrix}

In view of the jointly gaussian nature of

x_{k + 1}

and

{\tilde{z}}_{k}

, we know

\begin{matrix} E [x_{k + 1} | {\tilde{z}}_{k}] = E [x_{k + 1}] + c o v (x_{k + 1}, {\tilde{z}}_{k}) {[c o v ({\tilde{z}}_{k}, {\tilde{z}}_{k})]}^{- 1} {\tilde{z}}_{k} . \end{matrix}

(A8)

Using (29) and the orthogonality of

{\hat{x}}_{k | k - 1}

and

x_{k} - {\hat{x}}_{k | k - 1}

, the covariance matrixes yield

\begin{matrix} c o v (x_{k + 1}, {\tilde{z}}_{k}) & = C (k) Σ_{k | k - 1} H^{'} (k), \\ c o v ({\tilde{z}}_{k}, {\tilde{z}}_{k}) & = H (k) Σ_{k | k - 1} H^{'} (k) + Q_{g_{k}} \bar{H} (k) ({\hat{x}}_{k | k - 1} {\hat{x}}_{k | k - 1}^{'} \\ + Σ_{k | k - 1}) {\bar{H}}^{'} (k) + Q_{e_{k}}, \end{matrix}

where

x_{k} - {\hat{x}}_{k | k - 1}

is independent of

e_{k}

with zero mean,and the error covariance matrix

\begin{matrix} Σ_{k + 1 | k} = & C (k) Σ_{k | k - 1} C^{'} (k) - K_{k} (H (k) Σ_{k | k - 1}^{'} C^{'} (k)) + ϕ^{2} \\ \times [\bar{C} (k) (Σ_{k | k - 1} + {\hat{x}}_{k | k + 1} {\hat{x}}_{k | k + 1}^{'}) {\bar{C}}^{'} (k) + {\bar{D}}_{0} (k) \\ \times u_{k} u_{k}^{'} {\bar{D}}_{0}^{'} (k) + {\bar{D}}_{d} (k) u_{k - d} u_{k - d}^{'} {\bar{D}}_{d}^{'} (k)] + Q_{f_{k}} . \end{matrix}

Substituting above equations into (A8), it becomes

\begin{matrix} {\hat{x}}_{k + 1 | k} = & C (k) {\hat{x}}_{k | k - 1} + D_{0} (k) u_{k} + D_{d} (k) u_{k - d} + K_{k} {\tilde{z}}_{k} \end{matrix}

with

\begin{matrix} K_{k} & = C (k) Σ_{k | k - 1} H^{'} (k) (H (k) Σ_{k | k - 1} H^{'} (k) + Q_{g_{k}} \bar{H} (k) \\ \times ({\hat{x}}_{k | k - 1} {\hat{x}}_{k | k - 1}^{'} + Σ_{k | k - 1}) {\bar{H}}^{'} (k) + Q_{e_{k}})^{- 1} . \end{matrix}

The proof of Lemma 2 is completed.

References

H. Kwakernaak, Optimal filtering in linear systems with time delays, IEEE Transactions on Automatic Control, vol. 12, no. 2, pp. 169-173, 1967. [CrossRef]
J. B. Moore, X. Y. Zhou, A. E. B. Lim, Discrete time LQG controls with control dependent noise, Systems and Control Letters, vol. 36, no. 3, pp. 199-206, 1999. [CrossRef]
B. D. O. Anderson, J. B. Moore, Optimal filtering, Courier Corporation, 2012.
W. Sun, K. Wei, Z. Liu, et al, Linear quadratic gaussian control for wireless communication reliability for a mobile monitoring robot in a UHV power substation, IEEE Systems Journal, 2022.
A. Kalbat, Linear quadratic gaussian (lqg) control of wind turbines, 2013 3rd International Conference on Electric Power and Energy Conversion Systems, pp. 1-5, 2013.
X. Yu, X. Liu, X. Wang, et al, Vibration control of improved LQG for wheel drive electric vehicle based on uncertain parameters, Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering, vol. 235, no. 8, pp. 2253-2264, 2021. [CrossRef]
A. Florescu, A. Bratcu, I. Munteanu, et al, LQG optimal control applied to on-board energy management system of all-electric vehicles, IEEE Transactions on Control Systems Technology, vol. 23, no. 4, pp. 1427-1439, 2014. [CrossRef]
M. Basin, J. Rodriguez-Gonzalez, R. Martinez-Zuniga, Optimal control for linear systems with time delay in control input, Journal of The Franklin Institute, vol. 341, no. 3, pp. 267-278, 2004.
E. G. Strangas, G. Clerc and H. Razik,and A. Soualhi, Electrical safety-engineering of renewable energy systems, Wiley Press, 2021.
F. Cacace, F. Conte, A. Germani, Memoryless approach to the LQ and LQG problems with variable input delay, IEEE Transactions on Automatic Control, vol. 61, no. 1, pp. 216-221, 2015. [CrossRef]
N. Matni, J. C. Doyle, Optimal distributed LQG state feedback with varying communication delay, 52nd IEEE Conference on Decision and Control, pp. 5890-5896, 2013.
M. Basin M, J. Rodriguez-Gonzalez, Optimal control for linear systems with multiple time delays in control input, IEEE Transactions on Automatic Control, vol. 51, no. 1, pp. 91-97, 2006.
X. Lu, N. Wang, Q. Qi, et al, Further Study on Networked Control Systems with Unreliable Communication Channels, International Journal of Control Automation and Systems, vol. 19, no. 2, pp. 1-10, 2020.
H. Zhang, G. Duan, L. Xie, Linear quadratic regulation for linear time-varying systems with multiple input delays, Automatica, vol. 42, no. 9, pp. 1465-1476, 2006. [CrossRef]
V. Gupta, B. Hassibi, R. M. Murray, Optimal LQG control across packet-dropping links, Systems and Control Letters, vol. 56, no. 6, pp. 439-446, 2007. [CrossRef]
X. Liang, J. Xu, Control for networked control systems with remote and local controllers over unreliable communication channel, Automatica, vol. 98, pp. 86-94, 2018. [CrossRef]
X. Chen, Y. Zhu, Uncertain random linear quadratic control with multiplicative and additive noises, Asian Journal of Control, vol. 23, no. 6, pp. 2849-2864, 2021. [CrossRef]
H. Zhang, L. Li, J. Xu, et al, Linear quadratic regulation and stabilization of discrete-time systems with delay and multiplicative noise, IEEE Transactions on Automatic Control, vol. 60, no. 10, pp. 2599-2613, 2015.
X. Liang, J. Xu, H. Zhang, Discrete-time LQG control with input delay and multiplicative noise, IEEE Transactions on Aerospace and Electronic Systems, vol. 53, no. 6, pp. 3079-3090, 2017. [CrossRef]
X. Liang, J. Xu, H. Zhang, Optimal control and stabilization for networked control systems with packet dropout and input delay, IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 64, no. 9, pp. 1087-1091, 2016. [CrossRef]

Figure 1. The Optimal Controller

u_{k}

Figure 1. The Optimal Controller

u_{k}

Figure 2. The Non-optimal Controller

u_{k}

Figure 2. The Non-optimal Controller

u_{k}

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

MDPI Initiatives

Important Links

Choose an area of interest and we will send you notifications of new preprints at your preferred frequency.

Disclaimer

Optimal Control of Discrete Time-varying System with Multiple Delays and Multiplicative Noises

Abstract

1. Introduction

2. State Feedback Controller

3. Output Feedback Controller

4. Numerical examples

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A The proof of Lemma 1

Appendix B The proof of Theorem 1

Appendix C The proof of Lemma 2

References

MDPI Initiatives

Important Links

Subscribe