Tensor Conjugate-Gradient Methods with Automatic Determination of Regularization Parameters for Ill-Posed Problems with T-product

Abstract
This paper presents three types of tensor Conjugate-Gradient methods for solving large-scale linear discrete ill-posed problems based on the t-product between third-order tensors. An automatic determination strategy for a suitable regularization parameter is proposed for the tensor conjugate gradient (tCG) method. A truncated version and a preconditioned version of the tCG method are further presented. The discrepancy principle is employed to determine a suitable regularization parameter. Several numerical examples are given to show the effectiveness of the proposed tCG methods in image and video restoration.
Subject: Computer Science and Mathematics, Computational Mathematics

1. Introduction

Tensors are high-dimensional arrays that have many applications in science and engineering, including image, video and signal processing, computer vision, and network analysis [11,12,16,17,18,19,20,26]. A new t-product between third-order tensors was proposed by Kilmer et al. [1,2]. When working with high-dimensional data, the t-product shows greater potential than matricization; see [2,6,11,12,21,22,24,25,27]. The t-product has been found to have special value in many application fields, including image deblurring problems [1,6,11,12], image and video compression [26], facial recognition problems [2], etc.
In this paper, we consider the solution of large minimization problems of the form
$$\min_{\mathcal{X}\in\mathbb{R}^{m\times 1\times n}} \|\mathcal{A}*\mathcal{X}-\mathcal{B}\|_F, \qquad \mathcal{A}=[a_{ijk}]_{i,j,k=1}^{l,m,n}\in\mathbb{R}^{l\times m\times n},\quad \mathcal{B}\in\mathbb{R}^{l\times 1\times n}. \tag{1}$$
The Frobenius norms of the singular tubes of $\mathcal{A}$ decay rapidly to zero as the index increases; in particular, $\mathcal{A}$ has ill-determined tubal rank, and many of its singular tubes are nonvanishing but have tiny Frobenius norms of different orders of magnitude. Problems (1) with such a tensor are called tensor discrete linear ill-posed problems. They arise, for instance, in the restoration of color images and videos; see, e.g., [1,11,12]. Throughout this paper, the operation $*$ denotes the tensor t-product and $\|\cdot\|_F$ denotes the tensor Frobenius norm or, for matrices, the spectral matrix norm.
We assume that the observed tensor $\mathcal{B}\in\mathbb{R}^{l\times 1\times n}$ is contaminated by an error tensor $\mathcal{E}\in\mathbb{R}^{l\times 1\times n}$, i.e.,
$$\mathcal{B}=\mathcal{B}_{\mathrm{true}}+\mathcal{E}, \tag{2}$$
where $\mathcal{B}_{\mathrm{true}}\in\mathbb{R}^{l\times 1\times n}$ is the unknown, unavailable error-free tensor associated with $\mathcal{B}$. It is determined by $\mathcal{A}*\mathcal{X}_{\mathrm{true}}=\mathcal{B}_{\mathrm{true}}$, where $\mathcal{X}_{\mathrm{true}}$ denotes the exact solution of problem (1) that we would like to recover. We assume that an upper bound for the Frobenius norm of $\mathcal{E}$ is known, i.e.,
$$\|\mathcal{E}\|_F\le\delta. \tag{3}$$
A straightforward solution of (1) is usually a meaningless approximation of $\mathcal{X}_{\mathrm{true}}$ because, due to the ill-posedness of $\mathcal{A}=[a_{ijk}]_{i,j,k=1}^{l,m,n}$, the error $\mathcal{E}$ is severely amplified. In this paper we use Tikhonov regularization to reduce this effect and replace (1) by the penalized least-squares problem
$$\min_{\mathcal{X}\in\mathbb{R}^{m\times 1\times n}} \|\mathcal{A}*\mathcal{X}-\mathcal{B}\|_F^2+\mu\|\mathcal{X}\|_F^2, \tag{4}$$
where μ is a regularization parameter. We assume that
$$\mathcal{N}(\mathcal{A})\cap\mathcal{N}(\mathcal{I})=\{\mathcal{O}\}, \tag{5}$$
where $\mathcal{N}(\mathcal{A})$ denotes the null space of $\mathcal{A}$, $\mathcal{I}$ is the identity tensor, and $\mathcal{O}\in\mathbb{R}^{m\times 1\times n}$ is the lateral slice whose elements are all zero. The normal equation of the minimization problem (4) is
$$(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{X}=\mathcal{A}^T*\mathcal{B}, \tag{6}$$
and
$$\mathcal{X}_\mu=(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})^{-1}*\mathcal{A}^T*\mathcal{B} \tag{7}$$
is the unique solution of the Tikhonov minimization problem (4) under the assumption (5).
There are many techniques for determining the regularization parameter $\mu$, such as the L-curve criterion, generalized cross validation (GCV), and the discrepancy principle; we refer to [4,5,8,9,10] for more details. In this paper, the discrepancy principle is extended to tensors based on the t-product and is employed to determine a suitable $\mu$ in (4). The solution $\mathcal{X}_\mu$ of (4) is required to satisfy
$$\|\mathcal{A}*\mathcal{X}_\mu-\mathcal{B}\|_F\le\eta\delta, \tag{8}$$
where $\eta>1$ is a user-specified constant independent of $\delta$ in (3). As the noise level decreases, i.e., as $\delta\to 0$, the solution $\mathcal{X}_\mu$ determined in this way converges to $\mathcal{X}_{\mathrm{true}}$. For more details on the discrepancy principle, see, e.g., [7].
In this paper, we also consider the extension of the minimization problem (1) to problems of the form
$$\min_{\mathcal{X}\in\mathbb{R}^{m\times p\times n}} \|\mathcal{A}*\mathcal{X}-\mathcal{B}\|_F^2+\mu\|\mathcal{X}\|_F^2, \tag{9}$$
where $\mathcal{B}\in\mathbb{R}^{l\times p\times n}$ with $p>1$.
There are many methods for solving large-scale discrete linear ill-posed problems (1). Recently, a tensor Golub-Kahan bidiagonalization method [11] and a GMRES-type method [12] were introduced for solving large-scale linear ill-posed problems of the form (4). The randomized tensor singular value decomposition (rt-SVD) method in [3] was designed for very large data sets and is attractive for image data compression and analysis. Ugwu and Reichel [23] proposed a new randomized tensor singular value decomposition (R-tSVD), which improves on the truncated tensor singular value decomposition (T-tSVD) of [1]. Kilmer et al. [2] presented a tensor Conjugate-Gradient (t-CG) method for tensor linear systems $\mathcal{A}*\mathcal{X}=\mathcal{B}$ and the corresponding least-squares problems. The regularization parameter in the t-CG method is user-specified. In this paper, we further discuss the automatic determination of suitable regularization parameters for the tCG method by the discrepancy principle. The proposed method is called the tCG method with automatic determination of regularization parameters (auto-tCG). We also present a truncated auto-tCG method (auto-ttCG) that improves the auto-tCG method by reducing the computational work. Finally, a preconditioned version of the auto-ttCG method is proposed, abbreviated auto-ttpCG.
The rest of this paper is organized as follows. Section 2 introduces some symbols and preliminary knowledge that will be used in the context. Section 3 presents the auto-tCG, auto-ttCG and auto-ttpCG methods for solving the minimization problems (4) and (9). Section 4 gives several examples on image and video restoration and Section 5 draws some conclusions.

2. Preliminaries

This section gives some notation and definitions and briefly summarizes some results that will be used later. For a third-order tensor $\mathcal{A}\in\mathbb{R}^{l\times m\times n}$, Figure 1 shows the frontal slices $\mathcal{A}(:,:,k)$, lateral slices $\mathcal{A}(:,j,:)$ and tube fibers $\mathcal{A}(i,j,:)$. We abbreviate $A_k=\mathcal{A}(:,:,k)$ for simplicity. The operator $\mathrm{unfold}(\mathcal{A})$ produces an $ln\times m$ matrix, whereas the operator $\mathrm{fold}$ folds this matrix back into the tensor $\mathcal{A}$, i.e.,
$$\mathrm{unfold}(\mathcal{A})=\begin{bmatrix}A_1\\A_2\\\vdots\\A_n\end{bmatrix},\qquad \mathrm{fold}(\mathrm{unfold}(\mathcal{A}))=\mathcal{A}.$$
Definition 1. 
Let A R l × m × n , then a block-circulant matrix of A is denoted by bcirc ( A ) , i.e.,
$$\mathrm{bcirc}(\mathcal{A})=\begin{bmatrix}A_1&A_n&A_{n-1}&\cdots&A_2\\A_2&A_1&A_n&\cdots&A_3\\\vdots&\ddots&\ddots&\ddots&\vdots\\A_n&A_{n-1}&\cdots&A_2&A_1\end{bmatrix}.$$
Definition 2. ([1]) Given two tensors A R l × m × n and B R m × p × n , the t-product A B is defined as
$$\mathcal{A}*\mathcal{B}=\mathrm{fold}\big(\mathrm{bcirc}(\mathcal{A})\,\mathrm{unfold}(\mathcal{B})\big)=\mathcal{C}, \tag{10}$$
where C R l × p × n .
The following remarks will be used in Section 3.
Remark 1. ([14]) For suitable tensors A and B , it holds that
(1). $\mathrm{bcirc}(\mathcal{A}*\mathcal{B})=\mathrm{bcirc}(\mathcal{A})\,\mathrm{bcirc}(\mathcal{B})$.
(2). $\mathrm{bcirc}(\mathcal{A}^T)=\mathrm{bcirc}(\mathcal{A})^T$.
(3). $\mathrm{bcirc}(\mathcal{A}+\mathcal{B})=\mathrm{bcirc}(\mathcal{A})+\mathrm{bcirc}(\mathcal{B})$.
Let $F_n$ be the $n\times n$ unitary discrete Fourier transform matrix, i.e.,
$$F_n=\frac{1}{\sqrt{n}}\begin{bmatrix}1&1&1&\cdots&1\\1&\omega&\omega^2&\cdots&\omega^{n-1}\\1&\omega^2&\omega^4&\cdots&\omega^{2(n-1)}\\\vdots&\vdots&\vdots&&\vdots\\1&\omega^{n-1}&\omega^{2(n-1)}&\cdots&\omega^{(n-1)(n-1)}\end{bmatrix},$$
where $\omega=e^{-2\pi i/n}$. The tensor $\widehat{\mathcal{A}}$ is generated by applying the FFT along each tube of $\mathcal{A}$, i.e.,
$$\mathrm{bdiag}(\widehat{\mathcal{A}})=\begin{bmatrix}\widehat{A}_1&&&\\&\widehat{A}_2&&\\&&\ddots&\\&&&\widehat{A}_n\end{bmatrix}=(F_n\otimes I_l)\,\mathrm{bcirc}(\mathcal{A})\,(F_n^*\otimes I_m),$$
where $\otimes$ is the Kronecker product, $F_n^*$ is the conjugate transpose of $F_n$, and $\widehat{A}_i$ denotes the $i$th frontal slice of $\widehat{\mathcal{A}}$. Thus the t-product of $\mathcal{A}$ and $\mathcal{B}$ in (10) can be expressed as
$$\mathcal{A}*\mathcal{B}=\mathrm{fold}\Big((F_n^*\otimes I_l)\big[(F_n\otimes I_l)\,\mathrm{bcirc}(\mathcal{A})\,(F_n^*\otimes I_m)\big](F_n\otimes I_m)\,\mathrm{unfold}(\mathcal{B})\Big),$$
and (10) is reformulated as the block-diagonal system
$$\begin{bmatrix}\widehat{A}_1&&\\&\ddots&\\&&\widehat{A}_n\end{bmatrix}\begin{bmatrix}\widehat{B}_1\\\vdots\\\widehat{B}_n\end{bmatrix}=\begin{bmatrix}\widehat{C}_1\\\vdots\\\widehat{C}_n\end{bmatrix}.$$
This slice-wise form is easy to compute in MATLAB.
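As a concrete illustration, the following MATLAB sketch computes the t-product via the slice-wise form above, together with the tensor transpose used later in the paper. This is our own minimal sketch, not the authors' code; the function names tprod and ttran are ours, and real input tensors are assumed.

% Hedged sketch: t-product C = A * B computed slice-wise in the Fourier domain.
% A is l x m x n, B is m x p x n; function names are our own.
function C = tprod(A, B)
    n  = size(A, 3);
    Ah = fft(A, [], 3);                          % FFT along the tubes
    Bh = fft(B, [], 3);
    Ch = zeros(size(A, 1), size(B, 2), n);
    for k = 1:n
        Ch(:, :, k) = Ah(:, :, k) * Bh(:, :, k); % frontal-slice products
    end
    C = real(ifft(Ch, [], 3));                   % back-transform (real data)
end

% Tensor transpose A^T under the t-product [1]: transpose every frontal
% slice and reverse the order of slices 2, ..., n.
function At = ttran(A)
    [l, m, n] = size(A);
    At = zeros(m, l, n);
    At(:, :, 1) = A(:, :, 1)';
    for k = 2:n
        At(:, :, k) = A(:, :, n - k + 2)';
    end
end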
For a nonzero tensor $\mathcal{X}\in\mathbb{R}^{m\times 1\times n}$, we can decompose it in the form
$$\mathcal{X}=\mathcal{D}*d, \tag{14}$$
where $\mathcal{D}\in\mathbb{R}^{m\times 1\times n}$ is a normalized tensor (see, e.g., [6]) and $d\in\mathbb{R}^{1\times 1\times n}$ is a tube scalar. Algorithm 1 summarizes the decomposition (14).
Algorithm 1 Normalization
Input: X ∈ R^{m×1×n}, a nonzero tensor
Output: D, d with X = D∗d, ‖D‖ = 1
D ← fft(X,[ ],3)
for j = 1, 2, ..., n do
    d_j ← ‖D_j‖_2 (D_j is a vector)
    if d_j > tol then
        D_j ← (1/d_j) D_j
    else
        D_j ← randn(m,1); d_j ← ‖D_j‖_2; D_j ← (1/d_j) D_j; d_j ← 0
    end if
end for
D ← ifft(D,[ ],3); d ← ifft(d,[ ],3)
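A possible MATLAB realization of Algorithm 1 is sketched below; it is our own code (not the authors'), with tol an assumed small threshold, and mirrors the pseudocode line by line.

% Hedged sketch of Algorithm 1: decompose X = D * d with a normalized D.
% X is m x 1 x n; d is a 1 x 1 x n tube scalar; tol is a small threshold.
function [D, d] = tnormalize(X, tol)
    [m, ~, n] = size(X);
    D = fft(X, [], 3);
    d = zeros(1, 1, n);
    for j = 1:n
        dj = norm(D(:, 1, j));                   % 2-norm of the j-th slice
        if dj > tol
            D(:, 1, j) = D(:, 1, j) / dj;
        else                                     % (near-)zero slice: randomize
            D(:, 1, j) = randn(m, 1);
            D(:, 1, j) = D(:, 1, j) / norm(D(:, 1, j));
            dj = 0;
        end
        d(1, 1, j) = dj;
    end
    D = ifft(D, [], 3);
    d = ifft(d, [], 3);
end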
Given a tensor $\mathcal{A}\in\mathbb{R}^{l\times m\times n}$, the tensor singular value decomposition (tSVD) of $\mathcal{A}$ is expressed as
$$\mathcal{A}=\mathcal{U}*\mathcal{S}*\mathcal{V}^T,$$
where $\mathcal{U}\in\mathbb{R}^{l\times l\times n}$ and $\mathcal{V}\in\mathbb{R}^{m\times m\times n}$ are orthogonal under the t-product, and
$$\mathcal{S}=\mathrm{diag}[\mathbf{s}_1,\mathbf{s}_2,\dots,\mathbf{s}_{\min\{l,m\}}]\in\mathbb{R}^{l\times m\times n}$$
is an f-diagonal tensor (each frontal slice is a diagonal matrix) whose singular tubes $\mathbf{s}_j$ satisfy
$$\|\mathbf{s}_1\|_F\ge\|\mathbf{s}_2\|_F\ge\cdots\ge\|\mathbf{s}_{\min\{l,m\}}\|_F.$$
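The tSVD can be computed slice-wise in the Fourier domain. The following is only a minimal sketch of this standard construction from [1], written by us (the name tsvd is ours):

% Hedged sketch of the tSVD A = U * S * V^T via frontal-slice SVDs.
function [U, S, V] = tsvd(A)
    [l, m, n] = size(A);
    Ah = fft(A, [], 3);
    U = zeros(l, l, n); S = zeros(l, m, n); V = zeros(m, m, n);
    for k = 1:n
        [U(:, :, k), S(:, :, k), V(:, :, k)] = svd(Ah(:, :, k));
    end
    U = real(ifft(U, [], 3));   % orthogonal under the t-product
    S = real(ifft(S, [], 3));   % f-diagonal; singular tubes on the diagonal
    V = real(ifft(V, [], 3));
end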
The operators squeeze and twist [13] are expressed by
$$X=\mathrm{squeeze}(\mathcal{X}_j)\ \Longleftrightarrow\ X(i,j)=\mathcal{X}(i,1,j),\qquad \mathrm{twist}(\mathrm{squeeze}(\mathcal{X}))=\mathcal{X}.$$
Figure 2 illustrates the transformation between a matrix and a tensor column by means of squeeze and twist. More generally, the operators multi_squeeze and multi_twist are defined for third-order tensors. For a tensor $\mathcal{D}\in\mathbb{R}^{m\times p\times n}$ with $p>1$, $\mathcal{C}=\mathrm{multi\_squeeze}(\mathcal{D})$ means that all lateral slices of $\mathcal{D}$ are squeezed and stacked as frontal slices of $\mathcal{C}$; the operator multi_twist is the inverse of multi_squeeze, so that $\mathrm{multi\_twist}(\mathrm{multi\_squeeze}(\mathcal{D}))=\mathcal{D}$. We refer to Table 1 for more notation and definitions.
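In MATLAB, squeeze and twist for a lateral slice reduce to a reshape; a small hedged sketch (our own, assuming Xj is an m x 1 x n lateral slice):

% Hedged sketch: flatten a lateral slice to a matrix and back.
Xj   = randn(5, 1, 7);                       % example lateral slice
Xmat = squeeze(Xj);                          % m x n matrix, Xmat(i,k) = Xj(i,1,k)
Xj2  = reshape(Xmat, size(Xmat,1), 1, size(Xmat,2));   % "twist": back to m x 1 x n
% multi_squeeze / multi_twist apply the same idea slice by slice to m x p x n tensors.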

3. Tensor Conjugate-Gradient methods

This section first discusses the automatic determination of a suitable regularization parameter for the tensor conjugate gradient (tCG) method presented by Kilmer et al. in [13]. We abbreviate the improved method as auto-tCG. A truncated auto-tCG method, abbreviated auto-ttCG, is then developed to improve the auto-tCG method. Finally, a preconditioned version of the auto-ttCG method is presented, abbreviated auto-ttpCG.

3.1. The auto-tCG Method

The tensor Conjugate-Gradient (t-CG) method was presented in [2] for the least-squares solution of the tensor linear system $\mathcal{A}*\mathcal{X}=\mathcal{B}$. The regularization parameter in the t-CG method was not discussed there and is user-specified. This subsection improves the t-CG method by employing the discrepancy principle, under the assumption (3), to determine a suitable regularization parameter, and uses it to solve the normal equation (6). We consider the geometric sequence of parameters
$$\mu_k=\mu_0 q^k,\qquad k=0,1,\dots,$$
where $q\in(0,1)$. We set $\mu_0=\|\mathcal{A}\|_F$ and obtain a suitable regularization parameter by successively reducing the parameter. An effective way to deal with the general problem (9) is to regard it as $p$ independent subproblems of the form (4), i.e.,
$$\min_{\mathcal{X}_j\in\mathbb{R}^{m\times 1\times n}} \|\mathcal{A}*\mathcal{X}_j-\mathcal{B}_j\|_F^2+\mu\|\mathcal{X}_j\|_F^2,\qquad j=1,\dots,p,$$
where $\mathcal{B}_j$ is the $j$th tensor column (lateral slice) of $\mathcal{B}$ and is contaminated by the noise $\mathcal{E}_j$, and $\mathcal{B}_{j,\mathrm{true}}$ denotes the corresponding unknown error-free tensor. We assume that the noise tensor
$$\mathcal{E}_j=\mathcal{B}_j-\mathcal{B}_{j,\mathrm{true}}$$
is available, or that its norm can be estimated, i.e.,
$$\|\mathcal{E}_j\|_F\le\delta_j,\qquad j=1,\dots,p.$$
Algorithm 2 summarizes the auto-tCG method for solving (9). The initial tensor of Algorithm 2 is the zero tensor. The inner iteration is stopped when the Frobenius norm of the residual tensor
$$\mathcal{R}^i_{j,\mu_k}=\mathcal{A}^T*\mathcal{B}_j-(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{X}^i_{j,\mu_k}$$
is small enough, where $\mathcal{R}^i_{j,\mu_k}$ denotes the residual generated by the $i$th iterate $\mathcal{X}^i_{j,\mu_k}$ of the normal equation with $\mu_k$ for the $j$th independent subproblem. The solution $\mathcal{X}_{\mathrm{int}}=\mathcal{X}^*_{\mu_k}$ is used as the initial tensor for the normal equation with $\mu_{k+1}$. For $\mu=\mu_k$ and $m$ iterations of the CG process, the affine search space is $\mathcal{X}^0_{\mu_k}+\mathbb{K}_m\big(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I},\,\mathcal{R}^0_{\mu_k}\big)$, where $\mathcal{R}^0_{\mu_k}=\mathcal{A}^T*\mathcal{B}-(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{X}^0_{\mu_k}$.
Algorithm 2 The auto-tCG method for solving (9).
Input: A ∈ R^{m×m×n}, B_j ∈ R^{m×1×n}, δ_j, j = 1,...,p, μ_0, q ∈ (0,1), η > 1, tol.
Output: Approximate solution X^* of problem (9).
for j = 1, 2, ..., p do
    X_int = 0, k = 0.
    while ‖A∗X^*_{j,μ_k} − B_j‖_F^2 > η^2 δ_j^2 do
        k = k + 1, μ_k = μ_0 q^k; solve (A^T∗A + μ_k I)∗X_j = A^T∗B_j as follows.
        [R_0, a] ← Normalize(A^T∗B_j − (A^T∗A + μ_k I)∗X_int); P_0 ← R_0.
        i = 0, σ = 10·tol.
        while σ > tol do
            i = i + 1.
            c = (P_{i−1}^T∗(A^T∗A + μ_k I)∗P_{i−1})^{−1}∗(R_{i−1}^T∗R_{i−1}).
            X_i = X_{i−1} + P_{i−1}∗c.
            R_i = R_{i−1} − (A^T∗A + μ_k I)∗P_{i−1}∗c.
            σ = | ‖R_i‖_F − ‖R_{i−1}‖_F |.
            d = (R_{i−1}^T∗R_{i−1})^{−1}∗(R_i^T∗R_i).
            P_i = R_i + P_{i−1}∗d.
        end while
        X^*_{j,μ_k} = X_i∗a   (X^*_{j,μ_k} is the solution of the normal equation with μ_k for the j-th independent subproblem (4)).
        X_int = X^*_{j,μ_k}.
    end while
    X^*(:, j, :) = X^*_{j,μ_k}.
end for
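The MATLAB fragment below sketches the outer loop of Algorithm 2 for a single lateral slice B_j. It is only an illustration under simplifying assumptions: it uses scalar CG coefficients instead of the tube-scalar coefficients c and d of Algorithm 2, omits the Normalize step, and relies on the tprod/ttran helpers sketched in Section 2; the quantities delta_j, eta, q and tol are assumed to be given.

% Hedged sketch of the discrepancy-principle loop of Algorithm 2 (scalar CG).
mu  = norm(A(:));                            % mu_0 = ||A||_F
X   = zeros(size(A, 2), 1, size(A, 3));      % X_int = 0
At  = ttran(A);  ATB = tprod(At, Bj);
res = @(X) tprod(A, X) - Bj;                 % data-fit residual
while norm(reshape(res(X), [], 1))^2 > eta^2 * delta_j^2
    mu = q * mu;                             % mu_k = mu_0 * q^k
    R  = ATB - (tprod(At, tprod(A, X)) + mu * X);   % residual of the normal equation (6)
    P  = R;  sigma = inf;
    while sigma > tol                        % CG on the normal equation
        AP    = tprod(At, tprod(A, P)) + mu * P;
        alpha = (R(:)' * R(:)) / (P(:)' * AP(:));
        X     = X + alpha * P;
        Rnew  = R - alpha * AP;
        sigma = abs(norm(Rnew(:)) - norm(R(:)));
        beta  = (Rnew(:)' * Rnew(:)) / (R(:)' * R(:));
        P     = Rnew + beta * P;
        R     = Rnew;
    end
end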

3.2. The truncated tensor Conjugate-Gradient method

Frommer and Maass [15] proposed a condition that can be used to recognize, at an early stage, values of $\mu$ that are not suitable. We introduce this condition to improve Algorithm 2 by excluding unsuitable values of $\mu$, and present a truncated tensor conjugate-gradient method for solving (9). We first give the following result.
Theorem 1. 
Given $\mathcal{A}\in\mathbb{R}^{l\times m\times n}$, define the t-linear operator $T:\mathbb{R}^{m\times 1\times n}\to\mathbb{R}^{l\times 1\times n}$ by $T(\mathcal{X})=\mathcal{A}*\mathcal{X}$ for $\mathcal{X}\in\mathbb{R}^{m\times 1\times n}$. Let $\mathcal{X}_\mu^*$ be the exact solution of the normal equations
$$(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{X}=\mathcal{A}^T*\mathcal{B}.$$
Then for an arbitrary $\mathcal{X}\in\mathbb{R}^{m\times 1\times n}$ we have
$$\|\mathcal{A}*\mathcal{X}_\mu^*-\mathcal{B}\|_F^2\ \ge\ \|\mathcal{A}*\mathcal{X}-\mathcal{B}\|_F^2-\frac{1}{4\mu}\,\|\mathcal{A}^T*\mathcal{B}-(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{X}\|_F^2.$$
Proof. 
For an arbitrary $\mathcal{X}\in\mathbb{R}^{m\times 1\times n}$, set $\mathcal{Z}=\mathcal{X}_\mu^*-\mathcal{X}$. Let the tSVD of $\mathcal{A}$ be $\mathcal{A}=\mathcal{U}*\mathcal{S}*\mathcal{V}^T$; then
$$\mathcal{A}*\mathcal{Z}=\mathcal{U}*\mathcal{S}*\mathcal{V}^T*\mathcal{Z}.$$
Suppose $\mathcal{V}^T*\mathcal{Z}=\mathcal{D}\in\mathbb{R}^{m\times 1\times n}$; then
$$\|\mathcal{A}*\mathcal{Z}\|_F^2=\|\mathcal{U}*\mathcal{S}*\mathcal{V}^T*\mathcal{Z}\|_F^2=\|\mathcal{S}*\mathcal{D}\|_F^2=\|\mathrm{bcirc}(\mathcal{S})\,\mathrm{unfold}(\mathcal{D})\|_2^2.$$
Thus
$$\begin{aligned}
\|(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{Z}\|_F^2
&=\|\mathcal{V}*(\mathcal{S}^T*\mathcal{S}+\mu\mathcal{I})*\mathcal{V}^T*\mathcal{Z}\|_F^2
=\|(\mathcal{S}^T*\mathcal{S}+\mu\mathcal{I})*\mathcal{D}\|_F^2\\
&=\|(\mathrm{bcirc}(\mathcal{S}^T*\mathcal{S})+\mu\,\mathrm{bcirc}(\mathcal{I}))\,\mathrm{unfold}(\mathcal{D})\|_2^2
=\|(\mathrm{bcirc}(\mathcal{S})^T\mathrm{bcirc}(\mathcal{S})+\mu\,\mathrm{bcirc}(\mathcal{I}))\,\mathrm{unfold}(\mathcal{D})\|_2^2.
\end{aligned}$$
Denote $\mathrm{bcirc}(\mathcal{S})=S\in\mathbb{R}^{nl\times nm}$, $\mathrm{bcirc}(\mathcal{I})=I\in\mathbb{R}^{nm\times nm}$ and $\mathrm{unfold}(\mathcal{D})=d\in\mathbb{R}^{nm\times 1}$; then $\|\mathcal{A}*\mathcal{Z}\|_F^2=\|Sd\|_2^2$ and $\|(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{Z}\|_F^2=\|(S^TS+\mu I)d\|_2^2$. Thus the tensor norms are transformed into equivalent matrix norms. Let the singular value decomposition of $S$ be $S=U\Sigma V^T$, where $\Sigma=\mathrm{diag}(\sigma_1,\sigma_2,\dots,\sigma_r)$, $r\le\min\{nl,nm\}$, and $U=[u_1,u_2,\dots,u_r]$, $V=[v_1,v_2,\dots,v_r]$ have orthonormal columns $u_k\in\mathbb{R}^{nl}$ and $v_k\in\mathbb{R}^{nm}$, respectively. Thus
$$Sd=\sum_{\sigma_k>0}\sigma_k\langle d,v_k\rangle u_k.$$
Using the identity $s^2=(s+\mu s^{-1})^{-2}(s^2+\mu)^2$ together with the estimate
$$\frac{1}{s+\mu s^{-1}}\le\frac{1}{2\sqrt{\mu}},\qquad s,\mu>0,$$
which follows from $s+\mu s^{-1}\ge 2\sqrt{\mu}$, we have
$$\|Sd\|_2^2=\sum_{\sigma_k>0}\sigma_k^2\langle d,v_k\rangle^2=\sum_{\sigma_k>0}(\sigma_k+\mu\sigma_k^{-1})^{-2}(\sigma_k^2+\mu)^2\langle d,v_k\rangle^2\le\frac{1}{4\mu}\sum_{\sigma_k>0}(\sigma_k^2+\mu)^2\langle d,v_k\rangle^2. \tag{19}$$
Note that
$$\|(S^TS+\mu I)d\|_2^2=\sum_{\sigma_k>0}(\sigma_k^2+\mu)^2\langle d,v_k\rangle^2. \tag{20}$$
It results from (19) and (20) that
$$\|Sd\|_2^2\le\frac{1}{4\mu}\|(S^TS+\mu I)d\|_2^2. \tag{21}$$
Since $\|\mathcal{A}*\mathcal{Z}\|_F^2=\|Sd\|_2^2$ and $\|(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{Z}\|_F^2=\|(S^TS+\mu I)d\|_2^2$, we have
$$\|\mathcal{A}*\mathcal{Z}\|_F^2\le\frac{1}{4\mu}\|(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{Z}\|_F^2. \tag{22}$$
Thus
$$\|\mathcal{A}*\mathcal{X}_\mu^*-\mathcal{B}\|_F^2=\|\mathcal{A}*\mathcal{X}-\mathcal{B}+\mathcal{A}*(\mathcal{X}_\mu^*-\mathcal{X})\|_F^2\ge\|\mathcal{A}*\mathcal{X}-\mathcal{B}\|_F^2-\|\mathcal{A}*\mathcal{Z}\|_F^2\ge\|\mathcal{A}*\mathcal{X}-\mathcal{B}\|_F^2-\frac{1}{4\mu}\|(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{Z}\|_F^2.$$
Note that
$$(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{Z}=(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*(\mathcal{X}_\mu^*-\mathcal{X})=\mathcal{A}^T*\mathcal{B}-(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{X}; \tag{23}$$
then (23) and (22) give
$$\|\mathcal{A}*\mathcal{X}_\mu^*-\mathcal{B}\|_F^2\ge\|\mathcal{A}*\mathcal{X}-\mathcal{B}\|_F^2-\frac{1}{4\mu}\|\mathcal{A}^T*\mathcal{B}-(\mathcal{A}^T*\mathcal{A}+\mu\mathcal{I})*\mathcal{X}\|_F^2.$$
   □
We apply Theorem 1 to predict in advance whether the exact solution $\mathcal{X}^*_{\mu_k}$ can satisfy the discrepancy principle in Algorithm 2. We add the condition
$$\|\mathcal{A}*\mathcal{X}^i_{\mu_k}-\mathcal{B}\|_F^2-\frac{1}{4\mu_k}\|\mathcal{R}^i_{\mu_k}\|_F^2>\eta^2\delta^2 \tag{24}$$
to the inner iteration (steps 9-16) of Algorithm 2. If the $i$th iterate of the normal equation with $\mu_k$ is $\mathcal{X}^i_{\mu_k}$ and its residual $\mathcal{R}^i_{\mu_k}$ satisfies (24), then $\|\mathcal{A}*\mathcal{X}^*_{\mu_k}-\mathcal{B}\|_F^2>\eta^2\delta^2$ by Theorem 1. This indicates that the exact solution of the normal equation with $\mu_k$ does not satisfy the discrepancy principle, so we continue with the next normal equation, with $\mu_{k+1}$. In this way we obtain a truncated tensor conjugate-gradient method with automatic determination of a suitable regularization parameter, abbreviated auto-ttCG. Algorithm 3 summarizes the auto-ttCG method.
Algorithm 3 The auto-ttCG method for solving (9)
Input: A ∈ R^{m×m×n}, B_j ∈ R^{m×1×n}, δ_j, j = 1,...,p, μ_0, q ∈ (0,1), η > 1, tol.
Output: Approximate solution X^* of problem (9).
for j = 1, 2, ..., p do
    X_int = 0, k = 0.
    while ‖A∗X^i_{j,μ_k} − B_j‖_F^2 > η^2 δ_j^2 do
        k = k + 1, μ_k = μ_0 q^k; solve (A^T∗A + μ_k I)∗X_j = A^T∗B_j as follows.
        [R_0, a] ← Normalize(A^T∗B_j − (A^T∗A + μ_k I)∗X_int); P_0 ← R_0.
        i = 0, σ = 10·tol, X^0_{j,μ_k} = X_int.
        while σ > tol and ‖A∗X^i_{j,μ_k} − B_j‖_F^2 − (1/(4μ_k))‖R_i∗a‖_F^2 < η^2 δ_j^2 do
            i = i + 1.
            c = (P_{i−1}^T∗(A^T∗A + μ_k I)∗P_{i−1})^{−1}∗(R_{i−1}^T∗R_{i−1}).
            X_i = X_{i−1} + P_{i−1}∗c, X^i_{j,μ_k} = X_i∗a.
            R_i = R_{i−1} − (A^T∗A + μ_k I)∗P_{i−1}∗c.
            σ = | ‖R_i‖_F − ‖R_{i−1}‖_F |.
            d = (R_{i−1}^T∗R_{i−1})^{−1}∗(R_i^T∗R_i).
            P_i = R_i + P_{i−1}∗d.
        end while
        X_int = X^i_{j,μ_k}.
    end while
    X^*(:, j, :) = X^i_{j,μ_k}.
end for
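In an implementation, the truncation test derived from (24) can be placed in the inner while condition. The following hedged fragment (our own, using the same simplified quantities as in the sketch after Algorithm 2) illustrates the idea:

% Hedged sketch of the test (24): if even the lower bound of Theorem 1
% exceeds eta^2 * delta_j^2, the exact solution for this mu_k cannot satisfy
% the discrepancy principle, so the CG iteration for mu_k is abandoned.
resX  = tprod(A, X) - Bj;
bound = norm(resX(:))^2 - norm(R(:))^2 / (4 * mu);
if bound > eta^2 * delta_j^2
    break;                                   % move on to mu_{k+1}
end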

3.3. A preconditioned truncated tensor Conjugate-Gradient method

In this section, we consider accelerating Algorithm 3 by preconditioning. When a tensor $\mathcal{M}$ is symmetric positive definite under the t-product structure, its (approximate) tensor Cholesky factorization (tChol) can be computed by Algorithm 4.
Algorithm 4 Tensor Cholesky decomposition (tChol)
Input: M ∈ R^{m×m×n}, symmetric positive definite under the t-product.
Output: H ∈ R^{m×m×n} with M = H∗H^T.
M̂ ← fft(M,[ ],3)
for j = 1, 2, ..., n do
    H ← chol(M̂(:,:,j)), where H is the lower triangular factor obtained from the (approximate) Cholesky decomposition.
    Ĥ(:,:,j) ← H.
end for
H ← ifft(Ĥ,[ ],3).
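A minimal MATLAB sketch of Algorithm 4 is given below; it is our own code (the name tchol is ours) and uses MATLAB's chol with the 'lower' option so that the lower triangular factor is obtained directly.

% Hedged sketch of tChol: M = H * H^T for a symmetric positive definite M.
function H = tchol(M)
    n  = size(M, 3);
    Mh = fft(M, [], 3);
    Hh = complex(zeros(size(M)));
    for j = 1:n
        Hh(:, :, j) = chol(Mh(:, :, j), 'lower');   % slice-wise Cholesky
    end
    H = real(ifft(Hh, [], 3));
end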
In Algorithm 3, the coefficient tensor $\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I}$ of the $k$th normal equation
$$(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{X}=\mathcal{A}^T*\mathcal{B} \tag{25}$$
is symmetric positive definite. We set $\mathcal{M}=\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I}$ and apply Algorithm 4 to obtain the factorization $\mathcal{M}=\mathcal{H}*\mathcal{H}^T$, where each frontal slice of $\widehat{\mathcal{H}}$ is a lower triangular matrix. After preconditioning the normal equation (25) with $\mathcal{H}$, we solve the preconditioned normal equations
$$\widetilde{\mathcal{A}}*\widetilde{\mathcal{X}}=\widetilde{\mathcal{B}} \tag{26}$$
instead of (25) in Algorithm 3, where $\widetilde{\mathcal{A}}=\mathcal{H}^{-1}*(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{H}^{-T}$, $\widetilde{\mathcal{X}}=\mathcal{H}^T*\mathcal{X}$ and $\widetilde{\mathcal{B}}=\mathcal{H}^{-1}*\mathcal{A}^T*\mathcal{B}$.
We apply Algorithm 3 to (26) instead of (25). Let $\mathcal{X}_i$ and $\widetilde{\mathcal{X}}_i$ denote the iterates for (25) and (26), respectively. Then
$$\widetilde{\mathcal{R}}_i=\widetilde{\mathcal{B}}-\widetilde{\mathcal{A}}*\widetilde{\mathcal{X}}_i=\mathcal{H}^{-1}*\mathcal{A}^T*\mathcal{B}-\big(\mathcal{H}^{-1}*(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{H}^{-T}\big)*\mathcal{H}^T*\mathcal{X}_i=\mathcal{H}^{-1}*\big(\mathcal{A}^T*\mathcal{B}-(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{X}_i\big)=\mathcal{H}^{-1}*\mathcal{R}_i.$$
Let $\mathcal{W}_i=\mathcal{H}^{-1}*\mathcal{R}_i$ and $\widetilde{\mathcal{P}}_{i-1}=\mathcal{H}^T*\mathcal{P}_{i-1}$; then
$$\tilde d=(\widetilde{\mathcal{R}}_{i-1}^T*\widetilde{\mathcal{R}}_{i-1})^{-1}*(\widetilde{\mathcal{R}}_i^T*\widetilde{\mathcal{R}}_i)=\big((\mathcal{H}^{-1}*\mathcal{R}_{i-1})^T*\mathcal{H}^{-1}*\mathcal{R}_{i-1}\big)^{-1}*\big((\mathcal{H}^{-1}*\mathcal{R}_i)^T*\mathcal{H}^{-1}*\mathcal{R}_i\big)=(\mathcal{W}_{i-1}^T*\mathcal{W}_{i-1})^{-1}*(\mathcal{W}_i^T*\mathcal{W}_i),$$
and
$$\tilde c=(\widetilde{\mathcal{P}}_{i-1}^T*\widetilde{\mathcal{A}}*\widetilde{\mathcal{P}}_{i-1})^{-1}*(\widetilde{\mathcal{R}}_{i-1}^T*\widetilde{\mathcal{R}}_{i-1})=\big((\mathcal{H}^T*\mathcal{P}_{i-1})^T*\mathcal{H}^{-1}*(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{P}_{i-1}\big)^{-1}*(\mathcal{W}_{i-1}^T*\mathcal{W}_{i-1})=\big(\mathcal{P}_{i-1}^T*(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{P}_{i-1}\big)^{-1}*(\mathcal{W}_{i-1}^T*\mathcal{W}_{i-1}).$$
In addition, we have the iteration
$$\widetilde{\mathcal{X}}_i=\widetilde{\mathcal{X}}_{i-1}+\widetilde{\mathcal{P}}_{i-1}*\tilde c\ \Longleftrightarrow\ \mathcal{H}^T*\mathcal{X}_i=\mathcal{H}^T*\mathcal{X}_{i-1}+\mathcal{H}^T*\mathcal{P}_{i-1}*\tilde c\ \Longleftrightarrow\ \mathcal{X}_i=\mathcal{X}_{i-1}+\mathcal{P}_{i-1}*\tilde c,$$
and
$$\widetilde{\mathcal{R}}_i=\widetilde{\mathcal{R}}_{i-1}-\widetilde{\mathcal{A}}*\widetilde{\mathcal{P}}_{i-1}*\tilde c\ \Longleftrightarrow\ \mathcal{H}^{-1}*\mathcal{R}_i=\mathcal{H}^{-1}*\mathcal{R}_{i-1}-\mathcal{H}^{-1}*(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{H}^{-T}*\mathcal{H}^T*\mathcal{P}_{i-1}*\tilde c\ \Longleftrightarrow\ \mathcal{R}_i=\mathcal{R}_{i-1}-(\mathcal{A}^T*\mathcal{A}+\mu_k\mathcal{I})*\mathcal{P}_{i-1}*\tilde c,$$
together with
$$\widetilde{\mathcal{P}}_i=\widetilde{\mathcal{R}}_i+\widetilde{\mathcal{P}}_{i-1}*\tilde d\ \Longleftrightarrow\ \mathcal{H}^T*\mathcal{P}_i=\mathcal{H}^{-1}*\mathcal{R}_i+\mathcal{H}^T*\mathcal{P}_{i-1}*\tilde d\ \Longleftrightarrow\ \mathcal{P}_i=\mathcal{H}^{-T}*\mathcal{H}^{-1}*\mathcal{R}_i+\mathcal{P}_{i-1}*\tilde d=\mathcal{H}^{-T}*\mathcal{W}_i+\mathcal{P}_{i-1}*\tilde d.$$
Incorporating the preconditioning relations above into Algorithm 3, we obtain an improved auto-ttCG method, called the truncated tensor preconditioned conjugate-gradient method with automatic determination of a suitable regularization parameter, abbreviated auto-ttpCG. Algorithm 5 summarizes the auto-ttpCG method. The numerical experiments in Section 4 show that Algorithm 5 converges faster than Algorithm 3.
Algorithm 5 The auto-ttpCG method for solving (9)
Input: A ∈ R^{m×m×n}, B_j ∈ R^{m×1×n}, δ_j, j = 1,...,p, μ_0, q ∈ (0,1), η > 1, tol.
Output: Approximate solution X^* of problem (9).
for j = 1, 2, ..., p do
    X_int = 0, k = 0.
    while ‖A∗X^i_{j,μ_k} − B_j‖_F^2 > η^2 δ_j^2 do
        k = k + 1, μ_k = μ_0 q^k.
        H = tChol(A^T∗A + μ_k I).
        [R_0, a] ← Normalize(A^T∗B_j − (A^T∗A + μ_k I)∗X_int).
        W_0 = H^{−1}∗R_0, P_0 = H^{−T}∗W_0.
        i = 0, σ = 10·tol, X^0_{j,μ_k} = X_int.
        while σ > tol and ‖A∗X^i_{j,μ_k} − B_j‖_F^2 − (1/(4μ_k))‖R_i∗a‖_F^2 < η^2 δ_j^2 do
            i = i + 1.
            c̃ = (P_{i−1}^T∗(A^T∗A + μ_k I)∗P_{i−1})^{−1}∗(W_{i−1}^T∗W_{i−1}).
            X_i = X_{i−1} + P_{i−1}∗c̃, X^i_{j,μ_k} = X_i∗a.
            R_i = R_{i−1} − (A^T∗A + μ_k I)∗P_{i−1}∗c̃, W_i = H^{−1}∗R_i.
            σ = | ‖R_i‖_F − ‖R_{i−1}‖_F |.
            d̃ = (W_{i−1}^T∗W_{i−1})^{−1}∗(W_i^T∗W_i).
            P_i = H^{−T}∗W_i + P_{i−1}∗d̃.
        end while
        X_int = X^i_{j,μ_k}.
    end while
    X^*(:, j, :) = X^i_{j,μ_k}.
end for
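Products with H^{-1} and H^{-T} in Algorithm 5 need not form inverses explicitly; they can be applied by slice-wise triangular solves in the Fourier domain. A hedged sketch of such a helper (our own code, not the authors'; the name tsolve_lower is ours):

% Hedged sketch: apply W = H^{-1} * R (t-product inverse) by solving
% triangular systems with the frontal slices of H in the Fourier domain.
function W = tsolve_lower(H, R)
    n  = size(H, 3);
    Hh = fft(H, [], 3);  Rh = fft(R, [], 3);
    Wh = complex(zeros(size(R)));
    for k = 1:n
        Wh(:, :, k) = Hh(:, :, k) \ Rh(:, :, k);   % lower triangular solve
    end
    W = real(ifft(Wh, [], 3));
end
% H^{-T} * R can be applied analogously with the conjugate-transposed slices:
% Wh(:, :, k) = Hh(:, :, k)' \ Rh(:, :, k).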

4. Numerical Examples

This section presents three examples that illustrate the application of Algorithms 2, 3 and 5 to the restoration of images and videos. All computations were carried out in MATLAB R2018a on a computer with an Intel Core i7 processor and 16 GB of RAM.
Suppose X k is the k-th approximate solution to the minimization problem (9). The quality of the approximate solution X k is defined by the relative error
$$\mathrm{Err}_k=\frac{\|\mathcal{X}_k-\mathcal{X}_{\mathrm{true}}\|_F}{\|\mathcal{X}_{\mathrm{true}}\|_F},$$
and the signal-to-noise ratio (SNR)
$$\mathrm{SNR}(\mathcal{X}_k)=10\log_{10}\frac{\|\mathcal{X}_{\mathrm{true}}-E(\mathcal{X}_{\mathrm{true}})\|_F^2}{\|\mathcal{X}_k-\mathcal{X}_{\mathrm{true}}\|_F^2},$$
where $\mathcal{X}_{\mathrm{true}}$ denotes the uncontaminated data tensor and $E(\mathcal{X}_{\mathrm{true}})$ is its average gray level. The observed data $\mathcal{B}$ in (9) is contaminated by a "noise" tensor $\mathcal{E}$, i.e., $\mathcal{B}=\mathcal{B}_{\mathrm{true}}+\mathcal{E}$, where $\mathcal{E}$ is determined as follows. Let $\mathcal{E}_j$ be the $j$th lateral slice of $\mathcal{E}$, whose entries are scaled and normally distributed with zero mean, i.e.,
$$\mathcal{E}_j=\nu\,\frac{\mathcal{E}_{r,j}}{\|\mathcal{E}_{r,j}\|_F}\,\|\mathcal{B}_{\mathrm{true},j}\|_F,\qquad j=1,\dots,p, \tag{36}$$
where the entries of $\mathcal{E}_{r,j}$ are drawn from the standard normal distribution N(0,1).
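For reference, the noise tensor (36) and the quality measures can be generated as in the following hedged MATLAB fragment; this is our own code, and the tensors Btrue, Xtrue and the approximation Xk are assumed to be available.

% Hedged sketch: add noise of level nu to each lateral slice of Btrue (36)
% and evaluate the relative error and SNR of an approximation Xk.
nu = 1e-3;
[l, p, n] = size(Btrue);
E = zeros(l, p, n);
for j = 1:p
    Er = randn(l, 1, n);                            % entries from N(0,1)
    E(:, j, :) = nu * Er / norm(Er(:)) * norm(reshape(Btrue(:, j, :), [], 1));
    % note: delta_j = nu * ||Btrue(:,j,:)||_F can be used in the discrepancy principle
end
B = Btrue + E;                                      % observed data

relerr = norm(Xk(:) - Xtrue(:)) / norm(Xtrue(:));   % relative error
snr    = 10 * log10(norm(Xtrue(:) - mean(Xtrue(:)))^2 / norm(Xk(:) - Xtrue(:))^2);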
Example 4.1 (Gray image). This example considers the restoration of the blurred and noisy 256 × 256 cameraman image, stored as a tensor column of size 256 × 1 × 256. For the blurring operator $\mathcal{A}$, the frontal slices $\mathcal{A}(:,:,i)$, $i=1,\dots,256$, are generated by using the MATLAB function blur, i.e.,
$$z=\big[\exp\!\big(-\tfrac{(0:\mathrm{band}-1).^2}{2\sigma^2}\big),\ \mathrm{zeros}(1,N-\mathrm{band})\big],\qquad A=\frac{1}{\sigma\sqrt{2\pi}}\,\mathrm{toeplitz}\big([z(1),\ \mathrm{fliplr}(z(2\!:\!\mathrm{end}))],\,z\big),\qquad \mathcal{A}(:,:,i)=A(i,1)\,A,$$
with $N=256$, $\sigma=4$ and $\mathrm{band}=12$. The condition numbers of the frontal slices are $\mathrm{cond}(\mathcal{A}(:,:,1))=\mathrm{cond}(\mathcal{A}(:,:,246))=\dots=\mathrm{cond}(\mathcal{A}(:,:,256))=11.1559$, while the condition numbers of the remaining slices are infinite. Let $X_{\mathrm{true}}$ denote the original uncontaminated cameraman image. The operator twist converts $X_{\mathrm{true}}$ into the tensor column $\mathcal{X}_{\mathrm{true}}\in\mathbb{R}^{256\times 1\times 256}$ for storage. The noise tensor $\mathcal{E}$ is generated by (36) with the noise levels $\nu=10^{-i}$, $i=2,3$. The blurred and noisy image is generated by $\mathcal{B}=\mathcal{A}*\mathcal{X}_{\mathrm{true}}+\mathcal{E}$.
The auto-tCG, auto-ttCG and auto-ttpCG methods are used to solve the tensor discrete linear ill-posed problem (1). The discrepancy principle is employed to determine a suitable regularization parameter, using $\mu_k=\mu_0 q^k$ with $\mu_0=\|\mathcal{A}\|_F$ and $q=\tfrac12$. We set $\eta=1.05$ in (8).
Figure 3 shows the convergence of the relative errors versus (a) the iteration number $k$ and (b) the CPU time for the auto-tCG, auto-ttCG and auto-ttpCG methods with noise level $\nu=10^{-3}$, corresponding to Table 2. The iteration is terminated when the discrepancy principle is satisfied. From Figure 3(a) we see that the auto-ttCG and auto-ttpCG methods do not need to solve the normal equations for all $\mu_k$ ($k<8$); this shows that the auto-ttCG and auto-ttpCG methods improve on the auto-tCG method through the condition (24). Figure 3(b) shows that the auto-ttpCG method converges fastest among the three methods.
Table 2 lists the regularization parameter, the iteration number, the relative error, the SNR and the CPU time of the optimal solution obtained with the auto-tCG, auto-ttCG and auto-ttpCG methods for the noise levels $\nu=10^{-i}$, $i=2,3$. It can be seen from Table 2 that the auto-ttpCG method attains the lowest relative error, the highest SNR and the smallest CPU time for both noise levels.
Figure 4 shows the images reconstructed by the auto-tCG, auto-ttCG and auto-ttpCG methods from the blurred and noisy image with noise level $\nu=10^{-3}$ in Table 2. From Figure 4 we see that the image restored by the auto-ttpCG method looks slightly better than the others while requiring the least CPU time.
Example 4.2 (Color image). This example shows the restoration of a blurred Lena color image by Algorithms 2, 3 and 5. The original Lena image $X_{\mathrm{ori}}\in\mathbb{R}^{256\times 256\times 3}$ is stored as a tensor $\mathcal{X}_{\mathrm{true}}\in\mathbb{R}^{256\times 3\times 256}$ through the operator multi_twist. We set $N=256$, $\sigma=3$ and $\mathrm{band}=12$, and obtain $\mathcal{A}\in\mathbb{R}^{256\times 256\times 256}$ by
$$z=\big[\exp\!\big(-\tfrac{(0:\mathrm{band}-1).^2}{2\sigma^2}\big),\ \mathrm{zeros}(1,N-\mathrm{band})\big], \tag{37}$$
$$A=\mathrm{toeplitz}(z),\qquad \mathcal{A}(:,:,i)=\frac{1}{\sqrt{2\pi}\,\sigma}\,A(i,1)\,A,\qquad i=1,\dots,256.$$
Then $\mathrm{cond}(\mathcal{A}(:,:,1))=\dots=\mathrm{cond}(\mathcal{A}(:,:,12))=4.68\mathrm{e}{+}07$, and the condition numbers of the other frontal slices of $\mathcal{A}$ are infinite. The noise tensor $\mathcal{E}$ is defined by (36). The blurred and noisy tensor is obtained as $\mathcal{B}=\mathcal{A}*\mathcal{X}_{\mathrm{true}}+\mathcal{E}$ and is shown in Figure 6(a).
We divide the color image $\mathcal{B}$ into its lateral slices and process each slice independently through (1) by the auto-tCG, auto-ttCG and auto-ttpCG methods. Figure 5 shows the convergence of the relative errors versus (a) the iteration number $k$ and (b) the CPU time for the three methods when the first lateral slice $\mathcal{B}(:,1,:)$ of $\mathcal{B}$ is processed with $\nu=10^{-3}$. Results similar to those of Example 4.1 can be observed: the auto-ttCG and auto-ttpCG methods need fewer iterations than the auto-tCG method (Figure 5(a)), and the auto-ttpCG method converges fastest (Figure 5(b)).
Table 3 lists the relative error, the SNR and the CPU time of the optimal solution obtained with the auto-tCG, auto-ttCG and auto-ttpCG methods for the noise levels $\nu=10^{-i}$, $i=2,3$. The results are very similar to those in Table 2.
Table 3. Example 4.2: Comparison of relative error, SNR, and CPU time for the auto-tCG, auto-ttCG and auto-ttpCG methods with noise levels ν = 10^{-i}, i = 2, 3.
Noise level | Method | Relative error | SNR | CPU time (secs)
10^{-3} | auto-tCG | 5.90e-02 | 14.62 | 314.73
10^{-3} | auto-ttCG | 5.90e-02 | 14.62 | 262.81
10^{-3} | auto-ttpCG | 5.43e-02 | 15.37 | 103.41
10^{-2} | auto-tCG | 7.64e-02 | 12.37 | 117.48
10^{-2} | auto-ttCG | 7.48e-02 | 12.55 | 62.01
10^{-2} | auto-ttpCG | 7.01e-02 | 13.13 | 54.85
Figure 6 shows the images recovered by the auto-tCG, auto-ttCG and auto-ttpCG methods for the noise level $\nu=10^{-3}$. The results are very similar to those in Figure 4.
Figure 6. Example 4.2: (a) The blurred and noisy Lena image and the images reconstructed by (b) the auto-tCG method, (c) the auto-ttCG method and (d) the auto-ttpCG method for the noise level ν = 10^{-3} in Table 3.
Example 4.3 (Video). We recover the first 10 consecutive frames of the blurred and noisy Rhinos video from MATLAB. Each frame has 240 × 240 pixels. We store the 10 blur- and noise-free frames of the original video in the tensor $\mathcal{X}_{\mathrm{true}}\in\mathbb{R}^{240\times 10\times 240}$. Let $z$ be defined by (37) with $N=240$, $\sigma=2$ and $\mathrm{band}=12$. The coefficient tensor $\mathcal{A}$ is defined by
$$A=\frac{1}{\sqrt{2\pi}\,\sigma}\,\mathrm{toeplitz}(z),\qquad \mathcal{A}(:,:,i)=\frac{1}{\sqrt{2\pi}\,\sigma^2}\,A(i,1)\,A,\qquad i=1,\dots,240.$$
The condition numbers of the frontal slices of $\mathcal{A}$ are $\mathrm{cond}(\mathcal{A}(:,:,i))=7.4484\mathrm{e}{+}09$ for $i\le 12$, and the condition numbers of the remaining frontal slices are infinite. A suitable regularization parameter is determined by the discrepancy principle with $\eta=1.1$. The blurred and noisy tensor $\mathcal{B}$ is generated by $\mathcal{B}=\mathcal{A}*\mathcal{X}_{\mathrm{true}}+\mathcal{E}$ with $\mathcal{E}\in\mathbb{R}^{240\times 10\times 240}$ defined by (36).
Figure 7 shows the convergence of the relative errors versus the iteration number $k$ and versus the CPU time for the auto-tCG, auto-ttCG and auto-ttpCG methods when the second frame of the video with $\nu=10^{-3}$ is restored. Results very similar to those of Example 4.1 can be observed in Figure 7.
Table 4 displays the relative error, the SNR and the CPU time of the optimal solution obtained with the auto-tCG, auto-ttCG and auto-ttpCG methods for the second frame with the noise levels $\nu=10^{-i}$, $i=2,3$. The auto-ttpCG method attains the largest SNR and the smallest CPU time for both noise levels.
Figure 8 shows the second frame of the original video, of the blurred and noisy video, and of the videos recovered by the auto-tCG, auto-ttCG and auto-ttpCG methods with noise level $\nu=10^{-3}$, corresponding to the results in Table 4. The frame recovered by the auto-ttpCG method looks best among all recovered frames.

5. Conclusion

This paper presents three types of tensor Conjugate-Gradient methods for solving large-scale linear discrete ill-posed problems in tensor form. We first present a strategy for the automatic determination of a suitable regularization parameter for the tensor conjugate gradient (tCG) method. Furthermore, we develop a truncated version and a preconditioned version of the tCG method. The proposed methods are applied to several examples in image and video restoration.

Use of AI tools declaration

The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

Acknowledgements

The authors would like to thank the referees for their helpful and constructive comments. This research was supported in part by the Sichuan Science and Technology Program (grant 2022ZYD0008).

Conflict of interest

The authors declare no conflict of interest.

References

1. Kilmer, M.E.; Martin, C.D. Factorization strategies for third-order tensors. Linear Alg. Appl. 2011, 435, 641–658.
2. Hao, N.; Kilmer, M.E.; Braman, K.; Hoover, R.C. Facial recognition using tensor-tensor decompositions. SIAM J. Imaging Sci. 2013, 6, 437–463.
3. Zhang, J.; Saibaba, A.K.; Kilmer, M.E.; Aeron, S. A randomized tensor singular value decomposition based on the t-product. Numer. Linear Algebr. Appl. 2018, 25, e2179.
4. Fenu, C.; Reichel, L.; Rodriguez, G. GCV for Tikhonov regularization via global Golub-Kahan decomposition. Numer. Linear Algebr. Appl. 2016, 25, 467–484.
5. Hansen, P.C. Rank-Deficient and Discrete Ill-Posed Problems; SIAM: Philadelphia, 1998.
6. Kilmer, M.E.; Braman, K.; Hao, N.; Hoover, R.C. Third-order tensors as operators on matrices: A theoretical and computational framework with applications in imaging. SIAM J. Matrix Anal. Appl. 2013, 34, 148–172.
7. Engl, H.W.; Hanke, M.; Neubauer, A. Regularization of Inverse Problems; Kluwer: Dordrecht, 1996.
8. Kindermann, S. Convergence analysis of minimization-based noise level-free parameter choice rules for linear ill-posed problems. Electron. Trans. Numer. Anal. 2011, 38, 233–257.
9. Kindermann, S.; Raik, K. A simplified L-curve method as error estimator. Electron. Trans. Numer. Anal. 2020, 53, 217–238.
10. Reichel, L.; Rodriguez, G. Old and new parameter choice rules for discrete ill-posed problems. Numer. Algorithms 2013, 63, 65–87.
11. Reichel, L.; Ugwu, U.O. The tensor Golub-Kahan-Tikhonov method applied to the solution of ill-posed problems with a t-product structure. Numer. Linear Algebr. Appl. 2022, 29, e2412.
12. Ugwu, U.O.; Reichel, L. Tensor Arnoldi-Tikhonov and GMRES-type methods for ill-posed problems with a t-product structure. J. Sci. Comput. 2022, 90, 1–39.
13. Kilmer, M.E.; Braman, K.; Hao, N.; Hoover, R.C. Third-order tensors as operators on matrices: A theoretical and computational framework with applications in imaging. SIAM J. Matrix Anal. Appl. 2013, 34, 148–172.
14. Lund, K. The tensor t-function: A definition for functions of third-order tensors. Numer. Linear Algebr. Appl. 2020, 27, e2288.
15. Frommer, A.; Maass, P. Fast CG-based methods for Tikhonov-Phillips regularization. SIAM J. Sci. Comput. 1999, 20, 1831–1850.
16. Cichocki, A.; Mandic, D.; De Lathauwer, L.; Zhou, G.; Zhao, Q.; Caiafa, C.; Phan, H.A. Tensor decompositions for signal processing applications: From two-way to multiway component analysis. IEEE Signal Process. Mag. 2015, 32, 145–163.
17. Signoretto, M.; Tran Dinh, Q.; De Lathauwer, L.; Suykens, J.A. Learning with tensors: a framework based on convex optimization and spectral regularization. Mach. Learn. 2014, 94, 303–351.
18. Kilmer, M.E.; Horesh, L.; Avron, H.; Newman, E. Tensor-tensor algebra for optimal representation and compression of multiway data. Proc. Natl. Acad. Sci. U.S.A. 2021, 118, e2015851118.
19. Beik, F.P.A.; Najafi-Kalyani, M.; Reichel, L. Iterative Tikhonov regularization of tensor equations based on the Arnoldi process and some of its generalizations. Appl. Numer. Math. 2020, 151, 425–447.
20. Bentbib, A.H.; Khouia, A.; Sadok, H. The LSQR method for solving tensor least-squares problems. Electron. Trans. Numer. Anal. 2022, 55, 92–111.
21. Bentbib, A.H.; El Hachimi, A.; Jbilou, K.; Ratnani, A. Fast multidimensional completion and principal component analysis methods via the cosine product. Calcolo 2022, 59, 26.
22. Khaleel, H.S.; Sagheer, S.V.M.; Baburaj, M.; George, S.N. Denoising of Rician corrupted 3D magnetic resonance images using tensor-SVD. Biomed. Signal Process. Control 2018, 44, 82–95.
23. Ugwu, U.O.; Reichel, L. Tensor regularization by truncated iteration: a comparison of some solution methods for large-scale linear discrete ill-posed problems with a t-product. arXiv 2021, arXiv:2110.02485.
24. Zeng, C.; Ng, M.K. Decompositions of third-order tensors: HOSVD, T-SVD, and beyond. Numer. Linear Algebr. Appl. 2020, 27, e2290.
25. El Hachimi, A.; Jbilou, K.; Ratnani, A.; Reichel, L. Spectral computation with third-order tensors using the t-product. Appl. Numer. Math. 2023, 193, 1–21.
26. Zheng, M.M.; Ni, G. Approximation strategy based on the T-product for third-order quaternion tensors with application to color video compression. Appl. Math. Lett. 2023, 140, 108587.
27. Yu, Q.; Zhang, X. T-product factorization based method for matrix and tensor completion problems. Comput. Optim. Appl. 2023, 84, 761–788.
Figure 1. (a) Frontal slices A(:,:,k), (b) lateral slices A(:,j,:) and (c) tube fibers A(i,j,:).
Figure 2. The twist and squeeze operators.
Figure 3. Example 4.1: Convergence of (a) the relative errors versus the iteration number k and (b) the relative errors versus the CPU time for the auto-tCG, auto-ttCG and auto-ttpCG methods with noise level ν = 10^{-3}.
Figure 4. Example 4.1: (a) The blurred and noisy image and the images reconstructed by (b) the auto-tCG method (SNR=22.36, CPU=109.87), (c) the auto-ttCG method (SNR=22.41, CPU=80.93) and (d) the auto-ttpCG method (SNR=22.48, CPU=33.98) for the noise level ν = 10^{-3} in Table 2.
Figure 5. Example 4.2: Convergence of (a) the relative errors versus the iteration number k and (b) the relative errors versus the CPU time for the auto-tCG, auto-ttCG and auto-ttpCG methods with noise level ν = 10^{-3}.
Figure 7. Example 4.3: Convergence of (a) the relative errors versus the iteration number k and (b) the relative errors versus the CPU time for the auto-tCG, auto-ttCG and auto-ttpCG methods with noise level ν = 10^{-3}.
Figure 8. Example 4.3: (a) Original image, (b) the blurred and noisy image, and the images recovered by (c) the auto-tCG method, (d) the auto-ttCG method and (e) the auto-ttpCG method for the noise level ν = 10^{-3} in Table 4.
Table 1. Description of notation.
Notation | Interpretation
A^T | transpose of a tensor
A^{-1} | inverse of a tensor; A^{-T} = (A^{-1})^T = (A^T)^{-1}
Â | FFT of A along the third mode
unfold(A) | the block column matrix of A
bcirc(A) | the block-circulant matrix of A
I | identity tensor
A | matrix
I | identity matrix
‖A‖_F | Frobenius norm of a tensor A, i.e., ‖A‖_F = (Σ_{i=1}^l Σ_{j=1}^m Σ_{k=1}^n a_{ijk}^2)^{1/2}
∗ | t-product
A_j, A(:,j,:) | the j-th tensor column (j-th lateral slice) of A
A(:,:,j) | the j-th frontal slice of the tensor A
d | tube scalar
⟨A, B⟩ | ⟨A, B⟩ = Σ_{i,j,k} a_{ijk} b_{ijk}
⟨A_j, B_j⟩ | ⟨A_j, B_j⟩ = Σ_{i,k} a_{i1k} b_{i1k}
Table 2. Example 4.1: Comparison of relative error, SNR, and CPU time for the auto-tCG, auto-ttCG and auto-ttpCG methods with noise levels ν = 10^{-i}, i = 2, 3.
Noise level | Method | k | μ_k | Relative error | SNR | CPU (secs)
10^{-3} | auto-tCG | 15 | 1.96e-05 | 3.54e-02 | 22.36 | 109.87
10^{-3} | auto-ttCG | 15 | 1.96e-05 | 3.52e-02 | 22.41 | 80.93
10^{-3} | auto-ttpCG | 15 | 1.96e-05 | 3.49e-02 | 22.48 | 33.98
10^{-2} | auto-tCG | 11 | 3.14e-04 | 8.74e-02 | 14.51 | 81.94
10^{-2} | auto-ttCG | 11 | 3.14e-04 | 8.64e-02 | 14.61 | 26.42
10^{-2} | auto-ttpCG | 11 | 3.14e-04 | 8.54e-02 | 14.72 | 18.50
Table 4. Example 4.3: Comparison of relative error, SNR, and CPU time for the auto-tCG, auto-ttCG and auto-ttpCG methods with noise levels ν = 10^{-i}, i = 2, 3.
Noise level | Method | Relative error | SNR | CPU time (secs)
10^{-3} | auto-tCG | 2.94e-02 | 23.17 | 697.78
10^{-3} | auto-ttCG | 2.92e-02 | 23.23 | 487.35
10^{-3} | auto-ttpCG | 2.66e-02 | 24.05 | 214.16
10^{-2} | auto-tCG | 5.24e-02 | 18.15 | 480.75
10^{-2} | auto-ttCG | 5.10e-02 | 18.38 | 281.54
10^{-2} | auto-ttpCG | 4.74e-02 | 19.02 | 156.44