1. Introduction
Neural Ordinary Differential Equations (NODEs) [1,2,3] have enabled the use of deep learning for modeling discretely sampled dynamical systems. They provide a flexible trade-off between efficiency, memory cost, and accuracy while bridging traditional numerical modeling with modern deep learning, as demonstrated by various applications to time series, dynamics, and control [1,2,3,4,5,6,7,8,9]. However, since each time step is determined locally in time, NODEs are limited to describing systems whose dynamics are instantaneous. Integral equations (IEs), by contrast, model global “long-distance” spatiotemporal relations, and IE solvers often possess stability properties that are superior to those of solvers for ordinary and/or partial differential equations. Differential equations are therefore occasionally recast in integral-equation form so that they can be solved more efficiently by IE solvers, as exemplified by the applications described in [10,11,12].
Owing to this non-local behavior, IE-based models are well suited to modeling complex dynamics, i.e., to learning the operator underlying the system under consideration from data sampled from the respective system. As discussed in [13], the operator-learning problem is customarily formulated on finite grids, using finite-difference methods that approximate the domain of the functions under investigation; by contrast, learning with an IE solver samples the domain of integration continuously. As shown in [14], Neural Integral Equations (NIEs) and Attentional Neural Integral Equations (ANIEs) can be used to generate dynamics and infer the spatiotemporal relations that originally generated the data, thus enabling the continuous learning of non-local dynamics with arbitrary time resolution. ANIE interprets the self-attention mechanism as a Nyström method for approximating integrals [15], which enables efficient integration over higher dimensions, as discussed in [10,11,12,13,14,15] and references therein.
Neural nets are trained by minimizing a “loss functional” chosen by the user to represent the discrepancy between the output produced by the neural net’s decoder and a user-chosen “reference solution.” However, the physical system modeled by a neural net inevitably comprises imperfectly known parameters that stem from measurements and/or computations and are therefore afflicted by the uncertainties inherent in the respective experiments and/or computations. Hence, even if the neural net perfectly reproduces a given state of a physical system, the neural net’s “optimized weights” are subject to the uncertainties that afflict the parameters characterizing the underlying physical system, and these uncertainties inevitably propagate to the decoder’s output response. It is therefore important to quantify the impact of parameter/weight uncertainties on the uncertainties induced in the decoder’s output response. This impact is quantified by the sensitivities of the decoder’s response with respect to the optimized weights/parameters comprised within the neural net.
Neural nets comprise not only scalar-valued weights/parameters but also functions (e.g., correlations) of such scalar model parameters, which can conveniently be called “features of primary model parameters.” Cacuci [16] has developed the “nth-Order Features Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (nth-FASAM-N),” which enables the most efficient computation of the exact expressions of arbitrarily high-order sensitivities of model responses with respect to the model’s “features.” In turn, the sensitivities of the responses with respect to the primary model parameters are determined, analytically and trivially, by applying the “chain rule” to the expressions obtained for the response sensitivities with respect to the model’s features. The nth-FASAM-N [16] has been applied to develop general first- and second-order sensitivity analysis methodologies for NODEs [17] and for Neural Integral Equations of Fredholm type [18], which enable the computation, with unsurpassed efficiency, of the exact expressions of the first- and second-order sensitivities of decoder responses with respect to the underlying neural net’s optimized weights.
This work continues the application of the nth-FASAM-N methodology [16] by developing the “First- and Second-Order Features Adjoint Sensitivity Analysis Methodologies for Neural Integral Equations of Volterra Type” (acronyms: 1st-FASAM-NIE-V and 2nd-FASAM-NIE-V, respectively). The 1st-FASAM-NIE-V methodology, presented in Section 2, enables the most efficient computation of the exact expressions of all first-order sensitivities of NIE decoder responses with respect to all optimal values of the NIE-net’s parameters/weights, after the respective NIE-Volterra net has been optimized to represent the underlying physical system. The efficiency of the 1st-FASAM-NIE-V is illustrated in Section 3 by applying it to perform a comprehensive first-order sensitivity analysis of the well-known model [19,20,21] of neutron slowing down in a homogeneous medium containing fissionable material.
The general mathematical framework of the 2nd-FASAM-NIE-V methodology, presented in Section 4, enables the most efficient computation of the exact expressions of the second-order sensitivities of NIE decoder responses with respect to all optimal values of the NIE-net’s parameters/weights. The efficiency of the 2nd-FASAM-NIE-V is illustrated in Section 5 by applying it to perform a comprehensive second-order sensitivity analysis of the neutron slowing-down model [19,20,21] considered in Section 3. Section 6 concludes this work by presenting a discussion that highlights the unparalleled efficiency of the 2nd-FASAM-NIE-V methodology for performing sensitivity analysis of Volterra-type neural integral equations.
2. First-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of Volterra-Type (1st-FASAM-NIE-V)
Following [14], a network of nonlinear “Neural Integral Equations of Volterra-type” (NIE-Volterra) can be represented by the system of coupled equations shown below:
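In the notation defined in items (i)–(iv) below, such a system has the generic Volterra form shown here schematically (the kernel symbol $K_i$ is a notational placeholder rather than a quotation of Eq. (1)):

$$h_i(t) = g_i\big(t;\mathbf{F}(\boldsymbol{\theta})\big) + \int_{t_0}^{t} K_i\big(t,s,\mathbf{h}(s);\mathbf{F}(\boldsymbol{\theta})\big)\,ds, \qquad i = 1,\ldots,N_h, \quad t_0 \le t \le t_f.$$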
The quantities appearing in Eq. (1) are defined as follows:
- (i) The real-valued scalar quantities $t$ and $s$, with $t_0 \le s \le t \le t_f$, are time-like independent variables which parameterize the dynamics of the hidden/latent neuron units. Customarily, $t$ is called the “global time” while $s$ is called the “local time.” The initial time-value is denoted as $t_0$ while the stopping time-value is denoted as $t_f$.
- (ii) The components $\theta_1, \ldots, \theta_{TW}$ of the vector $\boldsymbol{\theta} \equiv (\theta_1, \ldots, \theta_{TW})^\dagger$ represent scalar learnable adjustable weights, where $TW$ denotes the total number of adjustable weights in all of the latent neural nets. The components of the column-vector $\boldsymbol{\theta}$ are considered to be “primary parameters,” while the components of the vector-valued function $\mathbf{F}(\boldsymbol{\theta}) \equiv [F_1(\boldsymbol{\theta}), \ldots, F_{TF}(\boldsymbol{\theta})]^\dagger$ represent the “feature” functions of the respective weights. The quantity $TF$ denotes the total number of feature functions of the primary model parameters comprised in the NIE-Volterra. In general, $\mathbf{F}(\boldsymbol{\theta})$ is a nonlinear function of $\boldsymbol{\theta}$. The total number of feature functions must necessarily be smaller than the total number of primary parameters (weights), i.e., $TF < TW$. In the extreme case when there are no feature functions, it follows that $F_j(\boldsymbol{\theta}) \equiv \theta_j$ for all $j = 1, \ldots, TW$. In this work, all vectors are considered to be column vectors, and the dagger “$\dagger$” symbol is used to denote “transposition.” The symbol “$\equiv$” is used to denote “is defined as” or, equivalently, “is by definition equal to.”
- (iii) The $N_h$-dimensional vector-valued function $\mathbf{h}(t) \equiv [h_1(t), \ldots, h_{N_h}(t)]^\dagger$ represents the hidden/latent neural networks. The quantity $N_h$ denotes the total number of components of $\mathbf{h}(t)$. At the initial time-value $t = t_0$, the functions $h_i(t)$ take on the known values $h_i(t_0)$.
- (iv) The functions $g_i\big(t;\mathbf{F}(\boldsymbol{\theta})\big)$, $i = 1, \ldots, N_h$, model the initial state (“encoder”) of the network. The components of the integral kernel depend nonlinearly on the features $\mathbf{F}(\boldsymbol{\theta})$ and on the latent state $\mathbf{h}(s)$, respectively, and model the dynamics of the latent neurons.
The “training” of the NIE-Volterra net is accomplished by using the “adjoint” or other methods to minimize the user-chosen “loss functional” intended to represent the discrepancy between the output produced by the NIE decoder and a “reference solution” chosen by the user. After the training is completed, the primary parameters (“weights”) will have been assigned “optimal” values, obtained as a result of having minimized the chosen loss functional. These optimal values for the primary parameters (“weights”) will be denoted using a superscript “zero,” i.e., $\boldsymbol{\theta}^0 \equiv (\theta_1^0, \ldots, \theta_{TW}^0)^\dagger$. Using these optimal/nominal parameter values to solve the NIE system will yield the optimal/nominal solution $\mathbf{h}^0(t)$, which will satisfy the following form of Eq. (1):
After the NIE-net is optimized to reproduce the underlying physical system as closely as possible, the subsequent responses of interest are no longer “loss functions” but become specific functionals of the NIE’s “decoder” response/output. Such a decoder response, denoted as $R[\mathbf{h}; \mathbf{F}(\boldsymbol{\theta})]$, can generally be represented as a scalar-valued functional of $\mathbf{h}(t)$ and $\mathbf{F}(\boldsymbol{\theta})$, defined as follows:
The function that models the decoder may contain distributions (e.g., Dirac-delta and/or Heaviside functionals) if the decoder response is to be evaluated at some particular point in time or over a subinterval within the interval $t_0 \le t \le t_f$.
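Schematically, such a response functional can be written as follows (the decoder kernel $D$ is a notational placeholder rather than a quotation of Eq. (3)):

$$R[\mathbf{h};\mathbf{F}(\boldsymbol{\theta})] \equiv \int_{t_0}^{t_f} D\big[\mathbf{h}(t);\mathbf{F}(\boldsymbol{\theta});t\big]\,dt.$$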
The optimal value of the decoder response, denoted as $R[\mathbf{h}^0; \mathbf{F}(\boldsymbol{\theta}^0)]$, is obtained by evaluating Eq. (3) at the optimal/nominal parameter values $\boldsymbol{\theta}^0$ and the optimal/nominal solution $\mathbf{h}^0(t)$, as follows:
The true values of the primary parameters (“weights”) that characterize the physical system modeled by the NIE-V net are afflicted by uncertainties inherent to the experimental and/or computational methodologies employed to model the original physical system. Therefore, the true values of the primary parameters (“weights”) will differ from the known nominal values (which are obtained after training the NIE-net to represent the model of the physical system) by variations denoted as $\delta\boldsymbol{\theta} \equiv (\delta\theta_1, \ldots, \delta\theta_{TW})^\dagger$. The variations $\delta\boldsymbol{\theta}$ will induce corresponding variations $\delta\mathbf{F} \equiv (\delta F_1, \ldots, \delta F_{TF})^\dagger$ in the feature functions, which in turn will induce variations $\delta\mathbf{h}(t) \equiv [\delta h_1(t), \ldots, \delta h_{N_h}(t)]^\dagger$ around the nominal/optimal functions $\mathbf{h}^0(t)$. Subsequently, the variations $\delta\mathbf{F}$ and $\delta\mathbf{h}(t)$ will induce variations in the NIE decoder’s response.
The 1st-FASAM-NIE-V methodology for computing the first-order sensitivities of the decoder’s response with respect to the NIE’s weights will be established by applying the same principles as those underlying the 1st-FASAM-N methodology [16]. These first-order sensitivities are embodied in the first-order G-variation $\delta R$ of the response $R[\mathbf{h}; \mathbf{F}(\boldsymbol{\theta})]$, for variations $\delta\mathbf{F}$ and $\delta\mathbf{h}(t)$ around the nominal values $\mathbf{F}(\boldsymbol{\theta}^0)$ and $\mathbf{h}^0(t)$, which is by definition obtained as follows:
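The G-variation is defined in the standard manner, namely:

$$\delta R\big[\mathbf{h}^0,\mathbf{F}^0;\delta\mathbf{h},\delta\mathbf{F}\big] \equiv \left\{\frac{d}{d\varepsilon}\,R\big[\mathbf{h}^0+\varepsilon\,\delta\mathbf{h};\,\mathbf{F}^0+\varepsilon\,\delta\mathbf{F}\big]\right\}_{\varepsilon=0},$$

where $\varepsilon$ denotes a real scalar; this definition underlies Eq. (5) and the splitting into direct- and indirect-effect terms described next.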
In Eq. (5), the “direct-effect term” arises directly from the variations $\delta\mathbf{F}$ (which in turn stem from the parameter variations $\delta\boldsymbol{\theta}$) and is defined in Eq. (6), while the “indirect-effect term” arises through the variations $\delta\mathbf{h}(t)$ in the hidden-state functions $\mathbf{h}(t)$ and is defined in Eq. (7).
The first-order relationship between the variations $\delta\mathbf{h}(t)$ and $\delta\mathbf{F}$ is obtained from the first-order G-variation of Eq. (1), as shown in Eq. (8). Performing the operations indicated in Eq. (8) yields the following NIE-V net, which will be called the “1st-Level Variational Sensitivity System” (1st-LVSS), for the components $v_i^{(1)}(t) \equiv \delta h_i(t)$, $i = 1, \ldots, N_h$, of the “1st-level variational function” $\mathbf{v}^{(1)}(t) \equiv \delta\mathbf{h}(t)$:
where:
As indicated in Eq. (9), the 1st-LVSS is to be computed at the nominal/optimal values of the respective model parameters. It is important to note that the 1st-LVSS is linear in the variational function $\mathbf{v}^{(1)}(t)$, although it generally remains nonlinear in $\mathbf{h}(t)$.
The 1st-LVSS would need to be solved anew to obtain the function $\mathbf{v}^{(1)}(t)$ that corresponds to each variation $\delta F_j$, $j = 1, \ldots, TF$; this procedure would become prohibitively expensive computationally if $TF$ is a large number. The need for repeatedly solving the 1st-LVSS can be avoided by recasting the indirect-effect term in terms of an expression that does not involve the function $\mathbf{v}^{(1)}(t)$. This goal can be achieved by expressing the indirect-effect term in terms of another function, which will be called the “1st-level adjoint function,” and which will be the solution of the “1st-Level Adjoint Sensitivity System (1st-LASS)” to be constructed next.
The 1st-LASS will be constructed in a Hilbert space, denoted as $H_1$, comprising elements of the same form as $\mathbf{h}(t)$. The inner product of two elements $\boldsymbol{\psi}(t) \in H_1$ and $\boldsymbol{\varphi}(t) \in H_1$ will be denoted as $\langle \boldsymbol{\psi}, \boldsymbol{\varphi} \rangle_1$ and is defined as follows:
The inner product is required to hold in a neighborhood of the nominal values $\mathbf{h}^0(t)$ and $\boldsymbol{\theta}^0$.
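The customary realization of such an inner product for $N_h$-component functions, consistent with the integration-by-parts manipulations used below, is

$$\langle \boldsymbol{\psi}, \boldsymbol{\varphi}\rangle_1 \equiv \sum_{i=1}^{N_h}\int_{t_0}^{t_f}\psi_i(t)\,\varphi_i(t)\,dt.$$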
The next step is to form the inner product of Eq. (9) with a vector $\mathbf{a}^{(1)}(t) \equiv [a_1^{(1)}(t), \ldots, a_{N_h}^{(1)}(t)]^\dagger \in H_1$, where the superscript “(1)” indicates “1st-level,” to obtain the following relationship:
The second term on the left-side of Eq. (12) is transformed using “integration by parts” as follows:
Replacing the result obtained in Eq. (13) into Eq. (12) yields the following expression for the left-side of Eq. (12):
The term on the right-side of Eq. (14) is now required to represent the “indirect-effect” term defined in Eq. (7), which is achieved by requiring that the components of the function $\mathbf{a}^{(1)}(t)$ satisfy the following system of equations, for $i = 1, \ldots, N_h$:
The Volterra-like neural system obtained in Eq. (15) will be called the “1st-Level Adjoint Sensitivity System” and its solution, $\mathbf{a}^{(1)}(t)$, will be called the “1st-level adjoint sensitivity function.” The 1st-LASS is to be solved using the nominal/optimal values for the parameters and for the function $\mathbf{h}(t)$, but this fact has not been explicitly indicated in order to simplify the notation. The 1st-LASS is linear in $\mathbf{a}^{(1)}(t)$ but is, in general, nonlinear in $\mathbf{h}(t)$. Notably, the 1st-LASS is independent of any parameter variations and needs to be solved only once to determine the 1st-level adjoint sensitivity function $\mathbf{a}^{(1)}(t)$. The 1st-LASS is a “final-value problem,” since the computation of the adjoint function will commence at the final time $t = t_f$, with known values there.
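The reversal of the time direction can be seen schematically in the scalar case: if the 1st-LVSS has the generic Volterra form $v(t) = q(t) + \int_{t_0}^{t} K(t,s)\,v(s)\,ds$, then interchanging the order of integration in the inner product $\langle a^{(1)}, v \rangle_1$ shows that the corresponding adjoint equation is the reversed-direction Volterra equation

$$a^{(1)}(s) = d(s) + \int_{s}^{t_f} K(t,s)\,a^{(1)}(t)\,dt,$$

where $d$ denotes the source term stemming from the indirect-effect term; the inner integration now runs from $s$ to $t_f$, which is why the computation of the adjoint function proceeds backwards from $t_f$.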
It follows from Eqs. (12)‒(15) that the indirect-effect term defined in Eq. (7) can be expressed in terms of the 1st-level adjoint sensitivity function $\mathbf{a}^{(1)}(t)$ as follows:
Using the results obtained in Eqs. (16) and (6) in Eq. (5) yields the following expression for the G-variation $\delta R$, which is seen to be linear in the variations $\delta F_j$:
Identifying in Eq. (17) the expressions that multiply the variations $\delta F_j$ yields the following expressions for the sensitivities $\partial R/\partial F_j$ of the response $R$ with respect to the components $F_j(\boldsymbol{\theta})$ of the feature function $\mathbf{F}(\boldsymbol{\theta})$, for $j = 1, \ldots, TF$:
The expression on the right-side of Eq. (18) is to be evaluated at the nominal/optimal values for the respective model parameters, but this fact has not been indicated explicitly in order to simplify the notation.
The sensitivities with respect to the primary model parameters can be obtained by using the result obtained in Eq. (18) together with the “chain rule” of differentiating compound functions, as follows:
The sensitivities $\partial R/\partial F_j$ are obtained from Eq. (18), while the derivatives $\partial F_j/\partial\theta_k$ are obtained analytically, exactly, from the known expressions of the feature functions $F_j(\boldsymbol{\theta})$.
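The mechanics of Eq. (19) can be illustrated by the following minimal sketch, in which the feature functions, their (hypothetical) analytical expressions, and the values of $\partial R/\partial F_j$ are all illustrative placeholders:

```python
import numpy as np

# Hypothetical example with TW = 4 weights and TF = 2 feature functions:
# F1(theta) = theta1*theta2 and F2(theta) = theta3 + theta4**2.
theta = np.array([0.5, 2.0, 1.0, 3.0])

def feature_jacobian(theta):
    # dF_j/dtheta_k, obtained analytically from the known expressions of F.
    return np.array([
        [theta[1], theta[0], 0.0, 0.0],
        [0.0,      0.0,      1.0, 2.0 * theta[3]],
    ])

# Suppose the adjoint computation of Eq. (18) produced the TF = 2 sensitivities:
dR_dF = np.array([1.7, -0.3])  # illustrative values

# Eq. (19): dR/dtheta_k = sum_j (dR/dF_j)(dF_j/dtheta_k); no further
# large-scale computations are needed to obtain all TW sensitivities.
dR_dtheta = feature_jacobian(theta).T @ dR_dF
print(dR_dtheta)
```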
Particular Case: The First-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of Volterra-Type (1st-CASAM-NIE-V)
When no feature functions can be constructed from the model parameters/weights, the feature functions become identical to the parameters, i.e., $F_j(\boldsymbol{\theta}) \equiv \theta_j$ for all $j = 1, \ldots, TW$. In this case, the expression obtained in Eq. (18) directly yields the first-order sensitivities $\partial R/\partial\theta_j$ of the decoder response with respect to the model weights/parameters, for all $j = 1, \ldots, TW$, taking on the following specific form:
Since the 1st-LASS is independent of any parameter variations, the 1st-level adjoint sensitivity function $\mathbf{a}^{(1)}(t)$ which appears in Eq. (20) remains the solution of the 1st-LASS defined by Eq. (15). In this case, however, all of the sensitivities $\partial R/\partial\theta_j$, for all $j = 1, \ldots, TW$, would be obtained by computing $TW$ integrals using quadrature formulas. Thus, when there are no feature functions of parameters, the 1st-FASAM-NIE-V reduces to the “First-Order Comprehensive Adjoint Sensitivity Analysis Methodology [16] applied to Neural Integral Equations of Volterra Type” (1st-CASAM-NIE-V). On the other hand, when features of parameters can be constructed, only $TF < TW$ numerical computations of integrals using quadrature formulas are required, using Eq. (18), to obtain the sensitivities $\partial R/\partial F_j$, $j = 1, \ldots, TF$. Subsequently, the sensitivities with respect to the model’s weights/parameters are obtained analytically using the chain rule provided in Eq. (19).
3. Illustrative Application of the 1st-CASAM-NIE-V and 1st-FASAM-NIE-V Methodologies to Neutron Slowing Down in an Infinite Homogeneous Hydrogenous Medium
The illustrative model considered in this Section is a Volterra-type integral equation that describes the energy distribution of neutrons in a homogeneous hydrogenous medium (such as a water-moderated/cooled reactor system) containing 238U (among other materials), a heavy element that strongly absorbs neutrons. The distribution of collided neutrons in such a medium is described [19,20,21] by the following linear integral equation of Volterra type, customarily called the “neutron slowing-down equation,” for the neutron collision density denoted as $\varphi(E)$:
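A representative form of this equation for a hydrogenous medium, consistent with the definitions in items (i)–(iv) below (the notation, rather than the exact typography of Eq. (21), is assumed here), is

$$\varphi(E) = \frac{S}{E_s}\,\frac{\Sigma_s(E_s)}{\Sigma_t(E_s)} + \int_{E}^{E_s}\frac{\Sigma_s(E')}{\Sigma_t(E')}\,\varphi(E')\,\frac{dE'}{E'}, \qquad E_{min} \le E \le E_s.$$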
The various quantities that appear in Eq. (21) are defined as follows:
- (i) The quantity $S$ denotes the rate at which the source neutrons, considered to be monoenergetic, are emitted at the “source energy” $E_s$. Neutron up-scattering is considered to be negligible; therefore, $E_s$ is the highest energy in the medium.
- (ii) The quantity $E$, $E_{min} \le E \le E_s$, denotes the instantaneous energy of the collided neutrons; $E_{min}$ denotes the lowest neutron energy in the model.
- (iii) The quantity $\Sigma_s(E)$ denotes the medium’s macroscopic scattering cross section, which is defined in Eq. (22) (see the schematic form after this list), where $M$ denotes the number of materials in the medium, $w_i$ denotes the relative weighting of the i-th material in the medium, $N_i$ denotes the number density of the i-th material, while $\sigma_s^{(i)}(E)$ denotes the energy-dependent microscopic scattering cross section of the i-th material.
- (iv) The quantity $\Sigma_t(E)$ denotes the medium’s macroscopic total cross section, which is defined in Eq. (23) (see the schematic form after this list), where $\sigma_t^{(i)}(E)$ denotes the energy-dependent microscopic total cross section of the i-th material. The quantities $w_i$, $N_i$, $\sigma_s^{(i)}(E)$, and $\sigma_t^{(i)}(E)$ are subject to uncertainties since they are determined from experimentally obtained data.
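In this notation, the two macroscopic cross sections defined in Eqs. (22) and (23) take the schematic forms

$$\Sigma_s(E) \equiv \sum_{i=1}^{M} w_i\,N_i\,\sigma_s^{(i)}(E), \qquad \Sigma_t(E) \equiv \sum_{i=1}^{M} w_i\,N_i\,\sigma_t^{(i)}(E).$$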
Notably, the Volterra-type Eq. (21) is a “final-value problem,” since the computation is started at the highest energy value, $E = E_s$, and progresses towards the lowest energy value, $E = E_{min}$. Customarily, the solution of Eq. (21) is written in the form shown in Eq. (24), where $\Sigma_a(E) \equiv \Sigma_t(E) - \Sigma_s(E)$ denotes the medium’s macroscopic absorption cross section. The expression provided in Eq. (24) is amenable to computations of the loss of neutrons due to absorbing materials, particularly in the so-called “resonance” energy region.
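Differentiating the representative Volterra form given above with respect to $E$ and integrating the resulting first-order differential equation yields a solution of this type, in which the exponential factor is the familiar probability of escaping absorption while slowing down:

$$\varphi(E) = \frac{S}{E}\,\frac{\Sigma_s(E_s)}{\Sigma_t(E_s)}\,\exp\left[-\int_{E}^{E_s}\frac{\Sigma_a(E')}{\Sigma_t(E')}\,\frac{dE'}{E'}\right].$$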
A typical “decoder response” for the NIE-Volterra network modeled by Eq. (21) is the energy-averaged collision density, denoted below as $R$, which would be measured by a detector having an interaction cross section $\Sigma_d \equiv N_d\,\sigma_d$. Mathematically, this detector response can be expressed as shown in Eq. (25), where $N_d$ and $\sigma_d$ denote, respectively, the detector material’s atomic number density and the microscopic cross section describing the interaction (e.g., absorption) of neutrons with the detector’s material; $N_d$ and $\sigma_d$ can be considered the “weights” that characterize the neural net’s “decoder.”
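One schematic realization of such a response (the energy-averaging normalization used here is an assumption rather than a quotation of Eq. (25)) is

$$R \equiv \frac{1}{E_s - E_{min}}\int_{E_{min}}^{E_s}\Sigma_d\,\varphi(E)\,dE.$$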
Since the energy dependence of the cross sections does not play a significant role in the sensitivity analysis of the NIE-Volterra modeled by Eq. (21), the respective microscopic cross sections will henceforth be considered energy-independent, in order to simplify the ensuing derivations while illustrating the application of the 1st-FASAM-NIE-V. For energy-independent cross sections, Eqs. (21) and (25) take on the forms of Eqs. (26) and (27), respectively. In Eqs. (26) and (27), the source strength $S$ is an imprecisely known “weight” that characterizes the neural net’s “encoder.” Furthermore, the (column) vector of parameters denoted as $\boldsymbol{\alpha}$ comprises as components the “imprecisely known primary model parameters” (or “weights,” as they are customarily called when referring to neural nets) and is defined in Eq. (28), where $TP$ denotes the total number of imprecisely known weights/parameters. These primary model parameters/weights are not known exactly but are affected by uncertainties, since they stem from experimental procedures, which determine the nominal/mean/optimal values and the second-order moments of their otherwise unknown joint distribution; the third- and higher-order moments are rarely known. It is convenient to denote the nominal values of these primary model parameters/weights by using the superscript “zero,” i.e., $\boldsymbol{\alpha}^0 \equiv (\alpha_1^0, \ldots, \alpha_{TP}^0)^\dagger$. The “feature function of primary parameters,” denoted as $f(\boldsymbol{\alpha})$, is defined in Eq. (30). The closed-form solution of Eq. (26), given in Eq. (31), is expressed in terms of the feature function $f(\boldsymbol{\alpha})$.
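Consistent with the representative forms given above, the natural identification (inferred from the structure of Eqs. (21)–(27) rather than quoted) is that the feature function is the scattering probability per collision, with the corresponding constant-cross-section solution

$$f(\boldsymbol{\alpha}) \equiv \frac{\Sigma_s}{\Sigma_t}, \qquad \varphi(E) = \frac{S\,f(\boldsymbol{\alpha})}{E_s}\left(\frac{E_s}{E}\right)^{f(\boldsymbol{\alpha})}.$$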
The closed-form expression of the decoder response can be readily obtained by substituting the result obtained in Eq. (31) into Eq. (27) and performing the integration over the energy variable, to obtain:
The expression obtained in Eq. (32) reveals that the imprecisely known quantities that affect the decoder response are as follows:
- (i) the source strength $S$;
- (ii) the detector interaction macroscopic cross section $\Sigma_d \equiv N_d\,\sigma_d$, which can be considered to be a “feature function” of the model parameters $N_d$ and $\sigma_d$;
- (iii) the feature function $f(\boldsymbol{\alpha})$.
3.1. Application of 1st-CASAM-NIE-V to Directly Compute the First-Order Sensitivities of the Decoder Response with Respect to the Primary Model Parameters
The first-order sensitivities of the decoder response with respect to the model parameters are obtained by applying the definition of the G-differential to Eq. (26), for arbitrary parameter variations $\delta\boldsymbol{\alpha} \equiv (\delta\alpha_1, \ldots, \delta\alpha_{TP})^\dagger$ around the parameters’ nominal values $\boldsymbol{\alpha}^0$. These parameter variations will induce variations $\delta\varphi(E)$ in the neutron collision density around its nominal value $\varphi^0(E)$. The variations $\delta\boldsymbol{\alpha}$ and $\delta\varphi(E)$ will, in turn, induce variations in the decoder’s response.
The first-order Gateaux (G-)variation $\delta R$ is obtained, by definition, from Eq. (27), as shown in Eq. (33), where the “direct-effect” term arises directly from the parameter variations $\delta\boldsymbol{\alpha}$ and is defined in Eq. (34), while the “indirect-effect” term arises from the variations $\delta\varphi(E)$ and is defined in Eq. (35).
As indicated in Eqs. (34) and (35), both the direct-effect and the indirect-effect terms are to be evaluated at the nominal parameter values.
The first-order relation between the variation $\delta\varphi(E)$ and the parameter variations $\delta\boldsymbol{\alpha}$ is obtained by evaluating the G-variation of Eq. (26) for variations around the nominal parameter values $\boldsymbol{\alpha}^0$, which yields, by definition, the following NIE-Volterra equation for $\delta\varphi(E)$:
where:
The second equality in Eq. (37) has been obtained by using Eqs. (26) and (31) to eliminate the integral term involving $\varphi(E)$.
The particular form of the first-order derivative of the feature function, $\partial f(\boldsymbol{\alpha})/\partial\boldsymbol{\alpha}$, which appears in Eq. (37), is obtained by using the definition of $f(\boldsymbol{\alpha})$ provided in Eq. (30), which yields the following expression:
In view of the definition provided in Eq. (22), the derivatives $\partial\Sigma_s/\partial\alpha_j$ have the following particular expressions:
In view of the definition provided in Eq. (23), the derivatives $\partial\Sigma_t/\partial\alpha_j$ have the following particular expressions:
The NIE-Volterra net represented by Eq. (36) will be called the “1st-Level Variational Sensitivity System (1st-LVSS)” and its solution, $\delta\varphi(E)$, will be called the “1st-level variational sensitivity function.” It is evident that Eq. (36) would need to be solved anew for the source variation $\delta S$ and for every parameter variation $\delta\alpha_j$, $j = 1, \ldots, TP$, in order to obtain the corresponding variation $\delta\varphi(E)$. This need for repeatedly solving Eq. (36) can be circumvented by applying the principles of the 1st-CASAM-NIE-V, generally outlined in Section 2, to eliminate the appearance of the variation $\delta\varphi(E)$ in the indirect-effect term defined in Eq. (35), while expressing this indirect-effect term as a functional of a first-level adjoint function that does not depend on any parameter variation, as follows:
Consider that the function $\delta\varphi(E)$ belongs to a Hilbert space denoted as $H_1$, which is defined on the domain $E_{min} \le E \le E_s$. The inner product in $H_1$ of two functions $\psi(E)$ and $\chi(E)$ will be denoted as $\langle \psi, \chi \rangle_1$ and is defined as follows:
Form the inner product of Eq. (36) with a function $a^{(1)}(E) \in H_1$, where the superscript “(1)” indicates “1st-level,” to obtain the following relationship:
Transform the left-side of Eq. (46) as follows:
Require the last term in Eq. (47) to represent the indirect-effect term defined in Eq. (35), which yields the following “1st-Level Adjoint Sensitivity System (1st-LASS)” for the first-level adjoint sensitivity function $a^{(1)}(E)$:
The 1st-LASS represented by Eq. (50) is a linear NIE-Volterra net, which is independent of any parameter variation and needs to be solved just once to obtain the first-level adjoint sensitivity function $a^{(1)}(E)$. Notably, the 1st-LASS is an “initial-value problem,” in that the computation of $a^{(1)}(E)$ commences at the lowest energy value, $E = E_{min}$, where the adjoint function takes on a known value, and progresses towards the highest energy value, $E = E_s$. For further reference, the closed-form solution of Eq. (50) can be obtained by differentiating this equation with respect to $E$ and subsequently integrating the resulting first-order linear differential equation, to obtain the following exact expression:
The expression on the right-side of Eq. (51) is to be evaluated at the nominal parameter values $\boldsymbol{\alpha}^0$, but the superscript “zero” has been omitted for notational simplicity.
Using Eqs. (46), (47) and (50) yields the following expression for the indirect-effect term defined in Eq. (35):
The expression on the right-side of Eq. (52) is to be evaluated at the nominal parameter values $\boldsymbol{\alpha}^0$, but the superscript “zero” has been omitted for notational simplicity.
Adding the expression obtained in Eq. (52) to the expression of the direct-effect term represented by Eq. (34) yields the following expression for the first-order G-variation $\delta R$:
It follows from Eq. (53) that the first-order sensitivities of the decoder response with respect to the (encoder’s) source strength and the optimal weights/parameters have the following expressions:
Inserting into Eqs. (54)‒(57) the closed-form expression for the neutron collision density obtained in Eq. (31) yields the following closed-form explicit expressions for the first-order sensitivities of the decoder response with respect to the (encoder’s) source strength and the optimal weights/parameters:
The correctness of the expressions obtained in Eqs. (58)‒(61) can be readily verified by differentiating the expressions of the decoder’s response obtained in Eq. (32).
In practice, only the exact mathematical expression of the 1st-LASS, namely Eq. (50), and the exact mathematical expressions of the first-order sensitivities obtained in Eqs. (54)‒(57) are available. The solution of the 1st-LASS, which is a linear NIE-Volterra net for the first-level adjoint sensitivity function $a^{(1)}(E)$, would need to be obtained numerically in practice. The numerical solution for $a^{(1)}(E)$ would then be used to determine the first-order sensitivities stemming from the “indirect-effect” term, by using quadrature formulas to evaluate the integrals obtained in Eqs. (54)‒(57). It is very important to note that a single “large-scale” computation, for determining numerically the adjoint function $a^{(1)}(E)$ by solving the 1st-LASS (a NIE-Volterra-type equation), suffices for evaluating all of the first-order sensitivities. The numerical computations using quadrature formulas for evaluating the integrals in Eqs. (54)‒(57) are considered to be “small-scale” computations.
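The computational advantage can be illustrated by the following minimal numerical sketch, in which a generic Volterra operator is discretized into a lower-triangular system (all kernels, sources, and decoder weights are illustrative placeholders, not the slowing-down model itself):

```python
import numpy as np

# Discretize a generic Volterra problem, phi = q + K phi, on n grid points;
# the discretized kernel K is strictly lower-triangular (Volterra structure).
rng = np.random.default_rng(0)
n = 200
K = np.tril(0.3 * rng.random((n, n)) / n, k=-1)
I = np.eye(n)

d = rng.random(n)             # discretized decoder weights: R = d . phi
sources = rng.random((5, n))  # five different variational source terms q_j

# Direct ("1st-LVSS") approach: one large-scale solve per source term.
R_direct = [d @ np.linalg.solve(I - K, q) for q in sources]

# Adjoint ("1st-LASS") approach: a single solve of the transposed system
# (upper-triangular, i.e., swept in the reverse direction), followed by
# one small-scale inner product per source term.
a = np.linalg.solve((I - K).T, d)
R_adjoint = [a @ q for q in sources]

print(np.allclose(R_direct, R_adjoint))  # True: <a, q_j> = <d, phi_j> for all j
```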
As has already been observed in the brief remarks following Eq. (37), the first-order sensitivities of the decoder response with respect to the encoder source strength $S$ and the model weights/parameters could also have been computed by repeatedly solving, numerically, the NIE-Volterra net (1st-LVSS) represented by Eq. (36). This procedure would be very expensive computationally, since it would require large-scale computations to solve the 1st-LVSS defined by Eq. (36) in order to obtain the variation $\delta\varphi(E)$ for every parameter variation $\delta\alpha_j$ and for the source variation $\delta S$. In addition, the same amount of “quadrature” computations would need to be performed using Eq. (35) as would be needed for evaluating the first-order sensitivities using Eqs. (54)‒(57).
3.2. Efficient Indirect Computation Using the 1st-FASAM-NIE-V of the First-Order Sensitivities of the Decoder Response with Respect to Primary Model Parameters
When feature functions of model parameters, such as $f(\boldsymbol{\alpha})$ and $\Sigma_d$, can be identified, as is the case with the NIE-Volterra net and the decoder response represented by Eqs. (26) and (27), respectively, it is considerably more efficient to determine the first-order sensitivities of the decoder response with respect to the feature functions and subsequently to derive analytically the sensitivities with respect to the primary model parameters by using the “chain rule of differentiation,” as will be shown in this Section. Thus, considering arbitrary variations $\delta f$ and $\delta\Sigma_d$ around the nominal values $f^0 \equiv f(\boldsymbol{\alpha}^0)$ and $\Sigma_d^0$, respectively, the first-order G-variation of the decoder response has the expression shown in Eq. (62), where the indirect-effect term is defined in Eq. (35). The first-order relation between the variation $\delta\varphi(E)$ and the variations $\delta f$ and $\delta S$ is obtained, by definition, from Eq. (26) as follows:
where:
Comparing Eq. (63) to Eq. (36) indicates that the only difference between these equations is the expression of the source term, which is expressed in terms of $\delta f$ in Eq. (64). Consequently, the first-level adjoint sensitivity function that corresponds to the variational function $\delta\varphi(E)$ is determined by following the same procedure as outlined in Eqs. (46)‒(50), ultimately obtaining the same 1st-LASS as was obtained in Eq. (50), having as its solution the same expression for $a^{(1)}(E)$ as was obtained in Eq. (51). It further follows that the indirect-effect term will have the following expression:
It follows from Eqs. (62) and (65) that the first-order G-variation $\delta R$ has the following expression:
As indicated by the expression obtained in Eq. (66), the first-order sensitivities of the decoder response with respect to the feature functions and the encoder’s source strength are as follows:
The closed-form expressions of the above sensitivities are readily determined by using in Eqs. (67)‒(69) the expressions obtained in Eqs. (51) and (24), and by performing the respective integrations, to obtain:
The first-order sensitivities with respect to the primary parameters are obtained analytically from Eqs. (67) and (68), respectively, by using the following “chain rule” of differentiation:
The specific expressions of the first-order sensitivities $\partial R/\partial\alpha_j$, $j = 1, \ldots, TP$, are obtained by using Eq. (75) in conjunction with Eq. (69) and Eqs. (38)‒(44).
3.3. Discussion: Direct Versus Indirect Computation of the First-Order Sensitivities of the Decoder Response with Respect to the Primary Model Parameters
The principles of the 1st-CASAM-NIE-V were applied in Section 3.1 to determine the first-order sensitivities of the decoder response directly with respect to the model’s primary parameters/weights. It has been shown that this procedure requires a single “large-scale” computation for solving a NIE-Volterra equation in order to determine the (single) 1st-level adjoint sensitivity function $a^{(1)}(E)$, which is subsequently used in $TP$ integrals that are computed using quadrature formulas. The two additional first-order sensitivities with respect to the components of $\Sigma_d$ require a single quadrature involving the forward function $\varphi(E)$.
The principles of the 1st-FASAM-NIE-V were applied in Section 3.2 to determine the first-order sensitivities of the decoder response with respect to the feature functions. This path required just two (as opposed to $TP$) numerical evaluations of integrals using quadrature formulas involving the 1st-level adjoint sensitivity function $a^{(1)}(E)$. The sensitivities of the decoder response with respect to the primary parameters/weights were subsequently determined analytically, using the “chain rule of differentiation” applied to the explicitly known expression of the feature function $f(\boldsymbol{\alpha})$. Evaluating the two additional first-order sensitivities with respect to the components of $\Sigma_d$ requires a single quadrature involving the forward function $\varphi(E)$, as in Section 3.1. Evidently, the indirect path presented in Section 3.2 is computationally more efficient, since it requires substantially fewer numerical quadratures than the direct path presented in Section 3.1. The superiority of the indirect path, via “feature functions,” over the direct computation of sensitivities with respect to the model parameters becomes considerably more evident for the computation of second-order sensitivities, as will be shown in Section 4 and Section 5, below.
Of course, when no feature functions can be identified, the 1st-FASAM-NIE-V methodology becomes identical to the 1st-CASAM-NIE-V methodology.
4. The Second-Order Features Adjoint Sensitivity Analysis Methodology for Neural Integral Equations of Volterra-Type (2nd-FASAM-NIE-V)
The second-order sensitivities of the response defined in Eq. (3) will be computed by conceptually using their basic definition as “the first-order sensitivities of the first-order sensitivities.” Thus, the second-order sensitivities stemming from the first-order sensitivities $\partial R/\partial F_j$ are obtained from the first-order G-differential of Eq. (18), for $j = 1, \ldots, TF$, as follows:
In Eq. (76), the expression of the direct-effect term is obtained after performing the differentiation with respect to the scalar parameter $\varepsilon$ and comprises only the variations $\delta\mathbf{F}$ (stemming from variations in the model parameters), being defined as follows:
The expression on the right-side of Eq. (77) is to be evaluated at the nominal/optimal values for the respective model parameters, but this fact has not been indicated explicitly in order to simplify the notation.
The expression of the indirect-effect term defined in Eq. (76) is obtained after performing the differentiation with respect to the scalar parameter $\varepsilon$ and comprises the variations $\delta\mathbf{h}(t)$ and $\delta\mathbf{a}^{(1)}(t)$, as follows:
The expressions in Eq. (78) are to be evaluated at the nominal values of the respective functions and parameters, but the respective indication (i.e., the superscript “zero”) has been omitted in order to simplify the notation.
The direct-effect term can be evaluated at this stage for all variations $\delta\mathbf{F}$, but the indirect-effect term can be evaluated only after having determined the variations $\delta\mathbf{h}(t) \equiv \mathbf{v}^{(1)}(t)$ and $\delta\mathbf{a}^{(1)}(t)$. The variation $\mathbf{v}^{(1)}(t)$ is the solution of the 1st-LVSS defined by Eq. (9). On the other hand, the variational function $\delta\mathbf{a}^{(1)}(t)$ is the solution of the system of equations obtained by G-differentiating the 1st-LASS. By definition, the G-differential of Eq. (15) is obtained as shown in Eq. (79), for $i = 1, \ldots, N_h$. Performing the operations indicated in Eq. (79) and rearranging the various terms yields the following relations, for $i = 1, \ldots, N_h$:
where:
As indicated by the result obtained in Eq. (80), the variations $\delta\mathbf{a}^{(1)}(t)$ are coupled to the variations $\mathbf{v}^{(1)}(t)$. Therefore, they can be obtained by simultaneously solving Eqs. (80) and (9), which together will be called the “2nd-Level Variational Sensitivity System (2nd-LVSS).” The solution of the 2nd-LVSS, namely the block-vector $\mathbf{v}^{(2)}(t)$ comprising the components of $\mathbf{v}^{(1)}(t)$ and $\delta\mathbf{a}^{(1)}(t)$, will be called the “2nd-level variational sensitivity function.” Since the 2nd-LVSS depends on the variations $\delta\mathbf{F}$ (stemming from variations in the model parameters), it would need to be solved anew for each such variation. The repeated solving of the 2nd-LVSS can be avoided by following the general principles underlying the 2nd-FASAM [16], which considers the function $\mathbf{v}^{(2)}(t)$ to be an element in a Hilbert space denoted as $H_2$. The Hilbert space $H_2$ is considered to be endowed with an inner product, denoted as $\langle \cdot, \cdot \rangle_2$, between two block-vectors, each comprising $2N_h$ components of the same form as $\mathbf{v}^{(2)}(t)$, which is defined as follows:
Following the general principles underlying the 2nd-FASAM [16], the function $\mathbf{v}^{(2)}(t)$ will be eliminated from the expression of each of the indirect-effect terms, $j = 1, \ldots, TF$, defined in Eq. (78). This elimination is achieved by considering, for each index $j$, a vector-valued 2nd-level adjoint function denoted as $\mathbf{a}^{(2)}(j; t)$, comprising $2N_h$ components. Using the definition provided in Eq. (82), construct the inner product of Eqs. (9) and (80) with the vector $\mathbf{a}^{(2)}(j; t)$, to obtain the following relation:
where:
Following the principles of the 2nd-CASAM [16], the left-side of Eq. (83) will be identified with the indirect-effect term defined in Eq. (78), thereby determining the (as yet undetermined) functions $\mathbf{a}^{(2)}(j; t)$. For this purpose, the right-side of Eq. (78) is cast in the form of the inner product defined in Eq. (82). The terms on the right-side of Eq. (78) involving the components of the function $\mathbf{v}^{(1)}(t)$ are already in the desired format, but the terms involving the components of the function $\delta\mathbf{a}^{(1)}(t)$ must be rearranged, as follows:
- (i) The fourth term on the right-side of Eq. (78) is recast by using “integration by parts” as follows:
- (ii) The sixth (last) term on the right-side of Eq. (78) is recast by using “integration by parts,” as above, to obtain the following relation:
Using in Eq. (78) the results obtained in Eqs. (85) and (86) yields the following expression for the indirect-effect term, for $j = 1, \ldots, TF$:
The left-side of Eq. (83) is now recast in the form of the inner product by performing the following operations:
- (i) The second term on the left-side of Eq. (83) is rearranged by using “integration by parts” as follows:
- (ii) The fourth term on the left-side of Eq. (83) is rearranged by using “integration by parts” as follows:
- (iii) The fifth term on the left-side of Eq. (83) is rearranged as follows:
- (iv) The sixth term on the left-side of Eq. (83) is rearranged as follows:
Inserting the results obtained in Eqs. (88)‒(91) into the left-side of Eq. (83) yields the following relation:
The right-side of Eq. (92) can now be required to represent the indirect-effect term defined in Eq. (87), by imposing the requirement that the hitherto arbitrary function $\mathbf{a}^{(2)}(j; t)$ be the solution of the following NIE-Volterra equations, for $j = 1, \ldots, TF$:
It follows from Eqs. (92)‒(94) that the indirect-effect term defined by Eq. (78) or, equivalently, by Eq. (87), can be expressed in terms of the function $\mathbf{a}^{(2)}(j; t)$ as follows, for $j = 1, \ldots, TF$:
The second-order sensitivities $\partial^2 R/\partial F_k\,\partial F_j$ of the decoder response with respect to the components of the feature function are obtained by adding the expression of the indirect-effect term obtained in Eq. (95) to the expression for the direct-effect term obtained in Eq. (77), and by subsequently identifying the expressions that multiply the variations $\delta F_k$. The expressions thus obtained, for $j, k = 1, \ldots, TF$, are as follows:
The NIE-Volterra system presented in Eqs. (93) and (94) is called the “2nd-Level Adjoint Sensitivity System (2nd-LASS)” and its solution, $\mathbf{a}^{(2)}(j; t)$, $j = 1, \ldots, TF$, is called the “2nd-level adjoint sensitivity function.” Since the sources on the right-sides of Eqs. (93) and (94) stem from the first-order sensitivities $\partial R/\partial F_j$, $j = 1, \ldots, TF$, they depend on the index “j”, which implies that to each first-order sensitivity $\partial R/\partial F_j$ there corresponds a distinct 2nd-LASS, having a distinct solution $\mathbf{a}^{(2)}(j; t)$; this fact has been emphasized by using the index “j” in the list of arguments of this 2nd-level adjoint sensitivity function. Therefore, there will be as many 2nd-level adjoint functions as there are distinct first-order sensitivities $\partial R/\partial F_j$, which is equal to the number $TF$ of components of the feature function $\mathbf{F}(\boldsymbol{\theta})$. Notably, the integral operators on the left-sides of Eqs. (93) and (94) do not depend on the index “j”, which means that the same left-side needs to be inverted for computing each 2nd-level adjoint function, regardless of the source term on the right-side (which corresponds to the particular component of the feature function). Therefore, if the inverses of the operators appearing on the left-sides of Eqs. (93) and (94) can be stored, they need not be recomputed repeatedly, so the various 2nd-level adjoint functions can be computed most efficiently.
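This reuse of a single operator inversion can be illustrated by the following minimal sketch, in which the discretized left-side operator and the feature-dependent sources are illustrative placeholders:

```python
import numpy as np
from scipy.linalg import lu_factor, lu_solve

# Illustrative discretized 2nd-LASS: the left-side operator A is the same for
# every index j; only the source term b_j (built from dR/dF_j) changes with j.
rng = np.random.default_rng(1)
n, TF = 300, 8
A = np.eye(n) - np.tril(0.2 * rng.random((n, n)) / n, k=-1)  # fixed operator
B = rng.random((n, TF))                                      # one source per feature

# Factor A once (i.e., "store the inverse"), then back-substitute cheaply
# for each of the TF feature-dependent sources.
lu, piv = lu_factor(A)
second_level_adjoints = [lu_solve((lu, piv), B[:, j]) for j in range(TF)]
# Each solution plays the role of the 2nd-level adjoint function a2(j; .).
```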
The second-order sensitivities of the decoder response with respect to the optimal weights/parameters, $\partial^2 R/\partial\theta_k\,\partial\theta_j$, are obtained analytically by using the chain rule in conjunction with the expressions obtained in Eqs. (96) and (18), as follows:
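Written out, this is the standard second-order chain rule for compound functions, expressed in the notation adopted here:

$$\frac{\partial^2 R}{\partial\theta_k\,\partial\theta_j} = \sum_{l=1}^{TF}\sum_{m=1}^{TF}\frac{\partial^2 R}{\partial F_m\,\partial F_l}\,\frac{\partial F_m}{\partial\theta_k}\,\frac{\partial F_l}{\partial\theta_j} + \sum_{l=1}^{TF}\frac{\partial R}{\partial F_l}\,\frac{\partial^2 F_l}{\partial\theta_k\,\partial\theta_j}.$$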
When there are no feature functions but only individual model parameters, i.e., when $F_j(\boldsymbol{\theta}) \equiv \theta_j$ for all $j = 1, \ldots, TW$, the expression obtained in Eq. (96) directly yields the second-order sensitivities $\partial^2 R/\partial\theta_k\,\partial\theta_j$, for all $j, k = 1, \ldots, TW$. In this case, however, the 2nd-LASS would need to be solved $TW$ times, rather than just $TF$ times (with $TF < TW$) when feature functions can be constructed.