Preprint
Article

The Second-Order Features Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems (2nd-FASAM-L): Mathematical Framework and Illustrative Application to an Energy System

Altmetrics

Downloads

85

Views

18

Comments

0

A peer-reviewed article of this preprint also exists.

This version is not peer-reviewed

Submitted:

01 April 2024

Posted:

05 April 2024

You are already at the latest version

Alerts
Abstract
This work presents the mathematical framework of the Second-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems (abbreviated as “2nd-CASAM-L”), which enables the most efficient computation of exactly obtained mathematical expressions of first- and second-order sensitivities of a generic system response with respect to functions (“features”) of model parameters. Subsequently, the first- and second-order sensitivities with respect to the model’s uncertain parameters, boundaries, and internal interfaces are obtained analytically and exactly, without needing large-scale computations. Within the 2nd-FASAM-L methodology, the number of large-scale computations is proportional to the number of model features (defined as functions of model parameters), as opposed to being proportional to the number of model parameters, which are considerably more than the number of features, being incomparably more efficient and more accurate than any other methods (statistical, finite differences, etc.) for computing exact expressions of response sensitivities (of any order) with respect to the model’s features and/or primary uncertain parameters, boundaries, and internal interfaces. The application of the 2nd-CASAM-L methodology is illustrated using a simplified energy-dependent neutron transport model of fundamental significance in nuclear reactor physics.
Keywords: 
Subject: Physical Sciences  -   Mathematical Physics

1. Introduction

The analysis of the accuracy of results (usually called “responses”) computed by models relies fundamentally on the functional derivatives (usually called “sensitivities”) of the respective model responses with respect to the parameters in the respective computational model. Such sensitivities are needed for many purposes, including: (i) understanding the model by ranking the importance of the various parameters; (ii) performing “reduced-order modeling” by eliminating unimportant parameters and/or processes; (iii) quantifying the uncertainties induced in a model response due to model parameter uncertainties; (iv) performing “model validation,” by comparing computations to experiments to address the question “does the model represent reality?” (v) prioritizing improvements in the model; (vi) performing data assimilation and model calibration as part of forward “predictive modeling” to obtain best-estimate predicted results with reduced predicted uncertainties; (vii) performing inverse “predictive modeling”; (viii) designing and optimizing the system.
Response sensitivities are computed by using either deterministic or statistical methods. The simplest deterministic method for computing response sensitivities is to use finite-difference schemes in conjunction with re-computations using the model with “judiciously chosen” altered parameter values. Evidently, such methods can at best compute approximate values of a very limited number of sensitivities. Deterministic methods that can compute more exactly the values of first-order sensitivities include the “Green’s function method” [1], the “forward sensitivity analysis methodology” [2], and the “direct method” [3], which rely on analytical or numerical differentiation of the computational model under investigation to compute local response sensitivities exactly. However, for a computational model comprising many parameters, the conventional deterministic methods become impractical for computing sensitivities higher than first-order because they are subject to the “curse of dimensionality,” a term coined by Belmann [4] to describe phenomena in which the number of computations increases exponentially in the respective phase-space. In the particular case of sensitivity analysis using conventional deterministic methods, the number of large-scale computations increases exponentially in the phase-space of model parameter as the order of sensitivities increases.
The alternatives to “deterministic methods” are the “statistical methods”, which construct an approximate response distribution (often called “response surface”) in the parameters space, and subsequently use scatter plots, regression, rank transformation, correlations, and so-called “partial correlation analysis,” in order to identify approximate expectation values, variances and covariances for the responses. These statistical quantities are subsequently used to construct quantities that play the role of (approximate) first-order response sensitivities. Thus, statistical methods commence with “uncertainty analysis” and subsequently attempt an approximate “sensitivity analysis” of the approximately computed model response (called a “response surface”) in the phase-space of the parameters under consideration. The currently popular statistical methods for uncertainty and sensitivity analysis are broadly categorized as sampling-based methods [5,6], variance-based methods [7,8], and Bayesian methods [9]. Various variants of the statistical methods for uncertainty and sensitivity analysis are reviewed in the book edited by Saltarelli et al. [10]. The main advantage of using statistical methods for uncertainty and sensitivity analysis is that they are conceptually easy to implement. On the other hand, statistical methods for uncertainty and sensitivity analysis have three major inherent practical drawbacks, as follows:
(i)
Even first-order sensitivities cannot be computed exactly.
(ii)
Statistical methods are (also) subject to the curse of dimensionality and have not been developed for producing second- and higher-order sensitivities.
(iii)
Since the response sensitivities and parameter uncertainties are amalgamated, inherently and inseparably, within the results produced by statistical methods, improvements in parameter uncertainties cannot be directly propagated to improve response uncertainties; rather, the entire set of simulations and statistical post-processing must be repeated anew.
(iv)
A “fool-proof” statistical method for analyzing correctly models involving highly correlated parameters does not seem to exist currently, so that particular care must be used when interpreting regression results obtained using such models.
It is known that the “adjoint method of sensitivity analysis” has been the most efficient method for computing exactly first-order sensitivities, since it requires a single large-scale (adjoint) computation for computing all of the first-order sensitivities, regardless of the number of model parameters. The idea underlying the computation of response sensitivities with respect to model parameters using adjoint operators was first used by Wigner [11] to analyze first-order perturbations in nuclear reactor physics and shielding models based on the linear neutron transport (or diffusion) equation, as subsequently described in textbooks on these subjects [12,13,14,15,16]. Cacuci [2] is credited (see, e.g., [17,18]) for having conceived the rigorous “1st-order adjoint sensitivity analysis methodology” for generic large-scale nonlinear (as opposed to linearized) systems involving generic operator responses and having introduced these principles to the earth, atmospheric and other sciences.
Cacuci [19,20] has extended his 1st-order adjoint sensitivity analysis methodology to enable the comprehensive computation of 2nd-order sensitivities of model responses to model parameters (including imprecisely known domain boundaries and interfaces) for large-scale linear and nonlinear systems. The 2nd-order adjoint sensitivity analysis methodology for linear systems was applied [21] to compute exactly the 21,976 first-order sensitivities and 482,944,576 second-order sensitivities (of which 241,483,276 are distinct from each other) for an OECD/NEA reactor physics benchmark [22]. This benchmark is modeled by the neutron transport equation involving 21,976 uncertain parameters, the solving of which is representative of “large-scale computations.” The neutron transport equation was solved using the software package PARTISN [23] in conjunction with the MENDF71X cross section library [24], which comprises 618-group cross sections based on ENDF/B-VII.1 nuclear data [25]. The spontaneous fission source was computed using the code SOURCES4C [26]. This work has demonstrated that, contrary to the widely held belief that second- and higher-order sensitivities are negligeable for reactor physics systems, many 2nd-order sensitivities of the OECD benchmark’s response to the benchmark’s uncertain parameters were much larger than the largest 1st-order ones. This finding has motivated the investigation of the largest 3rd-order sensitivities, many of which were found to be even larger than the 2nd-order ones. Subsequently, the mathematical framework for determining and computing the 4th-order sensitivities was developed, and many of these were found to be larger than the 3rd-order ones. This sequence of findings has motivated the development by Cacuci [27] of the “nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems” (abbreviated as “nth-CASAM-L”), which was developed specifically for linear systems because important model responses produced by such systems are various Lagrangian functionals which depend simultaneously on both the forward and adjoint state functions governing the respective linear system. Among the most important such responses are the Raleigh quotient for computing eigenvalues and/or separation constants when solving partial differential equations, and the Schwinger functional for first-order “normalization-free” solutions [28,29]. These functionals play a fundamental role in optimization and control procedures, derivation of numerical methods for solving equations (differential, integral, integro-differential), etc.
In parallel with developing the nth-CASAM-L, Cacuci [30] has also developed the nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (nth-CASAM-N). Just like the nth-CASAM-L, the nth-CASAM-N is also formulated in linearly increasing higher-dimensional Hilbert spaces (as opposed to exponentially increasing parameter-dimensional spaces), thus overcoming the curse of dimensionality in sensitivity analysis of nonlinear systems, enabling the most efficient computation of exactly-determined expressions of arbitrarily high-order sensitivities of generic nonlinear system responses with respect to model parameters, uncertain boundaries and internal interfaces in the model’s phase-space.
Recently, Cacuci [31] has introduced the “Second-Order Function/Feature Adjoint Sensitivity Analysis Methodology for Nonlinear Systems” (2nd-FASAM-N), which enables a considerable reduction (by comparison to the 2nd-CASAM-N) of the number of large-scale computations needed to compute the second-order sensitivities of a model response with respect to the model parameters, thereby becoming the most efficient methodology known for computing second-order sensitivities exactly. Paralleling the construction of the 2nd-FASAM-N, this work introduces the “First- and Second-Order Function/Feature Adjoint Sensitivity Analysis Methodology for Response-Coupled Adjoint/Forward Linear Systems” (1st and 2nd-FASAM-L). The mathematical methodology of the 1st-FASAM-L is presented in Section 3, while the mathematical methodology of the 2nd-FASAM-L is presented in Section 4. The application of the 1st-FASAM-L and the 2nd-FASAM-L is illustrated in Section 5 by means of a simplified yet representative energy-dependent neutron-slowing down model which is fundamental importance to reactor physics and design [32,33,34]. The concluding discussion presented in Section 6 prepares the ground for the subsequent generalization of the present work to enable the most efficient possible computation of exact sensitivities of any (arbitrarily-high) order with respect to “feature functions” of model parameters and, hence, to the model’s parameters.

2. Mathematical Modeling of Response-Coupled Linear Forward and Adjoint Systems

The generic mathematical model considered in this work is fundamentally the same as was considered in [27], but with the major difference that functions (‘features”) of the primary model parameters will be generically identified within the model. The primary model parameters will be denoted as α 1 ,…, α T P , where the subscript “TP” indicates “Total number of Primary Parameters;” the qualifier “primary” indicates that these parameters do not depend on any other parameters within the model. These model parameters are considered to include imprecisely known geometrical parameters that characterize the physical system’s boundaries in the phase-space of the model’s independent variables. These boundaries depend on the physical system’s geometrical dimensions, which may be imprecisely known because of manufacturing tolerances. In practice, these primary model parameters are subject to uncertainties. It will be convenient to consider that these parameters are components of a “vector of primary parameters” denoted as α ( α 1 , , α T P ) T P , where T P denotes the TP-dimensional subset of the set of real scalars. For subsequent developments, matrices and vectors will be denoted using capital and lower-case bold letters, respectively. The symbol “ ” will be used to denote “is defined as” or “is by definition equal to.” Transposition will be indicated by a dagger ( ) superscript. The nominal parameter values will be denoted as α 0 [ α 1 0 , ... , α i 0 , .. , α T P 0 ] ; the superscript “0” will be used throughout this work to denote “nominal” or “mean” values.
The model is considered to comprise T I independent variables which will be denoted as x i , i = 1 , ... , T I , and are considered to be the components of a T I -dimensional column vector denoted as x ( x 1 , , x T I ) T I , where the sub/superscript “ T I ” denotes the “Total number of Independent variables.” The vector x T I of independent variables is considered to be defined on a phase-space domain, denoted as Ω ( α ) , Ω ( α ) { λ i ( α ) x i ω i ( α ) ; i = 1 , ... , T I } , the boundaries of which may depend on some of the model parameters α . The lower boundary-point of an independent variable is denoted as λ i ( α ) (e.g., the inner radius of a sphere or cylinder, the lower range of an energy-variable, the initial time-value, etc.), while the corresponding upper boundary-point is denoted as ω i ( α ) (e.g., the outer radius of a sphere or cylinder, the upper range of an energy-variable, the final time-value, etc.). A typical example of boundary conditions that depend on imprecisely-known parameters that pertain to the geometry of the model and also to parameters that pertain to the material properties of the respective model occur when modeling particle diffusion within a medium, the boundaries of which are facing vacuum. For such models, the boundary conditions for the respective state (dependent) variables (i.e., particle flux and/or current) are imposed not on the physical boundary but on the “extrapolated boundary” of the respective spatial domain. The “extrapolated boundary” depends both on the imprecisely known physical dimensions of the medium’s domain/extent and also on the medium’s properties, i.e., atomic number densities and microscopic transport cross sections. The boundary of Ω ( α ) , which will be denoted as Ω [ λ ( α ) ; ω ( α ) ] , comprises the set of all of the endpoints λ i ( α ) , ω i ( α ) , i = 1 , ... , T I , of the respective intervals on which the components of x are defined, i.e., Ω [ λ ( α ) ; ω ( α ) ] { λ i ( α ) ω i ( α ) , i = 1 , ... , T I } .
The mathematical model that underlies the numerical evaluation of a process and/or state of a physical system comprises equations that relate the system's independent variables and parameters to the system's state/dependent variables. A linear physical system can generally be modeled by a system of coupled operator-equations as follows:
L [ x ; g ( α ) ] φ ( x ) = Q [ x ; g ( α ) ] , x Ω ( α ) .
In Equation (1), the vector φ ( x ) [ φ 1 ( x ) , , φ T D ( x ) ] is a T D -dimensional column vector of dependent variables and where the sub/superscript “ T D ” denotes the “Total (number of) Dependent variables.” The functions φ i ( x ) , i = 1 , ... , T D , denote the system’s “dependent variables” (also called “state functions”). The matrix L ( x ; α ) [ L i j ( x ; α ) ] ,   i , j = 1 , ... , T D , has dimensions T D × T D . The components L i j ( x ; α ) are operators that act linearly on the dependent variables φ j ( x ) and also depend (in general, nonlinearly) on the uncertain model parameters α . Furthermore, the vector g ( α ) [ g 1 ( α ) , , g T G ( α ) ] is a T G -dimensional vector having components g i ( α ) , i = 1 , ... , T G , which are real-valued functions of (some of) the primary model parameters α T P . The quantity T G denotes the total number of such functions which appear exclusively in the definition of the model’s underlying equations. Such functions customarily appear in models in the form of correlations that describe “features” of the system under consideration, such as material properties, flow regimes. etc. Usually, the number of functions g i ( α ) is considerably smaller than the total number of model parameters, i.e., T G T P . For example, the numerical model (Cacuci and Fang, 2023) of the OECD/NEA reactor physics benchmark (Valentine 2006) comprises 21,976 uncertain primary model parameters (including microscopic cross sections and isotopic number densities) but the neutron transport equation, which is solved to determine the neutron flux distribution within the benchmark, does not use these primary parameters directly but instead uses just several hundreds of “group-averaged macroscopic cross sections” which are functions/features of the microscopic cross sections and isotopic number densities (which in turn are uncertain quantities that would be components of the vector of primary model parameters). In particular, a component g j ( α ) may simply be one of the primary model parameters α j , i.e., g j ( α ) α j .
The T D -dimensional column vector Q [ x ; g ( α ) ] ( q 1 , .... , q T D ) , having components q i [ x ; g ( α ) ] , i = 1 , ... , T D , denotes inhomogeneous source terms, which usually depend nonlinearly on the uncertain parameters α . Since the right-side of Equation (1) may contain distributions, the equality in this equation is considered to hold in the weak (i.e., “distributional”) sense. Similarly, all of the equalities that involve differential equations in this work will be considered to hold in the distributional sense.
When L [ x ; g ( α ) ] contains differential operators, a set of boundary and initial conditions which define the domain of L [ x ; g ( α ) ] must also be given. Since the complete mathematical model is considered to be linear in φ ( x ) , the boundary and/or initial conditions needed to define the domain of L [ x ; g ( α ) ] must also be linear in φ ( x ) . Such linear boundary and initial conditions are represented in the following operator form:
B [ x ; g ( α ) ; λ ( α ) ; ω ( α ) ] φ ( x ) = C [ x ; g ( α ) ; λ ( α ) ; ω ( α ) ] , x Ω [ λ ( α ) ; ω ( α ) ]
In Equation (2), the quantity B [ x ; g ( α ) ; λ ( α ) ; ω ( α ) ] denotes a matrix of dimensions N B × T D having components denoted as B i j ( x ; α ) ; i = 1 , ... , N B ; j = 1 , ... , T D , which are operators that act linearly on φ ( x ) and nonlinearly on the components of g ( α ) ; the quantity N B denotes the total number of boundary and initial conditions. The N B -dimensional column vector C [ x ; g ( α ) ; λ ( α ) ; ω ( α ) ] comprises components that are operators which, in general, act nonlinearly on the components of g ( α ) .
Physical problems modeled by linear systems and/or operators are naturally defined in Hilbert spaces. The dependent variables φ i ( x ) , i = 1 , ... , T D , for the physical system represented by Eqs. (1) and (2) are considered to be square-integrable functions of the independent variables and are considered to belong to a Hilbert space which will be denoted as H 0 ( Ω ) , where the subscript “zero” denotes “zeroth-level“ or “original.” Higher-level Hilbert spaces, which will be denoted as H 1 ( Ω ) and H 2 ( Ω ) , will also be used in this work. The Hilbert space H 0 ( Ω ) is considered to be endowed with the following inner product, denoted as φ ( x ) , ψ ( x ) 0 , between two elements φ ( x ) H 0 ( Ω ) and ψ ( x ) H 0 ( Ω ) :
φ ( x ) , ψ ( x ) 0 i = 1 T I λ i ( α ) ω i ( α ) φ ( x ) · ψ ( x ) d x = j = 1 T D λ 1 ( α ) ω 1 ( α ) ... λ i ( α ) ω i ( α ) ... λ T I ( α ) ω T I ( α ) φ j ( x ) ψ j ( x ) d x 1 ... d x i ... d x T I .
The “dot” in Equation (3) indicates the “scalar product of two vectors,” which is defined as follows:
φ ( x ) · ψ ( x ) i = 1 T D φ i ( x ) ψ i ( x ) .
The product-notation i = 1 T I λ i ( α ) ω i ( α ) [ ] d x i in Equation (3) denotes the respective multiple integrals.
The linear operator L [ x ; g ( α ) ] is considered to admit an adjoint operator, which will be denoted as L * [ x ; g ( α ) ] and which is defined through the following relation for a vector ψ ( x ) H 0
ψ ( x ) , L [ x ; g ( α ) ] φ ( x ) 0 = L * [ x ; g ( α ) ] ψ ( x ) , φ ( x ) 0
In Equation (5), the formal adjoint operator L * [ x ; g ( α ) ] is the T D × T D matrix comprising elements L j i * [ x ; g ( α ) ] which are obtained by transposing the formal adjoints of the forward operators L i j [ x ; g ( α ) ] . Hence, the system adjoint to the linear system represented by (1) and (2) can generally be represented as follows:
L * [ x ; g ( α ) ] ψ ( x ) = Q * [ x ; g ( α ) ] , x Ω ( α ) ,
B * [ x ; g ( α ) ; λ ( α ) ; ω ( α ) ] ψ ( x ) = C * [ x ; g ( α ) ; λ ( α ) ; ω ( α ) ] , x Ω [ λ ( α ) ; ω ( α ) ] .
When the forward operator L [ x ; g ( α ) ] comprises differential operators, the operations (e.g., integration by parts) that implement the transition from the left-side to the right side of Equation (5) give rise to boundary terms which are collectively called the “bilinear concomitant.” The domain of L * [ x ; g ( α ) ] is determined by selecting adjoint boundary and/or initial conditions so as to ensure that the adjoint system is well-posed mathematically. It is also desirable that the selected adjoint boundary conditions should cause the bilinear concomitant to vanish when implemented in Equation (5) together with the forward boundary conditions given in Equation (2). The adjoint boundary conditions thus selected are represented in operator form by Equation (7).
The relationship shown in Equation (5), which is the basis for defining the adjoint operator, also provides the following fundamental “reciprocity-like” relation between the sources of the forward and the adjoint equations, i.e. Eqs. (1) and (6), respectively:
ψ ( x ) , Q [ x ; g ( α ) ] 0 = Q * [ x ; g ( α ) ] , φ ( x ) 0
The functional on the right-side of Equation (8) represents a “detector response”, i.e., a reaction-rate between the particles and the medium represented by Q * [ x ; g ( α ) ] which is equivalent to the “number of counts” of particles incident on a detector of particles that “measures” the particle flux φ ( x ) . In view of the relation provided in (8), the vector-valued source term Q * [ x ; g ( α ) ] { q 1 * [ x ; g ( α ) ] , .... , q T D * [ x ; g ( α ) ] } in the adjoint equation Equation (6) is usually associated with the “result of interest” to be measured and/or computed, which is customarily called the system’s “response.” In particular, if q i * [ x ; g ( α ) ] = δ ( x x d ) and q j i * [ x ; g ( α ) ] = 0 , then Q * [ x ; g ( α ) ] , φ ( x ) 0 = φ i ( x d ) , which means that, in such a case, the right-side of Equation (8) provides the value of the ith-dependent variable (particle flux, temperature, velocity, etc.) at the point in phase-space where the respective measurement is performed.
The results computed using a mathematical model are customarily called “model responses” (or “system responses” or “objective functions” or “indices of performance”). For linear physical systems, the system’s response may depend not only on the model’s state-functions and on the system parameters but may simultaneously also depend on the adjoint state function. As has been discussed by Cacuci [27,30], any response of a linear system can be formally represented (using expansions or interpolation, if necessary) and fundamentally analyzed in terms of the following generic integral representation:
R [ φ ( x ) , ψ ( x ) ; f ( α ) ] λ 1 ( α ) ω 1 ( α ) ... λ T I ( α ) ω T I ( α ) S [ φ ( x ) , ψ ( x ) ; g ( α ) ; h ( α ) ; x ] d x 1 ... d x T I ,
where S [ φ ( x ) , ψ ( x ) ; g ( α ) ; h ( α ) ; x ] is a suitably differentiable nonlinear function of φ ( x ) , ψ ( x ) , and α . The integral representation of the response provided in Equation (9) can represent “averaged” and/or “point-valued” quantities in the phase-space of independent variables. For example, if R [ φ ( x ) , ψ ( x ) ; f ( α ) ] represents the computation or the measurement (which would be a “detector-response”) of a quantity of interest at a point x d in the phase-space of independent variables, then S [ φ ( x ) , ψ ( x ) ; g ( α ) ; h ( α ) ; x ] would contain a Dirac-delta functional of the form δ ( x x d ) . Responses that represent “differentials/derivatives of quantities” would contain derivatives of Dirac-delta functionals in the definition of S [ φ ( x ) , ψ ( x ) ; g ( α ) ; h ( α ) ; x ] . The vector h ( α ) [ h 1 ( α ) , , h T H ( α ) ] , having components h i ( α ) , i = 1 , ... , T H , which appears among the arguments of the function S [ φ ( x ) , ψ ( x ) ; g ( α ) ; h ( α ) ; x ] , represents functions of primary parameters that often appear solely in the definition of the response but do not appear in the mathematical definition of the model, i.e., in Eqs. (1), (2), (6) and (7). The quantity T H denotes the total number of such functions which appear exclusively in the definition of the model’s response. Evidently, the response will depend directly and/or indirectly (through the “feature”-functions) on all of the primary model parameters. This fact has been indicated in Equation (9) by using the vector-valued function f ( α ) as an argument in the definition of the response R [ φ ( x ) , ψ ( x ) ; f ( α ) ] to represent the concatenation of all of the “features” of the model and response under consideration. The vector f ( α ) of “model features” is thus defined as follows:
f ( α ) [ g ( α ) ; h ( α ) ; λ ( α ) ; ω ( α ) ] [ f 1 ( α ) , ... , f T F ( α ) ] ; T F T G + T H + 2 T I .
As defined in Equation (10), the quantity T F denotes the total number of “feature functions of the model’s parameters” which appear in the definition of the nonlinear model’s underlying equations and response.
Solving Eqs. (1) and (2), at the nominal (or mean) values, denoted as α 0 [ α 1 0 , ... , α i 0 , .. , α T P 0 ] , of the model parameters, yields the nominal forward solution, which will be denoted as φ 0 ( x ) . Solving Eqs. (6) and (7) at the nominal values, α 0 , of the model parameters yields the nominal adjoint solution, which will be denoted as ψ 0 ( x ) . The nominal value of the response, R [ φ 0 ( x ) , ψ 0 ( x ) ; f ( α 0 ) ] , is determined by using the nominal parameter values α 0 , the nominal value φ 0 ( x ) of the forward state function, and the nominal value ψ 0 ( x ) of the adjoint state function.
The definition provided by Equation (9) implies that the model response R [ φ ( x ) , ψ ( x ) ; f ( α ) ] depends on the components of the feature function f ( α ) , and would therefore admit a Taylor-series expansion around the nominal value f 0 f ( α 0 ) , having the following form:
R [ f ( α ) ] = R ( f 0 ) + j 1 = 1 T F { R ( f ) f j 1 } f 0 δ f j 1 + 1 2 j 1 = 1 T F j 2 = 1 T F { 2 R ( f ) f j 1 f j 2 } f 0 δ f j 1 δ f j 2 + ...
where δ f j [ f j ( α ) f j 0 ] ; f j 0 f j ( α 0 ) ; j = 1 , ... , T F . The “sensitivities of the model response with respect to the (feature) functions” are naturally defined as being the functional derivatives of R [ f ( α ) ] with respect to the components (“features”) f j ( α ) of f ( α ) . The notation { · } f 0 indicates that the quantity enclosed within the braces is to be evaluated at the nominal values f 0 f ( α 0 ) . Since T F T P , the computations of the functional derivatives of R k [ f ( α ) ] with respect to the functions f j ( α ) , which appear in Equation (11), will be considerably less expensive computationally than the computation of the functional derivatives involved in the Taylor-series of the response with respect to the model parameters. The functional derivatives of the response with respect to the parameters can be obtained from the functional derivatives of the response with respect to the “feature” functions f j ( α ) by simply using the chain rule, i.e.:
{ R ( α ) α j 1 } α 0 = i 1 = 1 T F { R ( f ) f i 1 f i 1 ( α ) α j 1 } α 0 ; { 2 R ( α ) α j 1 α j 2 } α 0 = α j 2 i 1 = 1 T F { R ( f ) f i 1 f i 1 ( α ) α j 1 } α 0 ;
and so on. The evaluation/computation of the functional derivatives f i 1 ( α ) / α j 1 , 2 f i 1 ( α ) / α j 1 α j 2 , etc., does not require computations involving the model, and is therefore computationally trivial by comparison to the evaluation of the functional derivatives (“sensitivities”) of the response with respect to either the functions (“features”) f j ( α ) or the model parameters α i , i = 1 , ... , T P .
The range of validity of the Taylor-series shown in Equation (11) is defined by its radius of convergence. The accuracy −as opposed to the “validity”− of the Taylor-series in predicting the value of the response at an arbitrary point in the phase-space of model parameters depends on the order of sensitivities retained in the Taylor-expansion: the higher the respective order, the more accurate the respective response value predicted by the Taylor-series. In the particular cases when the response happens to be a polynomial function of the “feature” functions f j ( α ) , the Taylor series represented by Equation (11) is finite and exactly represents the respective model response.
In turn, the functions f i ( α ) can also be formally expanded in a multivariate Taylor-series around the nominal (mean) parameter values α 0 , namely:
f i ( α ) = f i ( α 0 ) + j 1 = 1 T P { f i ( α ) α j 1 } α 0 δ α j 1 + 1 2 j 1 = 1 T P j 2 = 1 T P { 2 f i ( α ) α j 1 α j 2 } α 0 δ α j 1 δ α j 2 + 1 3 ! j 1 = 1 T P j 2 = 1 T P j 3 = 1 T P { 3 f i ( α ) α j 1 α j 2 α j 3 } α 0 δ α j 1 δ α j 2 δ α j 3 + ... ,
The choice of feature functions f i ( α ) is not unique but can be tailored by the user to the problem at hand. The two most important guiding principles for constructing the feature functions f i ( α ) based on the primary parameters are as follows:
(i)
As will be shown below in Section 4 while establishing the mathematical framework underlying the 2nd-FASAM-L, the number of large-scale computations needed to determine the numerical value of the second-order sensitivities is proportional to the number of first-order sensitivities of the model’s response with respect to the feature functions fi(α). Consequently, it is important to minimize the number of feature functions fi(α), while ensuring that all of the primary model parameters are considered within the expressions constructed for the feature functions fi(α). In the extreme case when some primary parameters, αj, cannot be grouped into the expressions of the feature functions fi(α), each of the respective primary model parameters αj becomes a feature function fj(α).
(ii)
The expressions of the features functions fi(α) must be independent of the model’s state functions; they must be exact, closed-form, scalar-valued functions of the primary model parameters αj, so the exact expressions of the derivatives of fi(α) with respect to the primary model parameters αj can be obtained analytically (with “pencil and paper”) and, hence, inexpensively from a computational standpoint. The motivation for this requirement is to ensure that the numerical determination of the subsequent derivatives of the features functions fi(α) with respect to the primary model parameters αj becomes trivial computationally.
The domain of validity of the Taylor-series in Equation (13) is defined by its own radius of convergence. Of course, in the extreme case when no feature function can be constructed, the feature functions will be the primary parameters themselves, in which case the nth-FASAM-L methodology becomes identical to the previously established nth-CASAM-L methodology [27].

3. The First-Order Function/Feature Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward and Adjoint Linear Systems (1st-FASAM-L)

The “First-Order Function/Feature Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems” (1st-FASAM-L) aims at enabling the most efficient computation of the first-order sensitivities of a generic model response of the form R [ φ ( x ) , ψ ( x ) ; α ] with respect to the components of the “features” function f ( α ) . In preparation for subsequent generalizations towards establishing the generic pattern for computing sensitivities of arbitrarily high-order, the function u ( 1 ) ( 2 ; x ) [ φ ( x ) , ψ ( x ) ] will be called the “1st-level forward/adjoint function” and the system of equations satisfied by this function (which is obtained by concatenating the original forward and adjoint equations together with their respective boundary/initial conditions) will be called “the 1st-Level Forward/Adjoint System (1st-LFAS)” and will be re-written in the following concatenated matrix-form:
F ( 1 ) [ 2 × 2 ; x ; f ] u ( 1 ) ( 2 ; x ) = q F ( 1 ) ( 2 ; x ; f ) ; x Ω ( α ) ;
b F ( 1 ) [ u ( 1 ) ( 2 ; x ) ; f ] = 0 ; x Ω [ λ ( α ) ; ω ( α ) ] ;
where the following definitions were used:
F ( 1 ) [ 2 × 2 ; x ; f ] ( L ( x ; f ) 0 0 L * ( x ; f ) ) ; u ( 1 ) ( 2 ; x ) [ φ ( x ) , ψ ( x ) ] ;
q F ( 1 ) ( 2 ; x ; f ) ( Q ( x ; g ) Q * ( x ; g ) ) ; b F ( 1 ) [ 2 ; u ( 1 ) ( 2 ; x ) ; f ] ( B ( x ; f ) φ ( x ) C ( f ) B * ( x ; f ) ψ ( x ) C * ( f ) ) .
In the list of arguments of the matrix F ( 1 ) [ 2 × 2 ; x ; f ] , the argument “ 2 × 2 ” indicates that this square matrix comprises four component sub-matrices, as indicated in Equation (16). Similarly, the argument “2” that appears in the block-vectors u ( 1 ) ( 2 ; x ) , q F ( 1 ) ( 2 ; x ; f ) , and b F ( 1 ) [ 2 ; u ( 1 ) ( 2 ; x ) ; f ] indicates that each of these column block-vectors comprises two sub-vectors as components. Also, throughout this work, the quantity “ 0 ” will be used to denote either a vector or a matrix with zero-valued components, depending on the context. For example, the vector “ 0 ” in Equation (15) is considered to have as many components as the vector b F ( 1 ) [ u ( 1 ) ( 2 ; x ) ; f ] . On the other hand, the quantity “ 0 ” which appears in Equation (16) may represent either a (sub) matrix or a vector of the requisite dimensions.
The nominal (or mean) parameter values, α 0 , are considered to be known, but these values will differ from the true values α , which are unknown, by variations δ α ( δ α 1 , , δ α T P ) , where δ α i α i α i 0 . The parameter variations δ α will induce variations δ f ( α ) [ δ f 1 ( α ) , , δ f T F ( α ) ] in the vector-valued function f ( α ) , around the nominal value f 0 f ( α 0 ) , and will also induce variations δ φ ( x ) and δ ψ ( x ) , respectively, around the nominal solution ( φ 0 , ψ 0 ) through the equations underlying the model. All of these variations will induce variations in the model response R [ u ( 1 ) ( 2 ; x ) ; f ] R [ φ ( x ) , ψ ( x ) ; f ( α ) ] , in a neighborhood [ φ 0 ( x ) + ε δ φ ( x ) , ψ 0 ( x ) + ε δ ψ ( x ) ; f 0 + ε δ f ] around ( φ 0 , ψ 0 ; f 0 ) , where ε is a real-valued scalar.
Formally, the first-order sensitivities of the response R [ u ( 1 ) ( 2 ; x ) ; f ] with respect to the components of the feature function f ( α ) are provided by the first-order Gateaux (G-)variation of R ( φ , ψ , f ) at the phase-space point ( φ 0 , ψ 0 , f 0 ) , which is defined as follows:
δ R ( φ 0 , ψ 0 , f 0 ; δ φ , δ ψ , δ f ) { d d ε R [ φ 0 ( x ) + ε δ φ ( x ) , ψ 0 ( x ) + ε δ ψ ( x ) ; f 0 + ε δ f ] } ε = 0 { d d ε R [ u ( 1 , 0 ) ( 2 ; x ) + ε v ( 1 ) ( 2 ; x ) ; f 0 + ε δ f ] } ε = 0 δ R [ u ( 1 , 0 ) ( 2 ; x ) ; f 0 ; v ( 1 ) ( 2 ; x ) , δ f ] ,
where the following definitions were used:
u ( 1 , 0 ) ( 2 ; x ) [ φ 0 ( x ) , ψ 0 ( x ) ] ; v ( 1 ) ( 2 ; x ) [ δ φ ( x ) , δ ψ ] .
In general, the G-variation δ R ( φ 0 , ψ 0 , f 0 ; δ φ , δ ψ , δ f ) is nonlinear in the variations δ f ( α ) , δ φ ( x ) and/or δ ψ ( x ) . In such cases, the partial functional Gateaux (G-)derivatives of the response R ( φ , ψ , f ) with respect to the functions φ , ψ , f do not exist, which implies that the response sensitivities to the model parameters do not exist, either. Therefore, it will be henceforth assumed in this work that δ R ( φ 0 , ψ 0 , f 0 ; δ φ , δ ψ , δ f ) is linear in the respective variations, so the corresponding partial G-derivatives exist and δ R ( φ 0 , ψ 0 , f 0 ; δ φ , δ ψ , δ f ) is actually the first-order G-differential of the response. The usual numerical methods (e.g., Newton’s method and variants thereof) for solving the equations underlying the model also require the existence of the first-order G-derivatives of the original model equations; these will also be assumed to exist. When the 1st-order G-derivatives exists, the G-differential δ R [ u ( 1 , 0 ) ( 2 ; x ) ; f 0 ; v ( 1 ) ( 2 ; x ) , δ f ] can be written as follows:
δ R [ u ( 1 , 0 ) ( 2 ; x ) ; f 0 ; v ( 1 ) ( 2 ; x ) , δ f ] = { δ R [ u ( 1 ) ( 2 ; x ) ; f ; δ f ] } d i r + { δ R [ u ( 1 ) ( 2 ; x ) ; f ; v ( 1 ) ( 2 ; x ) ] } i n d .
In Equation (20), the “direct-effect” term { δ R [ u ( 1 ) ( 2 ; x ) ; f ; δ f ] } d i r comprises only dependencies on δ f ( α ) and is defined as follows:
{ δ R [ u ( 1 ) ( 2 ; x ) ; f ; δ f ] } d i r { R ( u ( 1 ) ; f ) f δ f } α 0
The following convention/definition was used in Equation (21):
[ ] f δ f i = 1 T F [ ] f i δ f i = i = 1 T G [ ] g i δ g i + i = 1 T H [ ] h i δ h i + i = 1 T I [ ] ω i δ ω i + i = 1 T I [ ] λ i δ λ i
The above convention implies that:
(a) For j = 1 , ... , T G :
R ( u ( 1 ) ; f ) f j δ f j { λ 1 ( α ) ω 1 ( α ) ... λ T I ( α ) ω T I ( α ) S ( φ , ψ ; g ; h ) g i d x 1 ... d x T I } α 0 δ g i ; i = 1 , ... , T G ;
(b) For j = T G + 1 , ... , T G + T H :
R ( u ( 1 ) ; f ) f j δ f j { λ 1 ( α ) ω 1 ( α ) ... λ T I ( α ) ω T I ( α ) S ( φ , ψ ; g ; h ) h i d x 1 ... d x T I } α 0 δ h i ; i = 1 , ... , T H ;
(c) For j = T G + T H + 1 , ... , T G + T H + T I :
R ( u ( 1 ) ; f ) f j δ f j { ω i λ 1 ω 1 d x 1 ... λ T I ω T I d x T I S ( φ , ψ ; g ; h ) } α 0 δ ω i , i = 1 , ... , T I ;
(d) For j = T G + T H + T I + 1 , ... , T G + T H + 2 T I :
R ( u ( 1 ) ; f ) f j δ f j { λ i λ 1 ω 1 d x 1 ... λ T I ω T I d x T I S ( φ , ψ ; g ; h ) } α 0 δ λ i , i = 1 , ... , T I .
The notation on the left-side of Equation (22) represents the inner product between two vectors, but the symbol “( )” which indicates “transposition” has been omitted in order to keep the notation as simple as possible. “Daggers” indicating transposition will also be omitted in other inner products, whenever possible, while avoiding ambiguities.
In Equation (20), the “indirect-effect” term { δ R [ u ( 1 ) ( 2 ; x ) ; f ; v ( 1 ) ( 2 ; x ) ] } i n d depends only on the variations v ( 1 ) ( 2 ; x ) [ δ φ ( x ) , δ ψ ] in the state functions, and is defined as follows:
{ δ R [ u ( 1 ) ( 2 ; x ) ; f ; v ( 1 ) ( 2 ; x ) ] } i n d { λ 1 ( α ) ω 1 ( α ) d x 1 ... λ T I ( α ) ω T I ( α ) d x T I S ( φ , ψ ; g ; h ) u ( 1 ) ( 2 ; x ) v ( 1 ) ( 2 ; x ) } α 0 { λ 1 ( α ) ω 1 ( α ) d x 1 ... λ T I ( α ) ω T I ( α ) d x T I S ( φ , ψ ; g ; h ) φ δ φ } α 0 + { λ 1 ( α ) ω 1 ( α ) d x 1 ... λ T I ( α ) ω T I ( α ) d x T I S ( φ , ψ ; g ; h ) ψ δ ψ } α 0 .
In Eqs. (21) and (27), the notation { } α 0 has been used to indicate that the quantity within the brackets is to be evaluated at the nominal values of the parameters and state functions. This simplified notation is justified by the fact that when the parameters take on their nominal values, it implicitly means that the corresponding state functions also take on their corresponding nominal values. This simplified notation will be used throughout this work.
The direct-effect term can be computed after having solved the forward system modeled by Eqs. (1) and (2), as well as the adjoint system modeled by Eqs. (6) and (7), to obtain the nominal values φ 0 , ψ 0 of the forward and adjoint dependent variables.
On the other hand, the indirect-effect term { δ R [ u ( 1 ) ( 2 ; x ) ; f ; v ( 1 ) ( 2 ; x ) ] } i n d defined in Equation (27) can be quantified only after having determined the variations v ( 1 ) ( 2 ; x ) [ δ φ ( x ) , δ ψ ] in the state functions of the 1st-Level Forward/Adjoint System (1st-LFAS). The variations v ( 1 ) ( 2 ; x ) are obtained as the solutions of the system of equations obtained by taking the first-order G-differentials of the 1st-LFAS defined by Eqs. (14) and (15), which are obtained by definition as follows:
{ d d ε F ( 1 ) [ 2 × 2 ; x ; f 0 + ε δ f ] [ u ( 1 , 0 ) ( 2 ; x ) + ε v ( 1 ) ( 2 ; x ) ] } ε = 0 = { d d ε q F ( 1 ) [ 2 ; x ; f 0 + ε δ f ] } ε = 0 ,
{ d d ε b F ( 1 ) [ 2 ; u ( 1 , 0 ) ( 2 ; x ) + ε v ( 1 ) ( 2 ; x ) ; f 0 + ε δ f ] } ε = 0 = 0 [ 2 ] .
Carrying out the differentiations with respect to ε in the above equations and setting ε = 0 in the resulting expressions yields the following matrix-vector equations:
{ V ( 1 ) [ 2 × 2 ; x ; f ] v ( 1 ) ( 2 ; x ) } α 0 = { q V ( 1 ) [ 2 ; u ( 1 ) ( 2 ; x ) ; f ; δ f ] } α 0 ; x Ω ( α 0 ) ;
{ b v ( 1 ) ( u ( 1 ) ; v ( 1 ) ; f ; δ f ) } α 0 = 0 ; x Ω [ λ ( α 0 ) ; ω ( α 0 ) ] ;
where:
V ( 1 ) [ 2 × 2 ; x ; f ] ( L ( x ; f ) 0 0 L * ( x ; f ) ) = F ( 1 ) [ 2 × 2 ; x ; f ] ;
q V ( 1 ) [ 2 ; u ( 1 ) ; f ; δ f ] ( q 1 ( 1 ) ( φ ; f ; δ f ) q 2 ( 1 ) ( ψ ; f ; δ f ) ) ; b v ( 1 ) ( u ( 1 ) ; v ( 1 ) ; f ; δ f ) ( b 1 ( 1 ) ( φ ; δ φ ; f ; δ f ) b 2 ( 1 ) ( ψ ; δ ψ ; f ; δ f ) ) ;
 
q 2 ( 1 ) ( ψ , f ; δ f ) [ Q * L * ψ ( x ) ] f δ f j 1 = 1 T F s 2 ( 1 ) ( j 1 ; ψ ; f ) δ f j 1
b 1 ( 1 ) ( φ ; δ φ ; f ; δ f ) B δ φ + ( B φ C ) f δ f ;
b 2 ( 1 ) ( ψ ; δ ψ ; f ; δ f ) B * δ ψ + ( B * ψ C * ) f δ f .
In order to keep the notation as simple as possible in Eqs. (30)‒(37), the differentials with respect to the various components of the feature function f ( α ) have all been written in the form ( [ ] / f ) δ f , keeping in mind the convention/notation introduced in Equation (22). The system of equations comprising Eqs. (30) and (31) will be called the “1st-Level Variational Sensitivity System (1st-LVSS)” and its solution, v ( 1 ) ( 2 ; x ) , will be called the “1st-level variational sensitivity function,” which is indicated by the superscript “(1)”. The solution, v ( 1 ) ( 2 ; x ) , of the 1st-LVSS will be a function of the components of the vector of variations δ f . In principle, therefore, if the response sensitivities with respect to the components of the feature function f ( α ) are of interest, then the 1st-LVSS would need to be solved as many times as there are components in the variational features-function δ f . On the other hand, if the response sensitivities with respect to the primary parameters are of interest, then the 1st-LVSS would need to be solved as many times as there are primary parameters. Solving the 1st-LVSS involves “large-scale computations.”
On the other hand, solving the 1st-LVSS can be avoided altogether by using the ideas underlying the “adjoint sensitivity analysis methodology” originally conceived by Cacuci [2] and subsequently generalized by Cacuci [27,30] to enable the computation of arbitrarily high-order response sensitivities with respect to primary model parameters for both linear and nonlinear models. Thus, the need for solving repeatedly the 1st-LVSS for every variation in the components of the feature function (or for every variation in the model’s parameters) is eliminated by expressing the indirect-effect term { δ R [ u ( 1 ) ( 2 ; x ) ; f ; v ( 1 ) ( 2 ; x ) ] } i n d defined in Equation (27) in terms of the solutions of the “1st-Level Adjoint Sensitivity System” (1st-LASS), which will be constructed by implementing the following sequence of steps:
  • Introduce a Hilbert space, denoted as H 1 , comprising vector-valued elements of the form χ ( 1 ) ( 2 ; x ) [ χ 1 ( 1 ) ( x ) , χ 2 ( 1 ) ( x ) ] , where the components χ i ( 1 ) ( x ) [ χ i , 1 ( 1 ) ( x ) , ... , χ i , j ( 1 ) ( x ) , ... , χ i , T D ( 1 ) ( x ) ] , i = 1 , 2 , are square-integrable functions. Consider further that this Hilbert space is endowed with an inner product denoted as χ ( 1 ) ( 2 ; x ) , θ ( 1 ) ( 2 ; x ) 1 between two elements, χ ( 1 ) ( 2 ; x ) H 1 , θ ( 1 ) ( 2 ; x ) H 1 , which is defined as follows:
    χ ( 1 ) ( 2 ; x ) , θ ( 1 ) ( 2 ; x ) 1 i = 1 2 χ i ( 1 ) ( x ) , θ i ( 1 ) ( x ) 0 .
  • In the Hilbert H 1 , form the inner product of Equation (30) with a yet undefined vector-valued function a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] H 1 to obtain the following relation:
    { a ( 1 ) ( 2 ; x ) , V ( 1 ) [ 2 × 2 ; x ; f 0 ] v ( 1 ) ( 2 ; x ) 1 } α 0 = { a ( 1 ) ( 2 ; x ) , q V ( 1 ) [ 2 ; u ( 1 ) ( 2 ; x ) ; f ; δ f ] 1 } α 0 .
  • Using the definition of the adjoint operator in the Hilbert space H 1 , recast the left-side of Equation (39) as follows:
    { a ( 1 ) ( 2 ; x ) , V ( 1 ) [ 2 × 2 ; x ; f ] v ( 1 ) ( 2 ; x ) 1 } α 0 = { v ( 1 ) ( 2 ; x ) , A ( 1 ) [ 2 × 2 ; x ; f ] a ( 1 ) ( 2 ; x ) 1 } α 0 + { P ( 1 ) [ v ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ; δ f ] } α 0 ,
    where { P ( 1 ) [ v ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ; δ f ] } α 0 denotes the bilinear concomitant defined on the phase-space boundary x Ω ( α 0 ) , and where A ( 1 ) [ 2 × 2 ; x ; f ] is the operator formally adjoint to V ( 1 ) [ 2 × 2 ; x ; f ] , i.e.,
    A ( 1 ) [ 2 × 2 ; x ; f ] { V ( 1 ) [ 2 × 2 ; x ; f ] } * = ( L * ( x ; f ) 0 0 L ( x ; f ) ) .
  • Require the first term on right-side of Equation (40) to represent the indirect-effect term defined in Equation (27),by imposing the following relation:
    A ( 1 ) [ 2 × 2 ; x ; f ] a ( 1 ) ( 2 ; x ) = q A ( 1 ) [ 2 ; u ( 1 ) ( 2 ; x ) ; f ] , x Ω ( α 0 ) ;
    where:
    q A ( 1 ) [ 2 ; u ( 1 ) ( 2 ; x ) ; f ] [ S ( u ( 1 ) ; f ) u ( 1 ) ( 2 ; x ) ] ( [ S ( u ( 1 ) ; f ) / φ ] [ S ( u ( 1 ) ; f ) / ψ ] ) .
  • Implement the boundary conditions represented by Equation (31) into Equation (40) and eliminate the remaining unknown boundary-values of the function v ( 1 ) ( 2 ; x ) from the expression of the bilinear concomitant { P ( 1 ) [ v ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ; δ f ] } α 0 by selecting appropriate boundary conditions for the function a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] , to ensure that Equation (42) is well-posed while being independent of unknown values of v ( 1 ) ( 2 ; x ) and of δ f . The boundary conditions thus chosen for the function a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] can be represented in operator form as follows:
    { b A ( 1 ) [ u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ] } α 0 = 0 , x Ω [ λ ( α 0 ) ; ω ( α 0 ) ] .
  • The selection of the boundary conditions for a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] represented by Equation (44) eliminates the appearance of the unknown values of v ( 1 ) ( 2 ; x ) in { P ( 1 ) [ v ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ; δ f ] } α 0 and reduces this bilinear concomitant to a residual quantity that contains boundary terms involving only known values of u ( 1 ) ( 2 ; x ) , a ( 1 ) ( 2 ; x ) , f , and δ f . This residual quantity will be denoted as { P ^ ( 1 ) [ u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ; δ f ] } α 0 . In general, this residual quantity does not automatically vanish, although it may do so occasionally.
  • The system of equations comprising Equation (42) together with the boundary conditions represented Equation (44) will be called the 1st-Level Adjoint Sensitivity System (1st-LASS). The solution a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] of the 1st-LASS will be called the 1st-level adjoint sensitivity function. The 1st-LASS is called “first-level” (as opposed to “first-order”) because it does not contain any differential or functional-derivatives, but its solution, a ( 1 ) ( 2 ; x ) , will be used below to compute the first-order sensitivities of the response with respect to the components of the feature function f ( α ) .
  • Using Equation (39) together with the forward and adjoint boundary conditions represented by Eqs. (31) and (44) in Equation (40) reduces the latter to the following relation:
    { a ( 1 ) ( 2 ; x ) , q V ( 1 ) [ 2 ; u ( 1 ) ( 2 ; x ) ; f ; δ f ] 1 } α 0 = { v ( 1 ) ( 2 ; x ) , A ( 1 ) [ 2 × 2 ; x ; f ] a ( 1 ) ( 2 ; x ) 1 } α 0 + { P ^ ( 1 ) [ u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ; δ f ] } α 0 .
  • In view of Eqs. (27) and (42), the first term on the right-side of Equation (45) represents the indirect-effect term { δ R [ u ( 1 ) ( 2 ; x ) ; f ; v ( 1 ) ] } i n d . It therefore follows from Equation (45) that the indirect-effect term can be expressed in terms of the 1st-level adjoint sensitivity function a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] as follows:
    { δ R [ u ( 1 ) ( 2 ; x ) ; f ; v ( 1 ) ( 2 ; x ) ] } i n d = { a ( 1 ) ( 2 ; x ) , q V ( 1 ) [ 2 ; u ( 1 ) ( 2 ; x ) ; f ; δ f ] 1 } α 0 { P ^ ( 1 ) [ u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ; δ f ] } α 0 { δ R [ u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ; δ f ] } i n d .
As indicated by the identity shown in Equation (46), the variations δ φ and δ ψ have been eliminated from the original expression of the indirect-effect term, which now depends on the 1st-level adjoint sensitivity function a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] . Adding the expression obtained in Equation (46) with the expression for the direct-effect term defined in Equation (21) yields, according to Equation (20) the following expression for the total 1st-order sensitivity { δ R ( φ , ψ , f ; δ φ , δ ψ , δ f ) } α 0 of the response R [ φ ( x ) , ψ ( x ) ; f ] with respect to the components of the feature function f ( α ) :
{ δ R ( φ , ψ , f ; δ φ , δ ψ , δ f ) } α 0 = { R ( u ( 1 ) ; f ) f δ f } α 0 + { a ( 1 ) ( 2 ; x ) , q V ( 1 ) [ 2 ; u ( 1 ) ( 2 ; x ) ; f ; δ f ] 1 } α 0 { P ^ ( 1 ) [ u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ; δ f ] } α 0 j 1 = 1 T F { R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] δ f j 1 } α 0 .
The identity which appears in Equation (47) emphasizes the fact that the variations δ φ and δ ψ , which are expensive to compute, have been eliminated from the final expressions of the 1st-order sensitivities R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] of the response with respect to the components f j 1 ( α ) , j 1 = 1 , ... , T F , of the features functions. The dependence on the variations δ φ and δ ψ has been replaced in the expression of R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] by the dependence on the 1st-level adjoint sensitivity function a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] . It is very important to note that the 1st-LASS is independent of variations δ f ( α ) in the components of the feature function and is consequently also independent of any variations δ α in the primary model parameters. Hence, the 1st-LASS needs to be solved only once to determine the 1st-level adjoint sensitivity function a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] . Subsequently, the “indirect-effect term” is computed efficiently and exactly by simply performing the integrations required to compute the inner product over the adjoint function a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] , as indicated on the right-side of Equation (47). Solving the 1st-Level Adjoint Sensitivity System (1st-LASS) requires the same computational effort as solving the original coupled linear system, entailing the following operations: (i) inverting (i.e., solving) the left-side of the original adjoint equation with the source [ S ( u ( 1 ) ; α ) / φ ] to obtain the 1st-level adjoint sensitivity function a 1 ( 1 ) ( x ) ; and (ii) inverting the left-side of the original forward equation with the source [ S ( u ( 1 ) ; α ) / ψ ] to obtain the 1st-level adjoint sensitivity function a 2 ( 1 ) ( x ) .
The 1st-order sensitivities R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] , j 1 = 1 , ... , T F , can be expressed as an integral over the independent variables as follows:
R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] λ 1 ( α ) ω 1 ( α ) d x 1 ... λ T I ( α ) ω T I ( α ) d x T I S ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] .
In particular, if the residual bilinear concomitant is non-zero, the functions S ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] would contain suitably defined Dirac delta-functionals for expressing the respective non-zero boundary terms as volume-integrals over the phase-space of the independent variables. Dirac-delta functionals would also be used in the expression of S ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] to represent terms containing the derivatives of the boundary end-points with respect to the model and/or response parameters.
The response sensitivities with respect to the primary model parameters would be obtained by using the expression obtained in Equation (48) in conjunction with the “chain rule” of differentiation provided in Equation (12).
It is important to compare the results produced by the 1st-FASAM-L (for obtaining the sensitivities of the model response with respect to the model’s features) with the 1st-CASAM (the 1st-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems) methodology, which provides the expressions of the responses sensitivities directly with respect to the model’s primary parameters. Recall that the 1st-CASAM-L [27] yields the following expression for the 1st-order sensitivities of the response with respect to the primary model parameters:
{ R [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; α ] α j 1 } α 0 = { λ 1 ( α ) ω 1 ( α ) d x 1 ... λ T I ( α ) ω T I ( α ) d x T I S [ u ( 1 ) ( 2 ; x ) ; α ] α j 1 } α 0 + k = 1 T I m = 1 , k j T I { λ m ( α ) ω m ( α ) d x m S [ u ( 1 ) ( 2 ; ... , ω k , ... ) ; α ] ω k ( α ) α j 1 S [ u ( 1 ) ( 2 ; ... , λ k , ... ) ; α ] λ k ( α ) α j 1 } α 0 + { a ( 1 ) ( 2 ; x ) , α j 1 q ( 1 ) [ u ( 1 ) ( 2 ; x ) ; α ] 1 } α 0 { α j 1 P ^ ( 1 ) [ u ( 1 ) ; a ( 1 ) ; α ] } α 0 ; j 1 = 1 , ... , T P .
The same 1st-level adjoint sensitivity function, denoted as a ( 1 ) ( 2 ; x ) , appears in Equation (49) as well as in Equation (48). Therefore, the same number of “large-scale computations” (which are needed to solve the 1st-LASS to determine the 1st-level adjoint sensitivity function) is needed for obtaining either the response sensitivities with respect to the components, f j ( α ) , j = 1 , ... , T F , of the feature function f ( α ) using the c, or for obtaining the response sensitivities directly with respect to the primary model parameters α j , j = 1 , ... , T P , by using the 1st-CASAM-L. The use of the 1st-CASAM-L would also require performing a number of T P integrations to compute all of the response sensitivities with respect to the primary parameters; in contradistinction, the use of the 1st-FASAM-L would require only T F integrations ( T F < T P ) to compute all of the response sensitivities with respect to the components f j ( α ) of the feature function. Since integrations using quadrature-scheme are significantly less expensive computationally by comparison to solving systems of equations (e.g., the original equations underlying the model and the 1st-LASS), the computational savings provided by the use of the 1st-FASAM-L is small by comparison to using the 1st-CASAM-L. However, this conclusion is valid only for the computation of 1st-order sensitivities. As will be shown in Section 4, below, the computational savings are significantly larger when computing the second-order sensitivities by using the 2nd-FASAM-L rather than using the 2nd-CASAM-L (or any other method).

4. The Second-Order Function/Feature Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward and Adjoint Linear Systems (2nd-FASAM-L)

The “Second-Order Function/Feature Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems“ (2nd-FASAM-L) determines the 2nd-order sensitivities 2 R [ u ( 1 ) ( 2 ; x ) ; f ( α ) ] / f j 2 f j 1 of the response with respect to the components of the “feature” function f ( α ) by conceptually considering that the first-order sensitivities R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] R [ u ( 1 ) ( 2 ; x ) ; f ( α ) ] / f j 1 , which were obtained in Equation (48), are “model responses.” Consequently, the 2nd-order sensitivities are obtained as the “1st-order sensitivities of the 1st-order sensitivities” by applying the concepts underlying 1st-FASAM to each 1st-order sensitivity R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] , j 1 = 1 , ... , T F , which depends on both the vector u ( 1 ) ( 2 ; x ) , which comprises the original state variables, as well as on the 1st-level adjoint function a ( 1 ) ( 2 ; x ) .
To establish the pattern underlying the computation of sensitivities of arbitrarily high-order, it is useful to introduce a systematic classification of the systems of equations that will underly the computation of the sensitivities of various orders. As has been shown in the Section 2.1, above, the 1st-order response sensitivities R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] depend on the original state functions u ( 1 ) ( 2 ; x ) [ φ ( x ) , ψ ( x ) ] and on the 1st-level adjoint sensitivity function a ( 1 ) ( 2 ; x ) [ a 1 ( 1 ) ( x ) , a 2 ( 1 ) ( x ) ] . The system of equations satisfied by these functions will be called “the 2nd-Level Forward/Adjoint System (2nd-LFAS)” and will be re-written in the following concatenated form:
F ( 2 ) [ 2 2 × 2 2 ; f ( α ) ] u ( 2 ) ( 2 2 ; x ) = q F ( 2 ) [ 2 2 ; u ( 1 ) ( 2 ; x ) ; f ( α ) ] ; x Ω ( α ) ;
b F ( 2 ) ( 2 2 ; u ( 2 ) ; f ) ( b F ( 1 ) , b A ( 1 ) ) = 0 ; x Ω [ λ ( α ) ; ω ( α ) ] ;
where the following definitions were used:
F ( 2 ) [ 2 2 × 2 2 ; f ( α ) ] d i a g ( F ( 1 ) , A ( 1 ) ) ; u ( 2 ) ( 2 2 ; x ) [ u ( 1 ) ( 2 ; x ) , a ( 1 ) ( 2 ; x ) ] ;
q F ( 2 ) [ 2 2 ; u ( 1 ) ( 2 ; x ) ; f ( α ) ] [ q F ( 1 ) ( 2 ; x ; f ) , q A ( 1 ) [ 2 ; u ( 1 ) ; f ] ] .
The notation used for the matrix F ( 2 ) [ 2 2 × 2 2 ; f ( α ) ] indicates the following characteristics: (i) the superscript “2” indicates “2nd-level”; (ii) the argument “ 2 2 × 2 2 ” indicates that this square matrix comprises 4x4=16 component sub-matrices. Similarly, the argument “22” that appears in the block-vectors u ( 2 ) ( 2 2 ; x ) , q F ( 2 ) [ 2 2 ; u ( 1 ) ( 2 ; x ) ; f ( α ) ] and b F ( 2 ) ( 2 2 ; u ( 2 ) ; α ) indicates that each of these column block-vectors comprises four sub-vectors as components.
The first-order G-differential of a first-order sensitivity R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; f ( α ) ] , j 1 = 1 , ... , T F , is obtained by definition as follows:
{ δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ; δ f ] } α 0 { d d ε δ R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) + ε v ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) + ε δ a ( 1 ) ( 2 ; x ) ; f + ε δ f ] } ε = 0 = { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] } i n d + { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; δ f ] } d i r
The direct-effect term { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; δ f ] } d i r in Equation (54) is defined as follows
{ δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; δ f ] } d i r j 2 = 1 T F { f j 2 λ 1 ( α ) ω 1 ( α ) d x 1 ... λ T I ( α ) ω T I ( α ) d x T I S ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; f ( α ) ] } α 0 δ f j 2 .
and can be computed immediately. The indirect-effect term { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] } i n d in Equation (54) depends on the 2nd-level variational sensitivity function v ( 2 ) ( 2 2 ; x ) [ v ( 1 ) ( 2 ; x ) , δ a ( 1 ) ( 2 ; x ) ] and is defined as follows:
{ δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] } i n d λ 1 ( α ) ω 1 ( α ) d x 1 ... λ T I ( α ) ω T I ( α ) d x T I [ s ( 2 ) ( 2 2 ; j 1 ; u ( 2 ) ; f ) · v ( 2 ) ( 2 2 ; x ) ] ,
where:
s ( 2 ) ( 2 2 ; j 1 ; u ( 2 ) ; f ) R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] / u ( 2 ) .
Evidently, the functions v ( 1 ) ( 2 ; x ) and δ a ( 1 ) ( 2 ; x ) are needed in order to evaluate the above indirect-effect term. These functions are the solutions of the system of equations obtained by taking the first-G-differential of the 2nd-LFAS defined by Eqs. (52) and (53). Applying the definition of the first G-differential the 2nd-LFAS yields the following 2nd-Level Variational Sensitivity System (2nd-LVSS)” for the 2nd-Level variational sensitivity function  v ( 2 ) ( 2 2 ; x ) [ v ( 1 ) ( 2 ; x ) , δ a ( 1 ) ( 2 ; x ) ] :
{ d d ε F ( 2 ) [ 2 2 × 2 2 ; f 0 + ε δ f ] [ u ( 2 , 0 ) ( 2 2 ; x ) + ε v ( 2 ) ( 2 2 ; x ) ] } ε = 0 = { d d ε q F ( 2 ) [ 2 2 ; u ( 1 , 0 ) ( 2 ; x ) + ε v ( 1 ) ( 2 ; x ) ; f 0 + ε δ f ] } ε = 0 ; x Ω ( α 0 ) ;
{ d d ε b F ( 2 ) [ u ( 2 , 0 ) ( 2 2 ; x ) + ε v ( 2 ) ( 2 2 ; x ) ; f 0 + ε δ f ] } ε = 0 = 0 ; x Ω [ λ ( α 0 ) ; ω ( α 0 ) ] .
Carrying out the differentiation with respect to ε in Eqs. (58) and (59), and setting ε = 0 in the resulting expressions yields the following 2nd-LVSS:
{ V ( 2 ) [ 2 2 × 2 2 ; x ; f ] v ( 2 ) ( 2 2 ; x ) } α 0 = { q V ( 2 ) [ 2 2 ; u ( 2 ) ( 2 2 ; x ) ; f ; δ f ] } α 0 ; x Ω ( α 0 ) ;
{ b v ( 2 ) ( u ( 2 ) ; v ( 2 ) ; f ; δ f ) } α 0 = 0 ; x Ω [ λ ( α 0 ) ; ω ( α 0 ) ] ;
where the following definitions were used:
V ( 2 ) [ 2 2 × 2 2 ; x ; f ] = ( V ( 1 ) [ 2 × 2 ; x ; f ] 0 V 21 ( 2 ) ( 2 × 2 ; u ( 1 ) ; f ) A ( 1 ) [ 2 × 2 ; x ; f ] ) ; V 21 ( 2 ) ( 2 × 2 ; u ( 1 ) ; f ) ( 2 S ( u ( 1 ) ; f ) φ φ 2 S ( u ( 1 ) ; f ) φ ψ 2 S ( u ( 1 ) ; f ) ψ φ 2 S ( u ( 1 ) ; f ) ψ ψ ) ;
q V ( 2 ) [ 2 2 ; u ( 2 ) ( 2 2 ; x ) ; f ; δ f ] ( q V ( 1 ) [ 2 ; u ( 1 ) ; f ; δ f ] p A ( 1 ) [ 2 ; u ( 1 ) ; a 1 ( 1 ) ; f ; δ f ] ) ; p A ( 1 ) [ 2 ; u ( 1 ) ; a 1 ( 1 ) ; f ; δ f ] ( p 1 ( 1 ) ( u ( 1 ) ; a 1 ( 1 ) ; δ f ) p 2 ( 1 ) ( u ( 1 ) ; a 1 ( 1 ) ; δ f ) ) ;
p 1 ( 1 ) ( u ( 1 ) ; a 1 ( 1 ) ; f ; δ f ) 2 S ( u ( 1 ) ; f ) f φ δ f [ L * ( f ) a 1 ( 1 ) ] f δ f ;
p 2 ( 1 ) ( u ( 1 ) ; a 2 ( 1 ) ; f ; δ f ) 2 S ( u ( 1 ) ; f ) f ψ δ f [ L ( f ) a 2 ( 1 ) ] f δ f ;
b v ( 2 ) ( u ( 2 ) ; v ( 2 ) ; f ; δ f ) ( b v ( 1 ) ( u ( 1 ) ; v ( 1 ) ; f ; δ f ) δ b A ( 1 ) ( u ( 2 ) ; v ( 2 ) ; f ; δ f ) ) ;
The matrix V 21 ( 2 ) ( 2 × 2 ; u ( 1 ) ; f ) depends only the system’s response and is responsible for coupling the forward and adjoint systems. Although the forward and adjoint systems are coupled, they could nevertheless be solved successively rather than simultaneously, because the matrix V ( 2 ) [ 2 2 × 2 2 ; x ; f ] is block-diagonal. All of the components of the matrices and vectors underlying the 2nd-LVSS are to be computed at nominal parameter and state function values, as indicated in Eqs. (60) and (61).
Computing the indirect-effect term { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] } i n d by solving the 2nd-LVSS would require at least 2 T F ( T F + 1 ) large-scale computations (to solve the 2nd-LVSS) for every component of the feature function f ( α ) .
The need for solving the 2nd-LVSS will be circumvented by deriving an alternative expression for the indirect-effect term { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] } i n d , as defined in Equation (56), in which the second-level variational function v ( 2 ) ( 2 2 ; x ) will be replaced by a 2nd-level adjoint function which is independent of variations in the model parameter and state functions. This 2nd-level adjoint function will be the solution of a 2nd-Level Adjoint Sensitivity System (2nd-LASS), which will be constructed by using the same principles as employed for deriving the 1st-LASS. The 2nd-LASS is constructed in a Hilbert space, denoted as H 2 , which will comprise as elements block-vectors of the same form as v ( 2 ) ( 2 2 ; x ) , i.e., a vector in H 2 has the generic structure χ ( 2 ) ( 2 2 ; x ) [ χ 1 ( 2 ) ( x ) , χ 2 ( 2 ) ( x ) , χ 3 ( 2 ) ( x ) , χ 4 ( 2 ) ( x ) ] , comprising four vector-valued components of the form χ i ( 2 ) ( x ) [ χ i , 1 ( 2 ) ( x ) , ... , χ i , j ( 2 ) ( x ) , ... , χ i , T D ( 2 ) ( x ) ] , i = 1 , 2 , 3 , 4 = 2 2 . The inner product between two elements, χ ( 2 ) ( 2 ; x ) H 2 and θ ( 2 ) ( 2 ; x ) H 2 , of this Hilbert space, will be denoted as χ ( 2 ) ( 2 ; x ) , θ ( 2 ) ( 2 ; x ) 2 2 and is defined as follows:
χ ( 2 ) ( 2 ; x ) , θ ( 2 ) ( 2 ; x ) 2 2 i = 1 2 2 χ i ( 2 ) ( x ) , θ i ( 2 ) ( x ) 0
Note that there are j 1 = 1 , ... , T F distinct indirect-effect terms { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] } i n d . Each of these indirect-effect terms will serve as a “source” for a “2nd-Level Adjoint Sensitivity System (2nd-LASS)” that will be constructed by applying the same sequence of steps that were used in Section 2.1, above, to construct the 1st-LASS. This implies that a distinct 2nd-level adjoint sensitivity function, of the form a ( 2 ) ( 2 2 ; j 1 ; x ) [ a 1 ( 2 ) ( j 1 ; x ) , a 2 ( 2 ) ( j 1 ; x ) , a 3 ( 2 ) ( j 1 ; x ) , a 4 ( 2 ) ( j 1 ; x ) ] H 2 , j 1 = 1 , ... , T F , corresponding to each distinct indirect-effect term, will be needed for constructing each of the corresponding 2nd-LASS, as follows:
  • For each j 1 = 1 , ... , T P , form the inner product in the Hilbert space H 2 of Equation (60) with a yet undefined function a ( 2 ) ( 2 2 ; j 1 ; x ) to obtain the following relation:
    { a ( 2 ) ( 2 2 ; j 1 ; x ) , V ( 2 ) [ 2 2 × 2 2 ; x ; f ] v ( 2 ) ( 2 2 ; x ) 2 2 } α 0 = { a ( 2 ) ( 2 2 ; j 1 ; x ) , q V ( 2 ) [ 2 2 ; u ( 2 ) ( 2 2 ; x ) ; f ; δ f ] 2 2 } α 0 ; x Ω ( α 0 ) .
  • Using the definition of the adjoint operator in the Hilbert space H 2 , recast the left-side of Equation (68) as follows:
    { a ( 2 ) ( 2 2 ; j 1 ; x ) , V ( 2 ) [ 2 2 × 2 2 ; x ; f ] v ( 2 ) ( 2 2 ; x ) 2 2 } α 0 = { v ( 2 ) ( 2 2 ; x ) , A ( 2 ) [ 2 2 × 2 2 ; x ; f ] a ( 2 ) ( 2 2 ; j 1 ; x ) 2 2 } α 0 + { P ( 2 ) [ v ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ; δ f ] } α 0 ,
    where { P ( 2 ) [ v ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ; δ f ] } α 0 denotes the bilinear concomitant defined on the phase-space boundary x Ω x ( α 0 ) and where A ( 2 ) [ 2 2 × 2 2 ; x ; f ] [ V ( 2 ) [ 2 2 × 2 2 ; x ; f ] ] * is the operator formally adjoint to V ( 2 ) [ 2 2 × 2 2 ; x ; f ] .
  • The first term on right-side of Equation (69) is now required to represent the indirect-effect term { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] } i n d defined in Equation (56). This requirement is satisfied by recalling Equation (57) and imposing the following relation on each function a ( 2 ) ( 2 2 ; j 1 ; x ) , j 1 = 1 , ... , T F :
    { A ( 2 ) [ 2 2 × 2 2 ; x ; f ] a ( 2 ) ( 2 2 ; j 1 ; x ) } α 0 = { s ( 2 ) ( 2 2 ; j 1 ; u ( 2 ) ; f ) } α 0 , j 1 = 1 , ... , T F ,
  • The definition of the vector a ( 2 ) ( 2 2 ; j 1 ; x ) will now be completed by selecting boundary conditions which will be represented in operator form as follows:
    { b A ( 2 ) [ u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ] } α 0 = 0 , x Ω ( α 0 ) , j 1 = 1 , ... , T F .
  • The boundary conditions represented by Equation (71) are selected so as to satisfy the following requirements: (a) these boundary conditions together with Equation (70) constitute a well posed problem for the functions a ( 2 ) ( 2 2 ; j 1 ; x ) ; (b) the implementation in Equation (69) of these boundary conditions together with those provided in Equation (61) eliminates all of the unknown values of the functions v ( 2 ) ( 2 2 ; x ) and a ( 2 ) ( 2 2 ; j 1 ; x ) in the expression of the bilinear concomitant { P ( 2 ) [ v ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ; δ f ] } α 0 . This bilinear concomitant may vanish after these boundary conditions are implemented, but if it does not, it will be reduced to a residual quantity which will be denoted as P ^ ( 2 ) [ u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ; δ f ] and which will comprise only known values of u ( 2 ) ( 2 2 ; x ) , a ( 2 ) ( 2 2 ; j 1 ; x ) , f and δ f .
The system of equations represented by Equation (70) together with the boundary conditions represented by Equation (71) constitute the 2nd-Level Adjoint Sensitivity System (2nd-LASS). The solution of the 2nd-LASS, i.e., the four-component vector a ( 2 ) ( 2 2 ; j 1 ; x ) , j 1 , ... , T P , will be called the 2nd-level adjoint sensitivity function. It is important to note that the 2nd-LASS is independent of any variations, δ f , in the components of the feature function and, hence, is independent of any parameter variations, δ α , as well.
The equations underlying the 2nd-LASS, represented by Eqs. (70), (71), together with the equations underlying the 2nd-LVSS, represented by Eqs. (60) and (61), are now employed in Equation (69) in conjunction with Equation (56) to obtain the following expression for the indirect-effect term { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] } i n d in terms of the 2nd-level adjoint sensitivity functions a ( 2 ) ( 2 2 ; j 1 ; x ) , for j 1 = 1 , ... , T P :
{ δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; v ( 2 ) ( 2 2 ; x ) ; f ] } i n d = { a ( 2 ) ( 2 2 ; j 1 ; x ) , q V ( 2 ) [ 2 2 ; u ( 2 ) ( 2 2 ; x ) ; f ; δ f ] 2 } α 0 { P ^ ( 2 ) [ u ( 2 ) ; a ( 2 ) ; f ; δ f ] } α 0 { δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ; δ f ] } i n d .
As the last equality (identity) in Equation (72) indicates, the 2nd-level variational sensitivity function v ( 2 ) ( 2 2 ; x ) has been eliminated from appearing in the expression of the indirect-effect term, having been replaced by the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 2 ; j 1 ; x ) , for each j 1 = 1 , ... , T F .
Inserting the expressions that define the vector q V ( 2 ) [ 2 2 ; u ( 2 ) ( 2 2 ; x ) ; f ; δ f ] from Equation (63)‒(65) into Equation (72) and adding the resulting expression for the indirect-effect term to the expression of the direct-effect term given in Equation (54) yields the following expression for the total second-order G-differential of the response R [ φ ( x ) , ψ ( x ) ; f ] :
{ δ R ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( j 1 ; 2 2 ; x ) ; f ; δ f ] } α 0 = j 2 = 1 T F { R ( 2 ) [ j 2 ; j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( j 1 ; 2 2 ; x ) ; f ] } α 0 δ f j 2 ,
where R ( 2 ) [ j 2 ; j 1 ; u ( 2 ) ( x ) ; a ( 2 ) ( j 1 ; x ) ; f ] 2 R [ φ ( x ) , ψ ( x ) ; f ] / f j 1 f j 2 denotes the 2nd-order partial sensitivity of the response R [ φ ( x ) , ψ ( x ) ; f ] with respect to the components f j 2 ( α ) of the feature function f ( α ) , evaluated at the nominal parameter values α 0 , and has the following expression for j 1 , j 2 = 1 , ... , T P :
R ( 2 ) [ j 2 ; j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( j 1 ; 2 2 ; x ) ; f ] = { f j 2 λ 1 ( α ) ω 1 ( α ) d x 1 ... λ T I ( α ) ω T I ( α ) d x T I S ( 1 ) [ j 1 ; u ( 2 ) ( 2 2 ; x ) ; f ( α ) ] } α 0 + { a 1 ( 2 ) ( j 1 ; x ) , [ Q ( f ) L ( f ) φ ( x ) ] f j 2 0 } α 0 + { a 2 ( 2 ) ( j 1 ; x ) , [ Q * ( f ) L * ( f ) ψ ( x ) ] f j 2 0 } α 0 + { a 3 ( 2 ) ( j 1 ; x ) , 2 S ( u ( 1 ) ( x ) ; f ) f j 2 φ [ L * ( f ) a 1 ( 1 ) ] f j 2 0 } α 0 + { a 4 ( 2 ) ( j 1 ; x ) , 2 S ( u ( 1 ) ( x ) ; f ) f j 2 ψ [ L ( f ) a 2 ( 1 ) ] f j 2 0 } α 0 { f j 2 P ^ ( 2 ) [ u ( 2 ) ( x ) ; a ( 2 ) ( j 1 ; x ) ; f ( α ) ] } α 0 .
Since the 2nd-LASS is independent of variations in the components of the feature-functions (and, hence, variations in the model parameters), the exact computation of all of the partial second-order sensitivities R ( 2 ) [ j 2 ; j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( j 1 ; 2 2 ; x ) ; f ] 2 R [ φ ( x ) , ψ ( x ) ; f ] / f j 1 f j 2 requires at most T F large-scale (adjoint) computations using the 2nd-LASS. When the 2nd-LASS is solved T F -times, the “off-diagonal” 2nd-order mixed sensitivities 2 R / f j 1 f j 2 will be computed twice, in two different ways, using two distinct 2nd-level adjoint sensitivity functions, thereby providing an independent intrinsic (numerical) verification that the 1st- and 2nd-order response sensitivities with respect to the components of the feature functions are computed accurately. In component form, the equations comprising the 2nd-LASS are solved, for each j 1 = 1 , ... , T F , in the following order:
L ( f ) a 3 ( 2 ) ( j 1 ; x ) = S ( 1 ) ( j 1 ; u ( 2 ) ; f ) a 1 ( 1 ) ,
L * ( f ) a 4 ( 2 ) ( j 1 ; x ) = S ( 1 ) ( j 1 ; u ( 2 ) ; f ) a 2 ( 1 ) ,
L * ( f ) a 1 ( 2 ) ( j 1 ; x ) = 2 S ( u ( 1 ) ; f ) φ φ a 3 ( 2 ) ( j 1 ; x ) + 2 S ( u ( 1 ) ; f ) ψ φ a 4 ( 2 ) ( j 1 ; x ) + S ( 1 ) ( j 1 ; u ( 2 ) ; f ) φ ,
L ( f ) a 2 ( 2 ) ( j 1 ; x ) = 2 S ( u ( 1 ) ; f ) φ ψ a 3 ( 2 ) ( j 1 ; x ) + 2 S ( u ( 1 ) ; f ) ψ ψ a 4 ( 2 ) ( j 1 ; x ) + S ( 1 ) ( j 1 ; u ( 2 ) ; f ) ψ .
Dirac delta-functionals may need to be used in Equation (74) in order to express in integral form the eventual non-zero residual terms in the residual bilinear concomitant and/or the terms containing derivatives with respect to the lower- and upper-boundary points. Ultimately, the expression of the partial second-order sensitivities R ( 2 ) [ j 2 ; j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ] obtained in Equation (74) is written in the following integral form, which mirrors Equation (48):
R ( 2 ) [ j 2 ; j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ( α ) ] λ 1 ( α ) ω 1 ( α ) d x 1 ... λ T I ( α ) ω T I ( α ) d x T I S ( 2 ) [ j 2 ; j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ( α ) ] .
The computation of the partial second-order sensitivities R ( 2 ) [ j 2 ; j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ] using Equation (74) requires quadratures for performing the integrations over the four components of the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 2 ; j 1 ; x ) , which are obtained by solving the 2nd-LASS for j 1 = 1 , ... , T F . Thus, obtaining all of the second-order sensitivities R ( 2 ) [ j 2 ; j 1 ; u ( 2 ) ( 2 2 ; x ) ; a ( 2 ) ( 2 2 ; j 1 ; x ) ; f ] 2 R / f j 1 f j 2 with respect to the components f j 1 of the feature function f ( α ) requires performing at most T F large-scale computations for solving the 2nd-LASS.
By comparison, if the 2nd-CASAM-L [27] had been applied to compute the second-order sensitivities of the response directly with respect to the model parameters, T P (instead of T F ) large-scale computations for solving the corresponding 2nd-LASS would have been required, where T P denotes the total number of primary model parameters. Since T F < T P , fewer large-scale computations are needed when using the 2nd-FASAM-L rather than the 2nd-CASAM-L. Notably, the left-sides of the 2nd-LASS to be solved within the 2nd-FASAM-L are the same as those to be solved within the 2nd-CASAM-L. However, the source terms on the right-sides of these 2nd-LASS are different from each other: there are as many source-terms on the right-sides as there are components of the feature function within the 2nd-FASAM-L, and there are as many right-side sources as there are primary model parameters within the 2nd-CASAM-L.

5. Illustrative High-Order Feature Adjoint Sensitivity Analysis of Energy-Dependent Particle Detector Response

The application of the nth-FASAM-L methodology will be illustrated in this Section by considering the simplified model of the distribution in the asymptotic energy range of neutrons produced by a source of neutrons placed in an isotropic medium comprising a homogeneous mixture of “M” non-fissionable materials having constant (i.e., energy-independent) properties. For simplicity, but without diminishing the applicability of the nth-FASAM-L methodology, this medium is considered to be infinitely large. The simplified neutron transport equation that models the energy-distributions of neutrons in such materials is called the “neutron slowing-down equation” and is written using the neutron lethargy (rather than the neutron energy) as the independent variable, which is denoted as “u” and is defined as follows: u ln ( E 0 / E ) , where E denotes the energy-variable and E 0 denotes the highest energy in the system. Thus, the neutron slowing-down model [32,33,34] for the energy-distribution of the neutron flux in a homogeneous mixture of non-fissionable materials of infinite extent takes on the following drastically simplified form of the neutron transport balance equation:
d φ ( u ) d u + Σ a ξ ¯ Σ t φ ( u ) = S ( u ) ξ ¯ Σ t ; 0 < u u t h ;
φ ( 0 ) = 0 ; a t u = 0.
The quantities which appear in Equation (80) are defined below.
(1)
The lethargy-dependent neutron flux is denoted as φ ( u ) ; u t h denotes a cut-off lethargy, usually taken to be the lethargy that corresponds to the thermal neutron energy (ca. 0.0024 electron-volts).
(2)
The macroscopic elastic scattering cross section for the homogeneous mixture of “M” materials is denoted as Σ s and is defined as follows:
Σ s i = 1 M N m ( i ) σ s ( i ) ;
where σ s ( i ) , i = 1 , ... , M denotes the elastic scattering cross section of material “i”, and where the atomic or molecular number density of material “i” is denoted as N m ( i ) , i = 1 , ... , M and is defined as follows: N m ( i ) ρ i N A / A i , where N A is Avogadro’s number ( 0.602 × 10 24 n u c l e i / m o l e ) , while A i and ρ i denote the respective material’s mass number and density.
(3)
The average gain in lethargy of a neutron per collision is denoted as ξ ¯ and is defined as follows for the homogeneous mixture:
ξ ¯ 1 Σ s i = 1 M ξ i N m ( i ) σ s ( i ) ; ξ i 1 + a i ln a i 1 a i ; a i ( A i 1 A i + 1 ) 2 .
(4)
The macroscopic absorption cross section is denoted as Σ a and is defined as follows for the homogeneous mixture:
Σ a i = 1 M N m ( i ) σ γ ( i ) ,
where σ γ ( i ) , i = 1 , ... , M , denotes the microscopic radiative-capture cross section of material “i”.
(5)
The macroscopic total cross section is denoted as Σ t and is defined as follows for the homogeneous mixture:
Σ t Σ a + Σ s .
(6)
The source S ( u ) is considered to be a simplified “spontaneous fission” source stemming from fissionable actinides, such as 239Pu and 240Pu, emitting monoenergetic neutrons at the highest energy (i.e., zero lethargy). Such a source is comprised within the OECD/NEA polyethylene-reflected plutonium (PERP) OECD/NEA reactor physics benchmark [21,22] which can be modeled by the following simplified expression:
S ( u ) = S 0 δ ( u ) ; S 0 k = 1 2 λ k S N k S F k S ν k S W k S ,
where the superscript “S” indicates “source;” the subscript index k=1 indicates material properties pertaining to the isotope 239Pu; the subscript index k=2 indicates material properties pertaining to the isotope 240Pu; λ k S denotes the decay constant; N k S denotes the atomic density of the respective actinide; F k S denotes the spontaneous fission branching ratio; ν k S denotes the average number of neutrons per spontaneous fission; W k S denotes a function of parameters used in a Watt’s fission spectrum to approximate the spontaneous fission neutron spectrum of the respective actinide. The detailed forms of the parameters W k S are unimportant for illustrating the application of the nth-FASAM-L methodology. The nominal values for these imprecisely known parameters are available from a library file contained in SOURCES4C [26].
The response considered for the above neutron slowing-down model is the reaction rate, denoted as R , of neutrons of energy u = u d that would be measured by a detector characterized by an interaction cross section Σ d N d σ d , where N d denotes the atomic or molecular number density of the detector’s material while σ d denotes the detector’s microscopic interaction cross section. Mathematically, the detector’s reaction rate can be represented by the following functional of the neutron flux φ ( u ) :
R = Σ d φ ( u d ) = Σ d 0 u t h φ ( u ) δ ( u u d ) d u ; Σ d = N d σ d .
For this “source-detector” model, the following primary model parameters are subject to experimental uncertainties:
(i) the atomic number densities N m ( i ) ; the microscopic radiative-capture cross section σ γ ( i ) ; the scattering cross section σ s ( i ) , for each material “i”, i = 1 , ... , M , included in the homogeneous mixture;
(ii) the source parameters λ k S , N k S , F k S , ν k S , W k S , for k=1,2;
(iii) the atomic density N d and the microscopic interaction cross section σ d that characterize the detector’s material.
These above primary parameters are considered to constitute the components of a “vector of primary model parameters” defined as follows:
α ( N m ( 1 ) , σ γ ( 1 ) , σ s ( 1 ) , ... , N m ( M ) , σ γ ( M ) , σ s ( M ) , λ 1 S , λ 2 S , N 1 S , N 2 S , F 1 S , F 2 S , ν 1 S , ν 2 S , W 1 S , W 2 S , N d , σ d ) ( α 1 , , α T P ) ; T P 3 M + 12.
On the other hand, the structure of the computational model comprising Eqs. (80), (81) and (87) suggests that the components f i ( α ) of the feature function f ( α ) can be defined as follows:
f ( α ) [ f 1 ( α ) , f 2 ( α ) , f 3 ( α ) ] ; f 1 ( α ) Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ; f 2 ( α ) S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ; f 3 ( α ) Σ d ( α ) .
Solving Eqs. (80) and (81) while using the definitions introduced in Equation (89) yields the following expression for the flux φ ( u ) in terms of the components f i ( α ) of the feature function f ( α ) :
φ ( u ) = H ( u ) f 2 ( α ) exp [ u f 1 ( α ) ] ; H ( 0 ) = 0 ; H ( u ) = 1 , i f u > 0.
In terms of the components f i ( α ) of the feature function f ( α ) , the model’s response takes on the following expression:
R ( α ) = f 3 ( α ) f 2 ( α ) exp [ u d f 1 ( α ) ] .
As Equation (91) indicates, the model response can be considered to depend directly on T P 3 M + 12 primary model parameters. Alternatively, the model response can be considered to depend directly on 3 feature functions and only indirectly (through the 3 feature functions) on the primary model parameters. In the former consideration/interpretation, the response sensitivities to the primary model parameters will be obtained by applying the nth-CASAM-L methodology. In the later consideration/interpretation, the response sensitivities to the primary model parameters will be obtained by applying the nth-FASAM-L methodology, which will involve two stages, as follows: the response sensitivities with respect to the feature functions will be obtained in the first stage, while the subsequent computation of the response sensitivities to the primary model parameters will be performed in the second stage, by using the response sensitivities with respect to the feature functions obtained in the first stage. The computational distinctions that stem from these differing considerations/interpretations within the nth-CASAM-L methodology versus the nth-FASAM-L methodology will become evident in the remainder of this Section by means of using the illustrative neutron slowing-down model, which is representative of the general situation for any linear system.
According to the “reciprocity relation” for linear systems highlighted in Equation (8), the detector response defined in Equation (87) can be alternatively expressed in terms of the solution of the “adjoint slowing-down model”, i.e., the model that would be adjoint to the forward slowing-down model represented by Eqs. (80) and (81). The “adjoint slowing-down model” is constructed in the Hilbert space H B of all square-integrable functions φ ( u ) H B , ψ ( u ) H B endowed with the following inner product, denoted as φ ( u ) , ψ ( u ) B :
φ ( u ) , ψ ( u ) B 0 u t h φ ( u ) ψ ( u ) d u .
Using the inner product φ ( u ) , ψ ( u ) B defined in Equation (92), the adjoint slowing-down model is constructed by the usual procedure, namely: (i) construct the inner product of Equation (80) with a function ψ ( u ) H B ; (ii) integrate by parts the resulting relation so as to transfer the differential operation from the forward function φ ( u ) onto the adjoint function ψ ( u ) ; (iii) use the initial condition provided in Equation (81) and eliminate the unknown function φ ( u t h ) by choosing the final-value condition ψ ( u t h ) = 0 ; (iv) choose the source for the resulting adjoint slowing-down model so as to satisfy the reciprocity relation shown in Equation (8). The result of these operations is the following adjoint slowing down model for the adjoint slowing-down function ψ ( u ) :
d ψ ( u ) d u + f 1 ( α ) ψ ( u ) = f 3 ( α ) δ ( u u d ) ,
ψ ( u t h ) = 0 , a t u = u t h .
In terms of the adjoint slowing-down function ψ ( u ) , the detector response takes on the following alternative expression:
R = f 2 ( α ) 0 u t h ψ ( u ) δ ( u ) d u .
The correctness of the alternative expression for the detector response provided in Equation (95) can be readily verified by solving the adjoint slowing down equation to obtain the following closed form expression for the adjoint slowing-down function ψ ( u ) :
ψ ( u ) = H ( u d u ) f 3 ( α ) exp [ ( u u d ) f 1 ( α ) ] ,
and subsequently inserting the above expression into Equation (95) to obtain the same final result as was obtained in Equation (91) in terms of the forward slowing-down flux φ ( u ) .

5.1. First-Order Adjoint Sensitivity Analysis: 1st-FASAM-L Versus 1st-CASAM-L

In this Subsection, the computation of the first-order sensitivities of the response R ( α ) with respect to the primary model parameters will first be demonstrated by using the 1st-FASAM-L. Subsequently, the same first-order sensitivities will be obtained by using the 1st-CASAM-L and the two alternative paths will be compared to each other, showing that the same expressions are obtained for the respective sensitivities, as expected. Although the computational efforts are not identical, they are comparable in terms of efficiency, with a slight advantage for the 1st-FASAM-L methodology.

5.1.1. Application of the 1st-FASAM-L

The 1st-FASAM-L will be applied to the neutron slowing-down paradigm illustrative model by following the principles outlined in Section 3. In this case, the model response is written in terms of the feature functions as follows:
R ( φ , f ) = f 3 0 u t h φ ( u ) δ ( u u d ) d u ,
where the flux φ ( u ) is the solution of the 1st-Level Forward/Adjoint System (1st-LFAS) comprising Eqs. (80) and (81), where Equation (80) is written in terms of the feature functions as follows:
d φ ( u ) d u + f 1 φ ( u ) = f 2 δ ( u ) ; 0 < u u t h .
The first-order sensitivities of the response R ( φ , f ) with respect to the components of the feature function f ( α ) are provided by the first-order Gateaux (G-)variation δ R ( φ 0 , f 0 ; v ( 1 ) , δ f ) of R ( φ , f ) , for variations v ( 1 ) ( u ) δ φ ( u ) and δ f 3 around the phase-space point ( φ 0 , f 0 ) , as shown in Equation (18), to obtain:
δ R ( φ 0 , f 0 ; v ( 1 ) , δ f ) { d d ε [ ( f 3 0 + ε δ f 3 ) 0 u t h ( φ 0 + ε v ( 1 ) ) δ ( u u d ) d u ] } ε = 0 { δ R ( φ 0 , f 0 ; δ f 3 ) } d i r + { δ R ( φ 0 , f 0 ; v ( 1 ) ) } i n d ,
where the “direct-effect” term { δ R ( φ 0 , f 0 ; δ f 3 ) } d i r and, respectively, the “indirect-effect” term { δ R ( φ 0 , f 0 ; v ( 1 ) ) } i n d are defined as follows:
{ δ R ( φ 0 , f 0 ; δ f 3 ) } d i r ( δ f 3 ) 0 u t h φ 0 ( u ) δ ( u u d ) d u ,
{ δ R ( φ 0 , f 0 ; v ( 1 ) ) } i n d f 3 0 0 u t h v ( 1 ) ( u ) δ ( u u d ) d u .
The “1st-level variational sensitivity function” v ( 1 ) ( u ) is obtained as the solution of the “1st-Level Variational Sensitivity System (1st-LVSS)” obtained by taking the first-order G-differentials of the 1st-LFAS defined by Eqs. (98) and (81), which are derived as shown in Eqs. (28) and (29), to obtain:
{ d d ε [ d ( φ 0 + ε v ( 1 ) ) d u + ( f 1 0 + ε δ f 1 ) ( φ 0 + ε v ( 1 ) ) ] } ε = 0 = δ ( u ) { d d ε ( f 2 0 + ε δ f 2 ) } ε = 0 ,
{ d d ε [ φ 0 ( u ) + ε v ( 1 ) ( u ) ] } ε = 0 = 0 ; a t u = 0.
Carrying out the differentiations with respect to ε in the above equations and setting ε = 0 in the resulting expressions yields the following 1st-LVSS:
d v ( 1 ) ( u ) d u + f 1 ( α 0 ) v ( 1 ) ( u ) = ( δ f 2 ) δ ( u ) ( δ f 1 ) φ 0 ( u ) ,
v ( 1 ) ( u ) = 0 ; a t u = 0.
For further reference, the closed-form solution of the above 1st-LVSS has the following expression:
v 1 ( u ) = { [ ( δ f 2 ) ( δ f 1 ) u f 2 ] H ( u ) exp ( u f 1 ) } α 0 .
In principle, the above expression for v ( 1 ) ( u ) could be used in Equation (101) to obtain the value if the indirect-effect term. In practice, however, the 1st-LVSS cannot be solved analytically so the closed form expression of v ( 1 ) ( u ) is not available. Consequently, rather than (numerically) solve repeatedly the 1st-LVSS for every possible variation induced by the primary parameters in the component feature functions, the alternative route to determining the expression of the indirect-effect term is to develop the 1st-Level Adjoint sensitivity System (1st-LASS) by following the procedure described in Section 3. The Hilbert space, denoted as H 1 , appropriate for this illustrative model is the space of all square-integrable functions endowed with the following inner product, denoted as χ ( 1 ) ( u ) , θ ( 1 ) ( u ) 1 , between two elements, χ ( 1 ) ( u ) H 1 , θ ( 1 ) ( 2 ; x ) H 1 , belonging to this Hilbert space:
χ ( 1 ) ( u ) , θ ( 1 ) ( u ) 1 0 u t h χ ( 1 ) ( u ) θ ( 1 ) ( u ) d u .
In this particular instance, the Hilbert space H 1 coincides with the original Hilbert space H B used for the original forward and adjoint slowing down models. More generally, similar situations occur when the response depends either just on the forward or just on the adjoint state function(s).
Using Equation (107), construct in the Hilbert space H 1 the inner product of Equation (104) with a square-integrable function a ( 1 ) ( u ) H 1 to obtain the following relation:
{ 0 u t h a ( 1 ) ( u ) [ d v ( 1 ) ( u ) d u + f 1 v ( 1 ) ( u ) ] d u } α 0 = { 0 u t h a ( 1 ) ( u ) [ ( δ f 2 ) δ ( u ) ( δ f 1 ) φ ( u ) ] d u } α 0 .
Using the definition of the adjoint operator in the Hilbert space H 1 , which in this case amounts to integration by parts of the left-side of Equation (108), obtain the following relation:
{ 0 u t h a ( 1 ) ( u ) [ d v ( 1 ) ( u ) d u + f 1 v ( 1 ) ( u ) ] d u } α 0 = { 0 u t h v ( 1 ) ( u ) [ d a ( 1 ) ( u ) d u + f 1 a ( 1 ) ( u ) ] d u } α 0 + { a ( 1 ) ( u t h ) v ( 1 ) ( u t h ) a ( 1 ) ( 0 ) v ( 1 ) ( 0 ) } α 0 .
Require the first term on right-side of Equation (109) to represent the indirect-effect term defined in Equation (101), to obtain the following relation:
{ d a ( 1 ) ( u ) d u + f 1 a ( 1 ) ( u ) } α 0 = { f 3 δ ( u u d ) } α 0 ; 0 < u u t h .
Implement the boundary conditions represented by Equation (105) into Equation (109) and eliminate the unknown boundary-value v ( 1 ) ( u t h ) from this relation by imposing the following boundary condition:
a ( 1 ) ( u t h ) = 0 , a t u = u t h .
The system of equations comprising Equation (110) together with the boundary condition represented Equation (111) is the 1st-Level Adjoint Sensitivity System (1st-LASS) and its solution a ( 1 ) ( u ) is the 1st-level adjoint sensitivity function.
Using Equation (101), together with the equations underlying the 1st-LASS and 1st-LVSS in Equation (108) reduces the latter to the following expression for the indirect-effect term:
{ δ R ( φ 0 , f 0 ; a ( 1 ) ) } i n d = { 0 u t h a ( 1 ) ( u ) [ ( δ f 2 ) δ ( u ) ( δ f 1 ) φ ( u ) ] d u } α 0 .
Adding the expression obtained in Equation (112) with the expression for the direct-effect term defined in Equation (100) yields the following expression for the total 1st-order variation δ R ( φ 0 , f 0 ; v ( 1 ) , δ f ) of the response R [ φ ( u ) ; f ( α ) ] with respect to the components of the feature function f ( α ) :
δ R ( φ 0 , f 0 ; v ( 1 ) , δ f ) = { 0 u t h a ( 1 ) ( u ) [ ( δ f 2 ) δ ( u ) ( δ f 1 ) φ ( u ) ] d u } α 0 + { ( δ f 3 ) 0 u t h φ ( u ) δ ( u u d ) d u } α 0 { δ R ( φ , f ; a ( 1 ) , δ f ) } α 0 .
The identity which appears in Equation (113) emphasizes the fact that “1st-level variational sensitivity function” v ( 1 ) ( u ) , which is expensive to compute, has been eliminated from the final expression of the 1st-order total variation { δ R ( φ , f ; a ( 1 ) , δ f ) } α 0 , being replaced by the dependence on the 1st-level adjoint sensitivity function a ( 1 ) ( u ) , which is independent of variations δ f ( α ) in the components of the feature function and is consequently also independent of any variations δ α in the primary model parameters. Hence, the 1st-LASS needs to be solved only once to determine the 1st-level adjoint sensitivity function a ( 1 ) ( u ) , which requires the same amount of computational effort as solving the original forward system for the function φ ( u ) .
The expressions of the sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the components of the feature function f ( α ) are given by the expressions that multiply the respective components of f ( α ) in Equation (113), namely:
R ( φ ; f ) f 1 = 0 u t h a ( 1 ) ( u ) φ ( u ) d u ;
R ( φ ; f ) f 2 = 0 u t h a ( 1 ) ( u ) δ ( u ) d u ,
R ( φ ; f ) f 3 = 0 u t h φ ( u ) δ ( u u d ) d u .
The above expressions are to be evaluated at the nominal parameter values α 0 but the indication { } α 0 has been omitted for simplicity.
Solving the 1st-LASS yields the following closed-form expression for the 1st-level adjoint sensitivity function a ( 1 ) ( u ) :
a ( 1 ) ( u ) = H ( u d u ) f 3 ( α ) exp [ ( u u d ) f 1 ( α ) ] ,
where the Heaviside functional has the usual meaning, namely: H ( u d u ) = 0 i f u > u d and H ( u d u ) = 1 i f u < u d .
Inserting the expression obtained in Equation (117) into Eqs. (114)‒(116) yields the following closed-form expressions for the sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the components of the feature function f ( α ) :
R ( φ ; f ) f 1 = u d f 2 ( α ) f 3 ( α ) exp [ u d f 1 ( α ) ] ;
R ( φ ; f ) f 2 = f 3 ( α ) exp [ u d f 1 ( α ) ] ,
R ( φ ; f ) f 3 = f 2 ( α ) exp [ u d f 1 ( α ) ] .
The correctness of the expressions obtained in Eqs. (118)‒(120) can be verified by directly differentiating the closed-form expression given in Equation (91).
Alternatively, the 1st-FASAM-L methodology could have been applied to the alternative expression, in terms of the adjoint slowing-down function, for the detector response provided in Equation (95). It can be verified that the final expressions for the response sensitivities with respect to the feature functions f i ( α ) , i = 1 , 2 , 3 , obtained by using Equation (95) as the starting point in conjunction with the adjoint slowing-down model are the same as obtained in Eqs. (118)‒(120).
The expressions of the first-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the primary model parameters α j , j = T P 3 M + 12 , as defined in Equation (88), are obtained by using the expressions obtained in Eqs. (118)‒(120) in conjunction with the chain rule of differentiation of the compound functions f i ( α ) , i = 1 , 2 , 3 . Note that the feature function f 3 ( α ) f 3 ( N d , σ d ) depends only on the parameters that characterize the detector, so the first-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the primary model parameters N d and σ d can be readily obtained by using Equation (120), as follows:
R ( φ ; f ) N d = R ( φ ; f ) f 3 f 3 N d = σ d f 2 ( α ) exp [ u d f 1 ( α ) ]
R ( φ ; f ) σ d = R ( φ ; f ) f 3 f 3 σ d = N d f 2 ( α ) exp [ u d f 1 ( α ) ]
Similarly, the primary model parameters ( λ 1 S , λ 2 S , N 1 S , N 2 S , F 1 S , F 2 S , ν 1 S , ν 2 S , W 1 S , W 2 S ) that characterize the neutron source distribution only appear through the definition of the feature function f 2 ( α ) S 0 ( α ) / ξ ¯ ( α ) Σ t ( α ) . It therefore follows that the first-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to these primary model parameters are obtained as follows:
R ( φ ; f ) λ i S = R ( φ ; f ) f 2 f 2 λ i S = N k S F k S ν k S W k S ξ ¯ ( α ) Σ t ( α ) f 3 ( α ) exp [ u d f 1 ( α ) ] , k = 1 , 2 ;
R ( φ ; f ) N k S = R ( φ ; f ) f 2 f 2 N k S = λ i S F k S ν k S W k S ξ ¯ ( α ) Σ t ( α ) f 3 ( α ) exp [ u d f 1 ( α ) ] , k = 1 , 2 ;
R ( φ ; f ) F k S = R ( φ ; f ) f 2 f 2 F k S = λ i S N k S ν k S W k S ξ ¯ ( α ) Σ t ( α ) f 3 ( α ) exp [ u d f 1 ( α ) ] , k = 1 , 2 ;
R ( φ ; f ) ν k S = R ( φ ; f ) f 2 f 2 ν k S = λ i S N k S F k S W k S ξ ¯ ( α ) Σ t ( α ) f 3 ( α ) exp [ u d f 1 ( α ) ] , k = 1 , 2 ;
R ( φ ; f ) W k S = R ( φ ; f ) f 2 f 2 W k S = λ i S N k S F k S ν k S ξ ¯ ( α ) Σ t ( α ) f 3 ( α ) exp [ u d f 1 ( α ) ] , k = 1 , 2.
On the other hand, the primary model parameters ( N m ( 1 ) , σ γ ( 1 ) , σ s ( 1 ) , ... , N m ( M ) , σ γ ( M ) , σ s ( M ) ) that characterize the composition of the homogenized material in which the neutrons slow-down appear through the definitions of both feature functions f 1 ( α ) Σ a ( α ) / ξ ¯ ( α ) Σ t ( α ) and f 2 ( α ) S 0 ( α ) / ξ ¯ ( α ) Σ t ( α ) . It therefore follows that the first-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to these primary model parameters are obtained as follows:
R ( φ ; f ) N m ( i ) = R ( φ ; f ) f 1 f 1 N m ( 1 ) + R ( φ ; f ) f 2 f 2 N m ( 1 ) ; i = 1 , ... , M ;
R ( φ ; f ) σ γ ( i ) = R ( φ ; f ) f 1 f 1 σ γ ( i ) + R ( φ ; f ) f 2 f 2 σ γ ( i ) ; i = 1 , ... , M ;
R ( φ ; f ) σ s ( i ) = R ( φ ; f ) f 1 f 1 σ s ( i ) + R ( φ ; f ) f 2 f 2 σ s ( i ) ; i = 1 , ... , M ;
The explicit differentiations in Eqs. (128)‒(130) are straightforward to perform, but are too lengthy to be presented here and are not material to applying the principles of the 1st-FASAM-L methodology.
In summary, the application of the 1st-FASAM-L to compute the first-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the primary model parameters α j , j = 1 , ... , T P 3 M + 12 , requires the following computations:
  • One “large-scale” computation to solve the 1st-LASS to obtain the 1st-level adjoint sensitivity function a ( 1 ) ( u ) .
  • Three “quadratures”, as indicated in Eqs. (114)‒(116), involving the 1st-level adjoint sensitivity function a ( 1 ) ( u ) to obtain the three sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the components f i ( α ) , i = 1 , 2 , 3 , of the feature function f ( α ) . These computations are inexpensive.
  • Chain-rule type differentiations using the definitions of the components f i ( α ) , i = 1 , 2 , 3 of the feature function f ( α ) , and the three sensitivities obtained in Eqs. (114)‒(116). These computations are inexpensive.

5.1.2. Application of the 1st-CASAM-L

The application of the 1st-CASAM-L methodology will yield the first-order response sensitivities directly with respect to the primary model parameters. These sensitivities will be obtained by determining the first-order Gateaux (G-) variation δ R ( φ 0 , α 0 ; v ( 1 ) , δ α ) of the response R ( φ , α ) as for variations v ( 1 ) ( u ) δ φ ( u ) and δ α around the phase-space point ( φ 0 , α 0 ) , using the definition provided in Equation (87), to obtain:
δ R ( φ 0 , α 0 ; v ( 1 ) , δ α ) { δ R ( φ 0 , α 0 ; δ α ) } d i r + { δ R ( φ 0 , α 0 ; v ( 1 ) ) } i n d { { d d ε [ ( N d + ε δ N d ) ( σ d + ε δ σ d ) 0 u t h ( φ 0 + ε v ( 1 ) ) δ ( u u d ) d u ] } ε = 0 } α 0 ,
where the “direct-effect” term { δ R ( φ 0 , α 0 ; δ α ) } d i r and, respectively, the “indirect-effect” term { δ R ( φ 0 , α 0 ; v ( 1 ) ) } i n d are defined as follows:
{ δ R ( φ 0 , α 0 ; δ f 3 ) } d i r { [ ( δ N d ) σ d + ( δ σ d ) N d ] 0 u t h φ ( u ) δ ( u u d ) d u } α 0 ,
{ δ R ( φ 0 , α 0 ; v ( 1 ) ) } i n d { N d σ d 0 u t h v ( 1 ) ( u ) δ ( u u d ) d u } α 0 .
The “1st-level variational sensitivity function” v ( 1 ) ( u ) is obtained as the solution of the “1st-Level Variational Sensitivity System (1st-LVSS)” obtained by taking the first-order G-differentials of the 1st-LFAS defined by Eqs. (80) and (81) to obtain:
{ d d ε [ d ( φ 0 + ε v ( 1 ) ) d u + Σ a ( α 0 + ε δ α ) ξ ¯ ( α 0 + ε δ α ) Σ t ( α 0 + ε δ α ) ( φ 0 + ε v ( 1 ) ) ] } ε = 0 = δ ( u ) { d d ε S 0 ( α 0 + ε δ α ) ξ ¯ ( α 0 + ε δ α ) Σ t ( α 0 + ε δ α ) } ε = 0 ,
 
Carrying out the differentiations with respect to ε in the above equations and setting ε = 0 in the resulting expressions yields the following 1st-LVSS:
d v ( 1 ) ( u ) d u + Σ a ( α 0 ) ξ ¯ ( α 0 ) Σ t ( α 0 ) v ( 1 ) ( u ) = δ ( u ) { i = 1 T P α i [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] δ α i } α 0 { φ ( u ) i = 1 T P α i [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] δ α i } α 0 ,
v ( 1 ) ( u ) = 0 ; a t u = 0.
To avoid solving the above the 1st-LVSS repeatedly, for every possible variation in the primary parameters, the appearance of the function v ( 1 ) ( u ) will be eliminated for the expression of the indirect-effect term by replacing it with the solution of the 1st-Level Adjoint Sensitivity System (1st-LASS) which will be constructed in the Hilbert space H 1 , as before: use Equation (107) to form the inner product of Equation (136) with a square-integrable function a ( 1 ) ( u ) H 1 to obtain the following relation:
{ 0 u t h a ( 1 ) ( u ) [ d v ( 1 ) ( u ) d u + Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) v ( 1 ) ( u ) ] d u } α 0 = { i = 1 T P α i [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] δ α i × 0 u t h a ( 1 ) ( u ) δ ( u ) d u } α 0 { i = 1 T P α i [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] δ α i 0 u t h a ( 1 ) ( u ) φ ( u ) d u } α 0 .
Integration by parts of the left-side of Equation (138) yields the following relation:
{ 0 u t h a ( 1 ) ( u ) [ d v ( 1 ) ( u ) d u + Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) v ( 1 ) ( u ) ] d u } α 0 = { a ( 1 ) ( u t h ) v ( 1 ) ( u t h ) } α 0 { a ( 1 ) ( 0 ) v ( 1 ) ( 0 ) } α 0 { 0 u t h v ( 1 ) ( u ) [ d a ( 1 ) ( u ) d u + Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) a ( 1 ) ( u ) ] d u } α 0 .
Requiring the first term on right-side of Equation (139) to represent the indirect-effect term defined in Equation (133) yields the following relation:
{ d a ( 1 ) ( u ) d u + Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) a ( 1 ) ( u ) } α 0 = { N d σ d δ ( u u d ) } α 0 .
Eliminate the unknown boundary-value v ( 1 ) ( u t h ) from Equation (139) by imposing the following boundary condition:
a ( 1 ) ( u t h ) = 0 , a t u = u t h .
The system of equations comprising Eqs. (140) and (141) is the 1st-Level Adjoint Sensitivity System (1st-LASS) and its solution a ( 1 ) ( u ) is the 1st-level adjoint sensitivity function. As already shown in the general 1st-FASAM methodology presented in Section 3, the 1st-LASS which arises within the framework of the 1st-CASAM-L is identical to the 1st-LASS that arises within the 1st-FASAM methodology, which is the reason underlying the use of the same notation for the 1st-level adjoint sensitivity function, namely a ( 1 ) ( u ) , in both cases.
Implementing the equations underlying the 1st-LASS and 1st-LVSS into Equation (138) and recalling the expression of the indirect-effect term provided in Equation (133) yields the following expression for the indirect-effect term:
{ δ R ( φ 0 , α 0 ; a ( 1 ) ) } i n d = { i = 1 T P α i [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] δ α i 0 u t h a ( 1 ) ( u ) δ ( u ) d u } α 0 { i = 1 T P α i [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] δ α i 0 u t h a ( 1 ) ( u ) φ ( u ) d u } α 0 .
Adding the expression obtained in Equation (142) with the expression for the direct-effect term defined in Equation (132) yields the following expression for the total 1st-order variation δ R ( φ 0 , f 0 ; v ( 1 ) , δ f ) of the response R [ φ ( u ) ; f ( α ) ] with respect to the components of the feature function f ( α ) :
δ R ( φ 0 , α 0 ; v ( 1 ) , δ α ) = { i = 1 T P α i [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] δ α i 0 u t h a ( 1 ) ( u ) δ ( u ) d u } α 0 { i = 1 T P α i [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] δ α i 0 u t h a ( 1 ) ( u ) φ ( u ) d u } α 0 + { [ ( δ N d ) σ d + ( δ σ d ) N d ] 0 u t h φ ( u ) δ ( u u d ) d u } α 0 { δ R ( φ , α ; a ( 1 ) , δ α ) } α 0 .
The identity which appears in Equation (143) emphasizes the fact that “1st-level variational sensitivity function” v ( 1 ) ( u ) , which is expensive to compute, has been eliminated from the final expression of the 1st-order total variation { δ R ( φ , α ; a ( 1 ) , δ α ) } α 0 , being replaced by the dependence on the 1st-level adjoint sensitivity function a ( 1 ) ( u ) , which is independent of any variations δ α in the primary model parameters. Hence, the 1st-LASS needs to be solved only once to determine the 1st-level adjoint sensitivity function a ( 1 ) ( u ) , which requires the same amount of computational effort as solving the original forward system for the function φ ( u ) .
The expressions of the first-order sensitivities of the response R [ φ ( u ) ; α ] with respect to the primary model parameters are the expressions that multiply the corresponding parameter variations δ α i in Equation (143). In particular, the (two) first-order sensitivities of the response R [ φ ( u ) ; α ] with respect to the primary model parameters underlying the detector’s interaction cross section arise solely from the direct-effect term and have the following expressions:
R ( φ ; α ) N d = σ d 0 u t h φ ( u ) δ ( u u d ) d u = σ d φ ( u d ) ;
R ( φ ; α ) σ d = N d 0 u t h φ ( u ) δ ( u u d ) d u = N d φ ( u d ) .
The above expressions are to be evaluated at the nominal parameter values α 0 but the indication { } α 0 has been omitted for simplicity. As expected, the above expressions are identical to the corresponding expressions obtained using the 1st-FASAM-L, as provided in Eqs. (121) and (122), respectively.
The first-order sensitivities of the response R [ φ ( u ) ; α ] with respect to the primary model parameters underlying the spontaneous fission source arise solely from the first term on the right-side of Equation (143) and have the following expressions in terms of the 1st-level adjoint sensitivity function a ( 1 ) ( u ) :
R ( φ ; α ) λ k S = N k S F k S ν k S W k S ξ ¯ ( α ) Σ t ( α ) 0 u t h a ( 1 ) ( u ) δ ( u ) d u , k = 1 , 2 ;
R ( φ ; f ) N k S = λ i S F k S ν k S W k S ξ ¯ ( α ) Σ t ( α ) 0 u t h a ( 1 ) ( u ) δ ( u ) d u , k = 1 , 2 ;
R ( φ ; f ) F k S = λ i S N k S ν k S W k S ξ ¯ ( α ) Σ t ( α ) 0 u t h a ( 1 ) ( u ) δ ( u ) d u , k = 1 , 2 ;
R ( φ ; f ) ν k S = λ i S N k S F k S W k S ξ ¯ ( α ) Σ t ( α ) 0 u t h a ( 1 ) ( u ) δ ( u ) d u , k = 1 , 2 ;
R ( φ ; f ) W k S = λ i S N k S F k S ν k S ξ ¯ ( α ) Σ t ( α ) 0 u t h a ( 1 ) ( u ) δ ( u ) d u , k = 1 , 2.
As expected, the above expressions are identical to the corresponding expressions obtained using the 1st-FASAM-L, as provided in Eqs. (123)‒(127), respectively.
The first-order sensitivities of the response R [ φ ( u ) ; α ] with respect to the primary model parameters ( N m ( 1 ) , σ γ ( 1 ) , σ s ( 1 ) , ... , N m ( M ) , σ γ ( M ) , σ s ( M ) ) that characterize the composition of the homogenized material in which the neutrons slow-down arise from both the first and the second terms on the right-side of Equation (143) and have the following expressions in terms of the 1st-level adjoint sensitivity function a ( 1 ) ( u ) :
R ( φ ; f ) N m ( i ) = [ 0 u t h a ( 1 ) ( u ) δ ( u ) d u ] N m ( i ) [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] [ 0 u t h a ( 1 ) ( u ) φ ( u ) d u ] N m ( i ) [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] ; i = 1 , ... , M ;
R ( φ ; f ) σ γ ( i ) = [ 0 u t h a ( 1 ) ( u ) δ ( u ) d u ] σ γ ( i ) [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] [ 0 u t h a ( 1 ) ( u ) φ ( u ) d u ] σ γ ( i ) [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] ; i = 1 , ... , M ;
R ( φ ; f ) σ s ( i ) = [ 0 u t h a ( 1 ) ( u ) δ ( u ) d u ] σ s ( i ) [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] [ 0 u t h a ( 1 ) ( u ) φ ( u ) d u ] σ s ( i ) [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] ; i = 1 , ... , M .
As expected, the above expressions are identical to the corresponding expressions obtained using the 1st-FASAM-L, as provided in Eqs. (128)‒(130), respectively.
In summary, the application of the 1st-CASAM-L to compute the first-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the primary model parameters α j , j = 1 , ... , T P 3 M + 12 , requires the following computations:
  • One “large-scale” computation to solve the 1st-LASS to obtain the 1st-level adjoint sensitivity function a ( 1 ) ( u ) . As has been already remarked, this 1st-LASS is exactly the same as the 1st-LASS needed within the 1st-FASAM-L methodology for computing the first-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the components f i ( α ) , i = 1 , 2 , 3 , of the feature function f ( α ) .
  • A total of T P 3 M + 12 “quadratures” involving the 1st-level adjoint sensitivity function a ( 1 ) ( u ) to obtain numerically the T P 3 M + 12 sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the primary model parameters α j , j = 1 , ... , T P 3 M + 12 . These numerical computations are inexpensive by comparison to solving the 1st-LASS but are more expensive than performing “chain-rule”-type differentiation “on paper,” as performed if applying the 1st-FASAM-L. Hence, the 1st-FASAM-L methodology enjoys a slight computational advantage over the 1st-CASAM-L methodology.

5.2. Second-Order Adjoint Sensitivity Analysis: 2nd-FASAM-L Versus 2nd-CASAM-L

In this Subsection, the computation of the first-order sensitivities of the response R ( α ) with respect to the primary model parameters will first be demonstrated by using the 2nd-FASAM-L. Subsequently, the same first-order sensitivities will be obtained by using the 2nd-CASAM-L and the two alternative paths will be compared to each other, showing that the same expressions are obtained for the respective sensitivities, as expected. Both the 2nd-FASAM-L and the 2nd-CASAM-L methodologies obtain the second-order sensitivities by considering the first-order G-differential of each of the first-order sensitivities. Therefore, the 2nd-FASAM-L methodology will provide significant computational advantages by comparison to the 2nd-CASAM-L methodology the since it will require at most 3 large-scale computations, i.e., the same number of large-scale computations as the number of components f i ( α ) , i = 1 , 2 , 3 , of the feature function f ( α ) . In contradistinction, the 2nd-CASAM-L methodology will require one large-scale (adjoint) computation for each primary model parameter α j , j = 1 , ... , T P 3 M + 12 , amounting to a total of number of T P 3 M + 12 large-scale computations.

5.2.1. Application of the 2nd-FASAM-L

As has been shown in Section 4, the 2nd-FASAM-L methodology generically determines the 2nd-order sensitivities 2 R [ u ( 1 ) ( 2 ; x ) ; f ( α ) ] / f j 2 f j 1 of the response with respect to the components of the “feature” function f ( α ) by conceptually considering that the first-order sensitivities R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] R [ u ( 1 ) ( 2 ; x ) ; f ( α ) ] / f j 1 , are “model responses.” Consequently, the 2nd-order sensitivities are obtained as the “1st-order sensitivities of the 1st-order sensitivities” by applying the concepts underlying 1st-FASAM to each 1st-order sensitivity R ( 1 ) [ j 1 ; u ( 1 ) ( 2 ; x ) ; a ( 1 ) ( 2 ; x ) ; f ( α ) ] , j 1 = 1 , ... , T F .

5.2.1.1. Second-Order Sensitivities Stemming From

R ( φ ; f ) / f 1 The above principles will be applied to the first-order sensitivity R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; f ( α ) ] R ( φ ; f ) / f 1 expressed by Equation (114) to obtain the second-order sensitivities of the form 2 R ( φ ; f ) / f j f 1 , j = 1 , 2 , 3 . The argument “1” in the notation R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; f ( α ) ] indicates that this sensitivity is with respect to the first component, namely f 1 ( α ) , of the feature function f ( α ) , while also depending on the functions φ ( u ) and a ( 1 ) ( u ) . These functions are the solutions of the “2nd-Level Forward/Adjoint System (2nd-LFAS)” which is obtained by concatenating the original 1st-Level Forward/Adjoint System (1st-LFAS) with the 1st-Level Adjoint Sensitivity System (1st-LASS), cf. Eqs. (98), (81), (110) and (111), as reproduced below:
d φ ( u ) d u + f 1 φ ( u ) = f 2 δ ( u ) ; 0 < u u t h ;
d a ( 1 ) ( u ) d u + f 1 a ( 1 ) ( u ) = f 3 δ ( u u d ) ; 0 < u u t h ;
φ ( 0 ) = 0 ; a t u = 0 ; a ( 1 ) ( u t h ) = 0 , a t u = u t h .
The first-order G-differential of R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; f ( α ) ] is obtained from Equation (114), by definition, as follows:
{ δ R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; v ( 1 ) ( u ) ; δ a ( 1 ) ( u ) ; f ; δ f ] } α 0 { d d ε 0 u t h [ a ( 1 ) ( u ) + ε δ a ( 1 ) ( u ) ] [ φ ( u ) + ε v ( 1 ) ( u ) ] d u } α 0 , ε = 0 = { 0 u t h [ v ( 1 ) ( u ) a ( 1 ) ( u ) + δ a ( 1 ) ( u ) φ ( u ) ] d u } α 0 j = 1 3 2 R ( φ ; f ) f j f 1 ( δ f j ) .
Note that the first-order G-differential { δ R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; v ( 1 ) ( u ) ; δ a ( 1 ) ( u ) ; f ( α ) ] } α 0 consists solely of the “indirect-effect term”; there is no “direct-effect” term since there is no explicit dependence on variations δ f ( α ) .
The variational functions v ( 1 ) ( u ) and δ a ( 1 ) ( u ) are the solutions of the system of equations obtained by taking the first-G-differential of the 2nd-LFAS. Applying the definition of the first G-differential to the equations underlying the 2nd-LFAS yields the following 2nd-Level Variational Sensitivity System (2nd-LVSS)” for the 2nd-level variational sensitivity function v ( 2 ) ( 2 ; u ) [ v ( 1 ) ( u ) , δ a ( 1 ) ( u ) ] :
{ d v ( 1 ) ( u ) d u + f 1 v ( 1 ) ( u ) } α 0 = { ( δ f 2 ) δ ( u ) ( δ f 1 ) φ ( u ) } α 0 ,
{ d [ δ a ( 1 ) ( u ) ] d u + f 1 [ δ a ( 1 ) ( u ) ] } α 0 = { ( δ f 3 ) δ ( u u d ) ( δ f 1 ) a ( 1 ) ( u ) } α 0 ; 0 < u u t h ;
v ( 1 ) ( u ) = 0 , a t u = 0 ; δ a ( 1 ) ( u t h ) = 0 , a t u = u t h .
The above 2nd-LVSS would need to be solved repeatedly, for every possible variation in the feature functions f i ( α ) , i = 1 , 2 , 3 . This need is circumvented by deriving an alternative expression for { δ R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; v ( 1 ) ( u ) ; δ a ( 1 ) ( u ) ; f ( α ) ] } α 0 , in which the 2nd-level variational function v ( 2 ) ( 2 ; u ) [ v ( 1 ) ( u ) , δ a ( 1 ) ( u ) ] is replaced by a 2nd-level adjoint sensitivity function which will be independent of variations in the feature functions f i ( α ) , i = 1 , 2 , 3 . This 2nd-level adjoint sensitivity function will be the solution of a 2nd-Level Adjoint Sensitivity System (2nd-LASS) to be constructed below by following the steps generally outlined in Section 5, in a Hilbert space, denoted as H 2 , which is endowed with the following inner product, denoted as χ ( 2 ) ( 2 ; u ) , θ ( 2 ) ( 2 ; u ) 2 , between two elements, χ ( 2 ) ( 2 ; u ) [ χ 1 ( 2 ) ( u ) , χ 2 ( 2 ) ( u ) ] H 2 and θ ( 2 ) ( 2 ; u ) [ θ 1 ( 2 ) ( u ) , θ 2 ( 2 ) ( u ) ] H 2 :
χ ( 2 ) ( 2 ; u ) , θ ( 2 ) ( 2 ; u ) 2 0 u t h [ χ 1 ( 2 ) ( u ) θ 1 ( 2 ) ( u ) + χ 2 ( 2 ) ( u ) θ 2 ( 2 ) ( u ) ] d u .
Using Equation(161) to form the inner product in the Hilbert space H 2 of the 2nd-LVSS, cf. Eqs. (158) and (159), with a yet undefined function a ( 2 ) ( 2 ; 1 ; u ) [ a 1 ( 2 ) ( 2 ; 1 ; u ) , a 2 ( 2 ) ( 2 ; 1 ; u ) ] H 2 yields the following relation:
0 u t h a 1 ( 2 ) ( 2 ; 1 ; u ) [ d v ( 1 ) ( u ) d u + f 1 v ( 1 ) ( u ) ] d u + 0 u t h a 2 ( 2 ) ( 2 ; 1 ; u ) [ d d u δ a ( 1 ) ( u ) + f 1 δ a ( 1 ) ( u ) ] d u = 0 u t h a 1 ( 2 ) ( 2 ; 1 ; u ) [ ( δ f 2 ) δ ( u ) ( δ f 1 ) φ ( u ) ] d u + 0 u t h a 2 ( 2 ) ( 2 ; 1 ; u ) [ ( δ f 3 ) δ ( u u d ) ( δ f 1 ) a ( 1 ) ( u ) ] d u .
The above relation holds for the nominal parameter values, but the notation { } α 0 has been omitted, for simplicity.
Using the definition of the adjoint operator in the Hilbert space H 2 , which amounts to integration by parts, recast the left-side of Equation (162) into the form below:
0 u t h a 1 ( 2 ) ( 2 ; 1 ; u ) [ d v ( 1 ) ( u ) d u + f 1 v ( 1 ) ( u ) ] d u + 0 u t h a 2 ( 2 ) ( 2 ; 1 ; u ) [ d d u δ a ( 1 ) ( u ) + f 1 δ a ( 1 ) ( u ) ] d u = 0 u t h v ( 1 ) ( u ) [ d a 1 ( 2 ) ( 2 ; 1 ; u ) d u + f 1 a 1 ( 2 ) ( 2 ; 1 ; u ) ] d u + 0 u t h δ a ( 1 ) ( u ) [ d a 2 ( 2 ) ( 2 ; 1 ; u ) d u + f 1 a 2 ( 2 ) ( 2 ; 1 ; u ) ] d u + a 1 ( 2 ) ( 2 ; 1 ; u t h ) v ( 1 ) ( u t h ) a 1 ( 2 ) ( 2 ; 1 ; 0 ) v ( 1 ) ( 0 ) a 2 ( 2 ) ( 2 ; 1 ; u t h ) δ a ( 1 ) ( u t h ) + a 2 ( 2 ) ( 2 ; 1 ; 0 ) δ a ( 1 ) ( 0 ) .
The first two terms on right-side of Equation (163) are now required to represent the G-differential { δ R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; v ( 1 ) ( u ) ; δ a ( 1 ) ( u ) ; f ( α ) ] } α 0 defined in Equation (157), which yields the following relations:
( d / d u + f 1 0 0 d / d u + f 1 ) ( a 1 ( 2 ) ( 2 ; 1 ; u ) a 2 ( 2 ) ( 2 ; 1 ; u ) ) = ( a ( 1 ) ( u ) φ ( u ) ) .
The definition of the vector a ( 2 ) ( 2 ; 1 ; u ) [ a 1 ( 2 ) ( 2 ; 1 ; u ) , a 2 ( 2 ) ( 2 ; 1 ; u ) ] will now be completed by selecting boundary conditions so as to eliminate the unknown values v ( 1 ) ( u t h ) and δ a ( 1 ) ( 0 ) in Equation (163). This is accomplished by imposing the following boundary conditions:
a 1 ( 2 ) ( 2 ; 1 ; u t h ) = 0 ; a 2 ( 2 ) ( 2 ; 1 ; 0 ) = 0.
The system of equations represented by Eqs. (164) and (165) constitute the 2nd-Level Adjoint Sensitivity System (2nd-LASS) for the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 1 ; u ) [ a 1 ( 2 ) ( 2 ; 1 ; u ) , a 2 ( 2 ) ( 2 ; 1 ; u ) ] . It is important to note that the 2nd-LASS is independent of any variations, δ f , in the components of the feature function and, hence, is independent of any parameter variations, δ α , as well.
The equations underlying the 2nd-LASS, together with the equations underlying the 2nd-LVSS, are now employed in Equation(162), in conjunction with Equation (163), to obtain the following expression for the G-differential { δ R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; v ( 1 ) ( u ) ; δ a ( 1 ) ( u ) ; f ( α ) ] } α 0 in terms of the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 1 ; u ) [ a 1 ( 2 ) ( 2 ; 1 ; u ) , a 2 ( 2 ) ( 2 ; 1 ; u ) ] :
{ δ R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; v ( 1 ) ( u ) ; δ a ( 1 ) ( u ) ; f ( α ) ] } α 0 = ( δ f 2 ) 0 u t h a 1 ( 2 ) ( 2 ; 1 ; u ) δ ( u ) d u ( δ f 1 ) 0 u t h a 1 ( 2 ) ( 2 ; 1 ; u ) φ ( u ) d u + ( δ f 3 ) 0 u t h a 2 ( 2 ) ( 2 ; 1 ; u ) δ ( u u d ) d u ( δ f 1 ) 0 u t h a 2 ( 2 ) ( 2 ; 1 ; u ) a ( 1 ) ( u ) d u j = 1 3 2 R ( φ ; f ) f j f 1 ( δ f j ) { δ R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; a ( 2 ) ( 2 ; 1 ; u ) ; f ( α ) ] } α 0 .
As the last equality (identity) in Equation (166) indicates, the 2nd-level variational sensitivity function v ( 2 ) ( 2 ; u ) [ v ( 1 ) ( u ) , δ a ( 1 ) ( u ) ] has been eliminated from the final expression of the G-differential { δ R ( 1 ) [ 1 ; φ ( u ) ; a ( 1 ) ( u ) ; v ( 1 ) ( u ) ; δ a ( 1 ) ( u ) ; f ( α ) ] } α 0 , having been replaced by the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 1 ; u ) [ a 1 ( 2 ) ( 2 ; 1 ; u ) , a 2 ( 2 ) ( 2 ; 1 ; u ) ] . Identifying in Equation (166) the expressions that multiply the variations δ f i , i = 1 , 2 , 3 , yields the following expressions for the second-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the components of the feature function f ( α ) :
2 R ( φ ; f ) f 1 f 1 = 0 u t h a 1 ( 2 ) ( 2 ; 1 ; u ) φ ( u ) d u 0 u t h a 2 ( 2 ) ( 2 ; 1 ; u ) a ( 1 ) ( u ) d u ;
2 R ( φ ; f ) f 2 f 1 = 0 u t h a 1 ( 2 ) ( 2 ; 1 ; u ) δ ( u ) d u ;
2 R ( φ ; f ) f 3 f 1 = 0 u t h a 2 ( 2 ) ( 2 ; 1 ; u ) δ ( u u d ) d u .
The 2nd-LASS can be solved to obtain the following closed-form expressions for the components of the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 1 ; u ) [ a 1 ( 2 ) ( 2 ; 1 ; u ) , a 2 ( 2 ) ( 2 ; 1 ; u ) ] ;
a 1 ( 2 ) ( 2 ; 1 ; u ) = f 3 ( α ) ( u d u ) H ( u d u ) exp [ ( u u d ) f 1 ( α ) ] ,
a 2 ( 2 ) ( 2 ; 1 ; u ) = u f 2 ( α ) exp [ u f 1 ( α ) ] ,
Inserting the expressions obtained in Eqs. (170) and (171) into Eqs. (167)‒(169) and performing the respective integrations yields the following expressions for the respective second-order sensitivities:
2 R ( φ ; f ) f 1 f 1 = ( u d ) 2 f 2 ( α ) f 3 ( α ) exp [ u d f 1 ( α ) ] ;
2 R ( φ ; f ) f 2 f 1 = u d f 3 ( α ) exp [ u d f 1 ( α ) ] ;
2 R ( φ ; f ) f 3 f 1 = u d f 2 ( α ) exp [ u d f 1 ( α ) ] ;
The correctness of the expressions obtained in Eqs. (172)‒(174) can be verified by directly differentiating the closed-form expressions provided in Eqs. (118)‒(120).

5.2.1.2. Second-Order Sensitivities Stemming From R ( φ ; f ) / f 2

Applying the procedure used in the previous Subsection to the first-order sensitivity R ( 1 ) [ 2 ; a ( 1 ) ( u ) ; f ( α ) ] R ( φ ; f ) / f 2 expressed by Equation (115) will provide the second-order sensitivities of the form 2 R ( φ ; f ) / f j f 2 , j = 1 , 2 , 3 . The argument “2” in the notation R ( 1 ) [ 2 ; a ( 1 ) ( u ) ; f ( α ) ] indicates that this sensitivity is with respect to the second component, namely f 2 ( α ) , of the feature function f ( α ) . Remarkably, this sensitivity does not depend on the forward function φ ( u ) but only depends on the 1st-level adjoint sensitivity function a ( 1 ) ( u ) . As previously discussed, these functions are the solutions of the “2nd-Level Forward/Adjoint System (2nd-LFAS).”
The first-order G-differential of R ( 1 ) [ 2 ; a ( 1 ) ( u ) ; f ( α ) ] is obtained by applying its definition to Equation (115), as follows:
{ δ R ( 1 ) [ 2 ; a ( 1 ) ( u ) ; δ a ( 1 ) ( u ) ; f ( α ) ] } α 0 = { 0 u t h δ a ( 1 ) ( u ) δ ( u ) d u } α 0 j = 1 3 2 R ( φ ; f ) f j f 2 ( δ f j ) . .
As indicated in Equation (175), the first-order G-differential { δ R ( 1 ) [ 2 ; a ( 1 ) ( u ) ; δ a ( 1 ) ( u ) ; f ( α ) ] } α 0 consists solely of the indirect-effect term which depends on the 1st-level variational function δ a ( 1 ) ( u ) . As before, the need for computing this variational function is circumvented by constructing a 2nd-Level Adjoint Sensitivity System (2nd-LASS) for a 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 2 ; u ) [ a 1 ( 2 ) ( 2 ; 2 ; u ) , a 2 ( 2 ) ( 2 ; 2 ; u ) ] , by implementing the same steps as outlined above for obtaining the 2nd-order sensitivities that stem from the first-order sensitivity R ( φ ; f ) / f 2 , namely Eqs. (162)‒(165). These steps will not be repeated here in detail; they lead to the following 2nd-LASS for 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 2 ; u ) [ a 1 ( 2 ) ( 2 ; 2 ; u ) , a 2 ( 2 ) ( 2 ; 2 ; u ) ] :
Preprints 102806 i001
a 1 ( 2 ) ( 2 ; 2 ; u t h ) = 0 ; a 2 ( 2 ) ( 2 ; 2 ; 0 ) = 0.
Solving the 2nd-LASS defined by Eqs. (176) and (177) yields the following closed-form expressions for the components of the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 2 ; u ) [ a 1 ( 2 ) ( 2 ; 2 ; u ) , a 2 ( 2 ) ( 2 ; 2 ; u ) ] :
a 1 ( 2 ) ( 2 ; 2 ; u ) = 0 ; a 2 ( 2 ) ( 2 ; 2 ; u ) = H ( u ) exp [ u f 1 ( α ) ]
.
The alternative expression of the G-differential { δ R ( 1 ) [ 2 ; a ( 1 ) ( u ) ; a ( 2 ) ( 2 ; 2 ; u ) ; δ f ( α ) ] } α 0 in terms of the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 2 ; u ) [ a 1 ( 2 ) ( 2 ; 2 ; u ) , a 2 ( 2 ) ( 2 ; 2 ; u ) ] has the following form (which is obtained by implementing the same steps as those leading to Equation (166), above):
{ δ R ( 1 ) [ 2 ; a ( 1 ) ( u ) ; a ( 2 ) ( 2 ; 2 ; u ) ; δ f ( α ) ] } α 0 = ( δ f 3 ) 0 u t h a 2 ( 2 ) ( 2 ; 2 ; u ) δ ( u u d ) d u ( δ f 1 ) 0 u t h a 2 ( 2 ) ( 2 ; 2 ; u ) a ( 1 ) ( u ) d u j = 1 3 2 R ( φ ; f ) f j f 2 ( δ f j ) .
Identifying in Equation (179) the expressions that multiply the variations δ f i , i = 1 , 2 , 3 , yields the following second-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the components of the feature function f ( α ) :
2 R ( φ ; f ) f 1 f 2 = 0 u t h a 2 ( 2 ) ( 2 ; 2 ; u ) a ( 1 ) ( u ) d u ;
2 R ( φ ; f ) f 2 f 2 = 0 ;
2 R ( φ ; f ) f 3 f 2 = 0 u t h a 2 ( 2 ) ( 2 ; 2 ; u ) δ ( u u d ) d u .
Inserting the expression obtained for a 2 ( 2 ) ( 2 ; 2 ; u ) in Equation (178) into Eqs. (180) and (182), and performing the respective integrations yields the following expressions for the respective second-order sensitivities:
2 R ( φ ; f ) f 1 f 2 = u d f 3 ( α ) exp [ u d f 1 ( α ) ] ;
2 R ( φ ; f ) f 3 f 2 = exp [ u d f 1 ( α ) ] ;
The correctness of the expressions obtained in Eqs. (183) and (184) can be verified by directly differentiating accordingly the closed-form expressions given in Eqs. (118)‒(120).

5.2.1.3. Second-Order Sensitivities Stemming From R ( φ ; f ) / f 3

Applying the above principles to the first-order sensitivity R ( 1 ) [ 3 ; φ ( u ) ; a ( 1 ) ( u ) ; f ( α ) ] R ( φ ; f ) / f 3 obtained in Equation (116) will provide the second-order sensitivities of the form 2 R ( φ ; f ) / f j f 3 , j = 1 , 2 , 3 . The argument “3” in the notation R ( 1 ) [ 3 ; φ ( u ) ; a ( 1 ) ( u ) ; f ( α ) ] indicates that this sensitivity is with respect to the third component, namely f 3 ( α ) , of the feature function f ( α ) . Notably, this sensitivity depends on the forward function φ ( u ) but does not depend on the 1st-level adjoint sensitivity function a ( 1 ) ( u ) .
The first-order G-differential of R ( 1 ) [ 3 ; φ ( u ) ; a ( 1 ) ( u ) ; f ( α ) ] is obtained, by definition, as follows:
{ δ R ( 1 ) [ 3 ; φ ( u ) ; v ( 1 ) ( u ) ; f ; δ f ] } α 0 = { 0 u t h v ( 1 ) ( u ) δ ( u u d ) d u } α 0 j = 1 3 2 R ( φ ; f ) f j f 3 ( δ f j ) .
Note that the first-order G-differential { δ R ( 1 ) [ 3 ; φ ( u ) ; v ( 1 ) ( u ) ; f ; δ f ] } α 0 consists solely of the indirect-effect term. As before, the need for computing the variational function v ( 1 ) ( u ) is circumvented by constructing a 2nd-Level Adjoint Sensitivity System (2nd-LASS) for a 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 3 ; u ) [ a 1 ( 2 ) ( 2 ; 3 ; u ) , a 2 ( 2 ) ( 2 ; 3 ; u ) ] , by implementing the same steps were used for obtaining the previous 2nd-order sensitivities. These steps lead to the following 2nd-LASS for 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 3 ; u ) [ a 1 ( 2 ) ( 2 ; 3 ; u ) , a 2 ( 2 ) ( 2 ; 3 ; u ) ] :
( d / d u + f 1 0 0 d / d u + f 1 ) ( a 1 ( 2 ) ( 2 ; 3 ; u ) a 2 ( 2 ) ( 2 ; 3 ; u ) ) = ( δ ( u u d ) 0 ) .
a 1 ( 2 ) ( 2 ; 2 ; u t h ) = 0 ; a 2 ( 2 ) ( 2 ; 2 ; 0 ) = 0.
Solving the 2nd-LASS defined by Eqs. (186) and (187) yields the following closed-form expressions for the components of the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 3 ; u ) [ a 1 ( 2 ) ( 2 ; 3 ; u ) , a 2 ( 2 ) ( 2 ; 3 ; u ) ] :
a 1 ( 2 ) ( 2 ; 3 ; u ) = H ( u d u ) exp [ ( u u d ) f 1 ( α ) ] ; a 2 ( 2 ) ( 2 ; 3 ; u ) = 0
In terms of the 2nd-level adjoint sensitivity function a ( 2 ) ( 2 ; 3 ; u ) [ a 1 ( 2 ) ( 2 ; 3 ; u ) , a 2 ( 2 ) ( 2 ; 3 ; u ) ] the alternative expression of the G-differential { δ R ( 1 ) [ 3 ; φ ( u ) ; a ( 2 ) ( 2 ; 3 ; u ) ; f ; δ f ] } α 0 has the following form (which is obtained by implementing the same steps as those leading to Equation (166), above):
{ δ R ( 1 ) [ 3 ; φ ( u ) ; a ( 2 ) ( 2 ; 3 ; u ) ; f ; δ f ] } α 0 = ( δ f 2 ) 0 u t h a 1 ( 2 ) ( 2 ; 3 ; u ) δ ( u ) d u ( δ f 1 ) 0 u t h a 1 ( 2 ) ( 2 ; 3 ; u ) φ ( u ) d u j = 1 3 2 R ( φ ; f ) f j f 3 ( δ f j ) .
Identifying in Equation (189) the expressions that multiply the variations δ f i , i = 1 , 2 , 3 , yields the following second-order sensitivities of the response R [ φ ( u ) ; f ( α ) ] with respect to the components of the feature function f ( α ) :
2 R ( φ ; f ) f 1 f 3 = 0 u t h a 1 ( 2 ) ( 2 ; 3 ; u ) φ ( u ) d u ;
2 R ( φ ; f ) f 2 f 3 = 0 u t h a 1 ( 2 ) ( 2 ; 3 ; u ) δ ( u ) d u ;
2 R ( φ ; f ) f 3 f 3 = 0 .
Inserting the expression obtained for a 1 ( 2 ) ( 2 ; 3 ; u ) in Equation (188) into Eqs. (190) and (191), and performing the respective integrations yields the following expressions for the respective second-order sensitivities:
2 R ( φ ; f ) f 1 f 3 = u d f 2 ( α ) exp [ u d f 1 ( α ) ] ;
2 R ( φ ; f ) f 2 f 3 = exp [ u d f 1 ( α ) ]
The correctness of the expressions obtained in Eqs. (193) and (194) can be verified by directly differentiating the closed-form expressions given in Eqs. (118)‒(120).
Summarizing the results obtained in Subsection 5.2.1 leads to the following conclusions:
  • The second-order sensitivities 2 R ( φ ; f ) / f i f j , i , j = 1 , 2 , 3 , of the model response with respect to the three features components f i ( α ) , i = 1 , 2 , 3 , of the feature function f ( α ) are obtained by performing 3 “large-scale” computations to solve the 3 corresponding 2nd-LASS which all have the same left-side but have differing sources on their right-sides. The source-term for each of these 2nd-LASS corresponds to one of the 3 first-order sensitivities. Thus, computing the second-order sensitivities 2 R ( φ ; f ) / f i f j requires as many “large-scale” computations as there are non-zero first-order sensitivities, i.e., at most as many “large-scale” computations as there are components f i ( α ) , i = 1 , 2 , 3 , of the feature function f ( α ) .
  • The mixed second-order sensitivities 2 R ( φ ; f ) / f i f j , i j = 1 , 2 , 3 , are computed twice, involving distinct 2nd-level adjoint sensitivity functions. Therefore, the symmetry property 2 R ( φ ; f ) / f i f j = 2 R ( φ ; f ) / f j f i provides an intrinsic mechanism for verifying the accuracy of the computations of the respective 2nd-level adjoint sensitivity functions.
  • The unmixed second order sensitivities 2 R ( φ ; f ) / f i f i i = 1 , 2 , 3 , are computed just once.

5.2.2. Application of the 2nd-CASAM-L

The principles underlying the application of the 2nd-CASAM-L methodology are the same as those underlying the 2nd-FASAM-L methodology: both methodologies obtain the second-order sensitivities by considering the first-order G-differential of each of the first-order sensitivities. As has been shown in the foregoing, the 2nd-FASAM-L methodology requires at most 3 large-scale computations (i.e., the same number of large-scale computations as the number of components f i , i = 1 , 2 , 3 , of the feature function f ) for solving the three 2nd-Level Adjoint Sensitivity Systems that arise by considering the three first-order sensitivities of the detector response with respect to the three components of the feature function f ( α ) . In contradistinction, the 2nd-CASAM-L methodology requires one large-scale (adjoint) computation for each primary model parameter α j , j = 1 , ... , T P 3 M + 12 , amounting to a total of number of T P 3 M + 12 large-scale computations. The specific computations are described below.

5.2.2.1. Second-Order Sensitivities Stemming from the First-Order Sensitivities with Respect to the Medium’s Material Properties

The expressions of the first-order sensitivities of the detector response with respect to the material properties (i.e., microscopic cross sections and atomic number densities) of the medium in which the neutrons are slowing-down (i.e., losing energy or, equivalently, gaining lethargy) are provided in Eqs. (151)‒(153). These expressions have the following generic form:
R ( φ ; f ) a j ( i ) = g j ( i ) ( α ) 0 u t h a ( 1 ) ( u ) δ ( u ) d u h j ( i ) ( α ) 0 u t h a ( 1 ) ( u ) φ ( u ) d u ; i = 1 , ... , M ; j = 1 , 2 , 3 ,
where a 1 ( i ) N m ( i ) , a 2 ( i ) σ γ ( i ) , a 3 ( i ) σ s ( i ) , and
g 1 ( i ) ( α ) N m ( i ) [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] ; h 1 ( i ) ( α ) N m ( i ) [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] ;
g 2 ( i ) ( α ) σ γ ( i ) [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] ; h 2 ( i ) ( α ) σ γ ( i ) [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] ;
g 3 ( i ) ( α ) σ s ( i ) [ S 0 ( α ) ξ ¯ ( α ) Σ t ( α ) ] ; h 3 ( i ) ( α ) σ s ( i ) [ Σ a ( α ) ξ ¯ ( α ) Σ t ( α ) ] .
The second-order sensitivities stemming from the first-order sensitivities represented by Equation (195) are obtained from the first G-differential of this equation, which has the following expression, by definition, for each i = 1 , ... , M ; j = 1 , 2 , 3 :
δ [ R ( φ ; f ) / a j ( i ) ] { 0 u t h a ( 1 ) ( u ) δ ( u ) d u } α 0 { d d ε [ g j ( i ) ( α 0 + ε δ α ) ] } α 0 , ε = 0 + { g j ( i ) ( α 0 ) d d ε 0 u t h [ a ( 1 ) ( u ) + ε δ a ( 1 ) ( u ) ] δ ( u ) d u } α 0 , ε = 0 { 0 u t h a ( 1 ) ( u ) φ ( u ) d u } α 0 { d d ε [ h j ( i ) ( α 0 + ε δ α ) ] } α 0 , ε = 0 { h j ( i ) ( α 0 ) d d ε 0 u t h [ a ( 1 ) ( u ) + ε δ a ( 1 ) ( u ) ] [ φ ( u ) + ε v ( 1 ) ( u ) ] d u } α 0 , ε = 0 = { δ [ R ( φ ; f ) / a j ( i ) ] } d i r + { δ [ R ( φ ; f ) / a j ( i ) ] } i n d = n = 1 T P { 2 R ( φ ; f ) α n a j ( i ) δ α n } α 0 ,
where:
{ δ [ R ( φ ; f ) / a j ( i ) ] } d i r { 0 u t h a ( 1 ) ( u ) δ ( u ) d u } α 0 n = 1 T P { g j ( i ) ( α ) α n } α 0 δ α n { 0 u t h a ( 1 ) ( u ) φ ( u ) d u } α 0 n = 1 T P { h j ( i ) ( α ) α n } α 0 δ α n ,
{ δ [ R ( φ ; f ) / a j ( i ) ] } i n d { g j ( i ) ( α 0 ) 0 u t h δ a ( 1 ) ( u ) δ ( u ) d u } α 0 { h j ( i ) ( α 0 ) 0 u t h [ a ( 1 ) ( u ) v ( 1 ) ( u ) + φ ( u ) δ a ( 1 ) ( u ) ] d u } α 0 .
The direct-effect term can be computed immediately, since all quantities are known. The indirect-effect term can be computed, in principle, after solving the 2nd-LVSS to obtain the 2nd-level variational sensitivity function v ( 2 ) ( 2 ; u ) [ v ( 1 ) ( u ) , δ a ( 1 ) ( u ) ] . As has been repeatedly discussed in the foregoing, solving the 2nd-LVSS is expensive computationally, so this variational function is replaced in the expression of the indirect-effect term by a 2nd-level adjoint sensitivity function, by following the same steps as outlined in Section 5.2.1. Since there are 3 M first-order sensitivities of the form R ( φ ; f ) / a j ( i ) , i = 1 , ... , M ; j = 1 , 2 , 3 , there will be 3 M distinct 2nd-level adjoint sensitivity functions, one corresponding to each first-order sensitivity. These 3 M distinct 2nd-level adjoint sensitivity functions will be the solutions of the corresponding 3 M distinct 2nd-Level Adjoint Sensitivity Systems (2nd-LASS). Each of these 3 M 2nd-LASS will have a distinct source-term on the right-side (each distinct source stemming from the corresponding first-order sensitivities of the form R ( φ ; f ) / a j ( i ) ), but all of these 3 M 2nd-LASS will have the same left-sides, which will have the same form as the left-side of the 2nd-LASS needed previously, in Subsection 5.2.1 for the computations of the 2nd-order sensitivities of the response with respect to the components of the feature functions, cf. Equation (164), (176), and (186). Since the left-sides of these 2nd-LASS represent the (differential) operators that need to be inverted, the actual inversion of these operators needs to be performed once only, and the inverted operator should be stored; subsequently, the inverted operator can be used 3 M times, operating on 3 M distinct source terms, to compute the respective 3 M distinct 2nd-level adjoint sensitivity functions.

5.2.2.2. Second-Order Sensitivities Stemming from the First-Order Sensitivities with Respect to the Source Properties

The expressions of the first-order sensitivities of the detector response with respect to the parameters that characterize the source which emits the neutrons into the medium are provided in Eqs. (146)‒(150). These expressions have the following generic form:
R ( φ ; f ) b k ( i ) = ω k ( i ) ( α ) 0 u t h a ( 1 ) ( u ) δ ( u ) d u ; i = 1 , ... , 5 ; k = 1 , 2 ,
where:
b k ( 1 ) λ k S ; ω k ( 1 ) ( α ) N k S F k S ν k S W k S ξ ¯ ( α ) Σ t ( α ) ; k = 1 , 2 ;
b k ( 2 ) N k S ; ω k ( 2 ) ( α ) λ i S F k S ν k S W k S ξ ¯ ( α ) Σ t ( α ) ; k = 1 , 2 ;
b k ( 3 ) F k S ; ω k ( 3 ) ( α ) λ i S N k S ν k S W k S ξ ¯ ( α ) Σ t ( α ) ; k = 1 , 2 ;
b k ( 4 ) ν k S ; ω k ( 4 ) ( α ) λ i S N k S F k S W k S ξ ¯ ( α ) Σ t ( α ) ; k = 1 , 2 ;
b k ( 5 ) ν k S ; ω k ( 5 ) ( α ) λ i S N k S F k S ν k S ξ ¯ ( α ) Σ t ( α ) ; k = 1 , 2.
The second-order sensitivities stemming from the first-order sensitivities represented by Equation (202) are obtained from the first G-differential of this relation, which has the following expression, by definition, for each i = 1 , ... , 5 ; k = 1 , 2 :
δ [ R ( φ ; f ) / b k ( i ) ] { 0 u t h a ( 1 ) ( u ) δ ( u ) d u } α 0 { d d ε [ ω k ( i ) ( α 0 + ε δ α ) ] } α 0 , ε = 0 + { ω k ( i ) ( α 0 ) d d ε 0 u t h [ a ( 1 ) ( u ) + ε δ a ( 1 ) ( u ) ] δ ( u ) d u } α 0 , ε = 0 = = { δ [ R ( φ ; f ) / b k ( i ) ] } d i r + { δ [ R ( φ ; f ) / b k ( i ) ] } i n d = n = 1 T P { 2 R ( φ ; f ) α n b k ( i ) δ α n } α 0 ,
where:
{ δ [ R ( φ ; f ) / b k ( i ) ] } d i r { 0 u t h a ( 1 ) ( u ) δ ( u ) d u } α 0 n = 1 T P { ω k ( i ) ( α ) α n } α 0 δ α n ,
{ δ [ R ( φ ; f ) / b k ( i ) ] } i n d { ω k ( i ) ( α 0 ) 0 u t h δ a ( 1 ) ( u ) δ ( u ) d u } α 0 .
The direct-effect term can be computed immediately, since all quantities are known. The indirect-effect term can be computed, in principle, after solving the 2nd-LVSS to obtain the 2nd-level variational sensitivity function v ( 2 ) ( 2 ; u ) [ v ( 1 ) ( u ) , δ a ( 1 ) ( u ) ] , but this path is expensive computationally, so this variational function is replaced in the expression of the indirect-effect term by a 2nd-level adjoint sensitivity function, by following the same steps as outlined in Section 5.2.1. Since there are 10 first-order sensitivities of the form R ( φ ; f ) / b k ( i ) , i = 1 , ... , 5 ; k = 1 , 2 , there will be 10 distinct 2nd-level adjoint sensitivity functions, one corresponding to each first-order sensitivity. Thus, there will be 10 distinct 2nd-Level Adjoint Sensitivity Systems (2nd-LASS) to be solved, each having a distinct source-term on the right-side, but all of them having the same left-sides as the left-side of the 2nd-LASS needed previously, in Subsection 5.2.1 for the computations of the 2nd-order sensitivities of the response with respect to the components of the feature functions, namely Eqs. (164), (176), and (186).

5.2.2.3 Second-Order Sensitivities Stemming from the First-Order Sensitivities with Respect to the Detector Properties

The expressions of the first-order sensitivities of the detector response with respect to the detector’s material properties (i.e., microscopic cross section and atomic number density) are provided in Eqs. (144) and (145). These expressions have the following generic form:
R ( φ ; f ) ζ ( i ) = c ( i ) ( α ) 0 u t h φ ( u ) δ ( u u d ) d u ; i = 1 , 2 ,
where:
ζ ( 1 ) = N d ; ζ ( 2 ) = σ d ; c ( 1 ) ( α ) = σ d ; c ( 2 ) ( α ) = N d .
The second-order sensitivities stemming from the first-order sensitivities represented by Equation (211) are obtained by determining the first G-differential of this relation, which has the following expression, by definition, for each i = 1 , 2 :
δ [ R ( φ ; f ) / ζ ( i ) ] { 0 u t h φ ( u ) δ ( u u d ) d u } α 0 { d d ε [ c ( i ) ( α 0 + ε δ α ) ] } α 0 , ε = 0 + { c ( i ) ( α 0 ) d d ε 0 u t h [ φ 0 ( u ) + ε v ( 1 ) ( u ) ] δ ( u u d ) d u } α 0 , ε = 0 = { δ [ R ( φ ; f ) / ζ ( i ) ] } d i r + { δ [ R ( φ ; f ) / ζ ( i ) ] } i n d = n = 1 T P { 2 R ( φ ; f ) α n ζ ( i ) δ α n } α 0 ,
where:
{ δ [ R ( φ ; f ) / ζ ( i ) ] } d i r { 0 u t h φ ( u ) δ ( u u d ) d u } α 0 n = 1 T P { c ( i ) ( α 0 ) α n } α 0 δ α n ,
{ δ [ R ( φ ; f ) / ζ ( i ) ] } i n d { c ( i ) ( α 0 ) 0 u t h v ( 1 ) ( u ) δ ( u u d ) d u } α 0 .
The direct-effect term can be computed immediately, since all quantities are known. The indirect-effect term can be computed, in principle, after solving the 2nd-LVSS to obtain the 2nd-level variational sensitivity function v ( 2 ) ( 2 ; u ) [ v ( 1 ) ( u ) , δ a ( 1 ) ( u ) ] , but this path is expensive computationally. As before, this variational function is replaced in the expression of the indirect-effect term by a 2nd-level adjoint sensitivity function, by following the same steps as outlined in Section 5.2.1. Since there are 2 first-order sensitivities of the form R ( φ ; f ) / ζ ( i ) , i = 1 , 2 , there will be 2 distinct 2nd-level adjoint sensitivity functions, one corresponding to each first-order sensitivity. As before, the two 2nd-LASS to be solved have distinct source-terms on their right-sides, but both have the same left-sides as the left-side of the 2nd-LASS needed previously, as in Eqs. (164), (176), and (186).
In summary, the results discussed in Subsection 5.2.2 indicate that computing the 2nd-order sensitivities of the model response directly with respect to the T P 3 M + 12 primary model parameters, α j , by applying the 2nd-CASAM-L methodology requires one large-scale (adjoint) computation for each primary model parameter α j , amounting to a total of number of T P 3 M + 12 large-scale computations for solving the respective 2nd-LASS. All of these 2nd-LASS have the same left-side (which is also the same as needed for computing the 2nd-order sensitivities of the response with respect to the feature functions by applying the 2nd-FASAM-L) but have differing sources on their right-sides. The unmixed second order sensitivities 2 R ( φ ; f ) / α i α i i = 1 , ... , T P 3 M + 12 , are computed just once. The mixed second order sensitivities 2 R ( φ ; f ) / α i α j , i j , are computed twice, involving distinct 2nd-level adjoint sensitivity functions. Therefore, the symmetry property 2 R ( φ ; f ) / α i α j = 2 R ( φ ; f ) / α j α i provides an intrinsic mechanism for verifying the accuracy of the computations of the respective 2nd-level adjoint sensitivity functions.

6. Concluding Discussion

This work has presented the mathematical framework of the “2nd-Order Feature Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems (2nd-FASAM-L)” along with an illustrative application to a paradigm model of energy slowing-down of neutrons in an infinitely large homogeneous mixture of materials, as found in many energy-related systems. It has been shown that the 2nd-FASAM-L is the most efficient methodology for computing exactly the first- and second-order sensitivities of model responses with respect to the features (functions) of model parameters and, subsequently, to the primary model parameters themselves. This efficiency stems from the maximal reduction of the number of adjoint computations (which are “large-scale” computations) which are needed for obtaining these sensitivities. In the extreme case when the model presents no features (functions) of the primary model parameters, the 2nd-FASAM-L reduces to the 2nd-CASAM-L (“2nd-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems) developed by Cacuci [27]. Comparing the mathematical framework of the 2nd-FASAM-L methodology to the framework of the 2nd-CASAM-L methodology indicates the following commonalities and distinctions:
  • The components f i ( α ) , i = 1 , ... , T F , of the “feature function” f ( α ) [ f 1 ( α ) , ... , f T F ( α ) ] play within the 2nd-FASAM-L the same role as played by the components α j , j = 1 , ... , T P , of the “vector of primary model parameters” α ( α 1 , , α T P ) within the framework of the 2nd-CASAM-L. Notably, the total number of model parameters is always larger (usually by wide margin) than the total number of components of the feature function f ( α ) , i.e., T P T F .
  • The 1st-FASAM-L and the 1st-CASAM-L methodologies require a single large-scale “adjoint” computations for solving the 1st-LASS (1st-Level Adjoint Sensitivity System), so they are similarly efficient for computing the exact expressions of the first-order sensitivities of a model response to the model’s uncertain parameters, boundaries, and internal interfaces, with a slight computational advantage towards the 1st-FASAM-L, which requires only T P quadratures, as opposd to T F quadratures required by the 1st-CASAM-L methodology.
  • For computing the exact expressions of the second-order response sensitivities with respect to the primary model’s parameters, the 2nd-FASAM-L methodology requires as many large-scale “adjoint” computations as there are “feature functions of parameters” f i ( α ) , i = 1 , ... , T F , for solving the left-side of the 2nd-LASS with T F distinct sources on its right-side. By comparison, the 2nd-CASAM-L methodology requires T P large-scale computations for solving the same left-side of the 2nd-LASS but with T P distinct sources. Since T F T P , the 2nd-FASAM-L methodology is considerably more efficient than the 2nd-CASAM-L methodology for computing the exact expressions of the second-order sensitivities of a model response to the model’s uncertain parameters, boundaries, and internal interfaces.
  • Both the 2nd-FASAM-L and the 2nd-CASAM-L methodologies are formulated in linearly increasing higher-dimensional Hilbert spaces −as opposed to exponentially increasing parameter-dimensional spaces− thus overcoming the curse of dimensionality in sensitivity analysis of nonlinear systems. Both the 2nd-FASAM-L and the 2nd-CASAM-L methodologies are incomparably more efficient and more accurate than any other methods (statistical, finite differences, etc.) for computing exact expressions of response sensitivities (of any order) with respect to the model’s uncertain parameters, boundaries, and internal interfaces.
Ongoing work aims at generalizing the 2nd-FASAM-L methodology to enable the exact and most efficient computation of response sensitivities of arbitrarily-high (nth-) order with respect to features (functions) of model parameters, thus becoming the companion for ‒and most efficient alternative to‒ the nth-CASAM-L methodology [27], whenever the model comprises features (functions) of model parameters.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kramer, M.A.; Calo, J.M.; Rabitz, H. An improved computational method for sensitivity analysis: Green’s Function Method with “AIM”, Appl. Math. Modelling, 1981, 5, 432–442. [Google Scholar] [CrossRef]
  2. Cacuci, D.G. Sensitivity theory for nonlinear systems: I. Nonlinear functional analysis approach. J. Math. Phys, 1981, 22, 2794–2802. [Google Scholar] [CrossRef]
  3. Dunker, A.M. The decoupled direct method for calculating sensitivity coefficients in chemical kinetics, J. Chem. Phys., 1984, 81, 2385–2393. [Google Scholar] [CrossRef]
  4. Bellman, R.E. Dynamic programming. Rand Corporation, Princeton University Press, ISBN 978-0-691-07951-6, USA, 1957. Republished: Bellman, RE (2003) Dynamic Programming. Courier Dover Publications, ISBN 978-0-486-42809-3, USA,.
  5. Iman, R.L.; Helton, J.C.; Campbell, J.E. An approach to sensitivity analysis of computer models, Part 1. Introduction, input variable selection and preliminary variable assessment. J. Quality Technol., 1981, 13, 174–183. [Google Scholar] [CrossRef]
  6. Iman, R.L.; Helton, J.C.; Campbell, J.E. An approach to sensitivity analysis of computer models, Part 2. Ranking of input variables, response surface validation, distribution effect and technique synopsis. J. Quality Technol., 1981, 13, 232–240. [Google Scholar] [CrossRef]
  7. Cukier, R.I.; Levine, H.B.; Shuler, K.E. Nonlinear sensitivity analysis of multiparameter model systems. J. Comp. Phys., 1978, 26, 1–42. [Google Scholar] [CrossRef]
  8. Hora, S.C.; Iman, R.L. A Comparison of maximum/bounding and Bayesian/Monte Carlo for fault tree uncertainty analysis, Technical Report SAND85-2839, Sandia National Laboratories, Albuquerque, NM, USA, 1986.
  9. Rios Insua, D. Sensitivity analysis in multiobjective decision making. Springer Verlag, New York, USA, 1990.
  10. Saltarelli, A.; Chan, K.; Scott, E.M. Editors. Sensitivity analysis. J. Wiley & Sons Ltd. Chichester, UK, 2000.
  11. Wigner, E.P. Effect of small perturbations on pile period, Chicago Report CP-G-3048, Chicago, IL, USA, 1945.
  12. Weiberg, A.M.; Wigner, E.P. The Physical Theory of Neutron Chain Reactors. University of Chicago Press, Chicago, Illinois, USA, 1958.
  13. Weisbin, C.R.; et al. Application of sensitivity and uncertainty methodology to fast reactor integral experiment analysis. Nucl. Sci. Eng., 1978, 66, pp. 307. [Google Scholar] [CrossRef]
  14. Williams, M.L. Perturbation Theory for Nuclear Reactor Analysis. In: Ronen, Y. (Ed.) Handbook of Nuclear Reactor Calculations, CRC Press, Boca Raton, Florida, USA, 1986; Volume 3, p 63-188.
  15. Shultis, J.K.; Faw, R.E. Radiation Shielding. American Nuclear Society, La Grange Park, Illinois, USA, 2000.
  16. Stacey, W.M. Nuclear reactor physics. John Wiley & Sons, New York. USA, 2001.
  17. Práger, T.; Kelemen, F.D. Adjoint methods and their application in earth sciences. In: Faragó I, Havasi Á, Zlatev Z (Eds.) Advanced numerical methods for complex environmental models: Needs and availability. Bentham Science Publishers, Oak Park, IL, USA, 2014; Chapter 4A, p.203–275.
  18. Luo, Z.; Wang, X.; Liu, D. Prediction on the static response of structures with large-scale uncertain-but-bounded parameters based on the adjoint sensitivity analysis. Structural and Multidisciplinary Optimization, 2020, 61, 123–139. [Google Scholar] [CrossRef]
  19. Cacuci, D.G. Second-order adjoint sensitivity analysis methodology for computing exactly and efficiently first- and second-order sensitivities in large-scale linear systems: I. Computational methodology. J. Comp. Phys., 2015, 284, 687–699. [Google Scholar] [CrossRef]
  20. Cacuci, D.G. Second-order adjoint sensitivity analysis methodology for large-scale nonlinear systems: I. Theory. Nucl. Sci. Eng., 2016, 184, 16–30. [Google Scholar] [CrossRef]
  21. Cacuci, D.G.; Fang, R. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology (nth-CASAM): Overcoming the Curse of Dimensionality in Sensitivity and Uncertainty Analysis, Volume II: Application to a Large-Scale System, 463 pages. 2023. [Google Scholar] [CrossRef]
  22. Valentine, T.E. Polyethylene-reflected plutonium metal sphere subcritical noise measurements, SUB-PU-METMIXED-001. International Handbook of Evaluated Criticality Safety Benchmark Experiments, NEA/NSC/DOC(95)03/I-IX, Organization for Economic Cooperation and Development (OECD), Nuclear Energy Agency (NEA), Paris, France, 2006.
  23. Alcouffe, R.E.; Baker, R.S.; Dahl, J.A.; Turner, S.A.; Ward, R. PARTISN: A Time-Dependent, Parallel Neutral Particle Transport Code System. LA-UR-08-07258; Los Alamos National Laboratory: Los Alamos, NM, USA, 2008. [Google Scholar]
  24. Conlin, J.L.; Parsons, D.K.; Gardiner, S.J.; Gray, M.; Lee, M.B.; White, M.C. MENDF71X: Multigroup Neutron Cross-Section Data Tables Based upon ENDF/B-VII.1X.; Los Alamos National Laboratory Report LA-UR-15-29571; Los Alamos National Laboratory: Los Alamos, NM, USA, 2013. [Google Scholar]
  25. Chadwick, M.B.; Herman, M.; Obložinský, P.; Dunn, M.E.; Danon, Y.; Kahler, A.C.; Smith, D.L.; Pritychenko, B.; Arbanas, G.; Brewer, R.; et al. (2011) ENDF/B-VII.1: Nuclear data for science and technology: Cross sections, covariances, fission product yields and decay data. Nucl. Data Sheets, 2011, 112, 2887–2996. [Google Scholar] [CrossRef]
  26. Wilson, W.B.; Perry, R.T.; Shores, E.F.; Charlton, W.S.; Parish, T.A.; Estes, G.P.; Brown, T.H.; Arthur, E.D.; Bozoian, M.; England, T.R.; et al. SOURCES4C: A code for calculating (α,n), spontaneous fission, and delayed neutron sources and spectra. In: Proceedings of the American Nuclear Society/Radiation Protection and Shielding Division 12th Biennial Topical Meeting, Santa Fe, NM, USA, 14–18 April 2002.
  27. Cacuci, D.G. The nth-order comprehensive adjoint sensitivity analysis methodology (nth-CASAM): Overcoming the curse of dimensionality in sensitivity and uncertainty analysis, Volume I: Linear systems, 362 pages, Springer Nature, Cham, Switzerland, 2022. [CrossRef]
  28. Lewins, J. IMPORTANCE: The Adjoint Function, Pergamon Press Ltd., Oxford, UK, 1965.
  29. Stacey, W.M. Variational Methods in Nuclear Reactor Physics, Academic Press, new York, USA, 1974.
  30. Cacuci, D.G. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology (nth-CASAM): Overcoming the Curse of Dimensionality in Sensitivity and Uncertainty Analysis, Volume III: Nonlinear Systems, 369 pages, Springer Nature, Cham, Switzerland, 2023. [CrossRef]
  31. Cacuci, D.G. Computation of High-Order Sensitivities of Model Responses to Model Parameters. II: Introducing the Second-Order Adjoint Sensitivity Analysis Methodology for Computing Response Sensitivities to Functions/Features of Parameters. Energies, 2023, 16, 6356. [Google Scholar] [CrossRef]
  32. Meghreblian, R.V.; Holmes, D.K. Reactor Analysis, McGraw-Hill, New York, USA, 1960.
  33. Lamarsh, J.R. Introduction to Nuclear Reactor Theory, Adison-Wesley Publishing Co., Reading MA, USA, 1966; pp. 491–492.
  34. Hetrick, D.L. Dynamics of Nuclear Reactors, American Nuclear Society, Inc., La Grange Park, IL., USA, 1993; pp. 164–174.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated