2. Discretization in System State Space
Among new directions in distributed systems research connections between distributed systems theory on the one hand, and canonical problems in turbulence and statistical mechanics on the other could be suggested. In one class of problems, spatio-temporal dynamical analysis clarifies old and complex questions in the theory of shear flow turbulence. In another class of problems, structured, distributed control design exhibits dimensionality-dependence and phase transition phenomena similar to those in statistical mechanics.
Figure 1.
Mesoscale system structure can occur as a result of external influence at the system with microscale structure. It exhibits the properties of both micro- and macroscale structure. On the one hand, the clusters at meso-scale can be considered as a set of different structures, as single particles at microscale but of larger size. On the other hand each cluster could be viewed as a rigid structure, as a macroscale object but of smaller size.
Figure 1.
Mesoscale system structure can occur as a result of external influence at the system with microscale structure. It exhibits the properties of both micro- and macroscale structure. On the one hand, the clusters at meso-scale can be considered as a set of different structures, as single particles at microscale but of larger size. On the other hand each cluster could be viewed as a rigid structure, as a macroscale object but of smaller size.
Assume that (unidirectional) time
t is introduced, and consider non-isolated systems consisting of elements. Evolution of each system over time is determined by the current states of both the system itself and other elements of the system. The evolution is also affected by external disturbing influences
W (the absence of influence can be interpreted as zero impact). External influences
W can be formally included in the general set of system states. The inclusion of
W into of the system state can significantly complicate the descriptive model. External influences
W naturally fall into two groups:
controlled ones
u (or simply, control) and uncontrolled ones
w.
It is usually assumed that at time t the system state
is finite-dimensional, and to describe the dynamics of the system a system of differential equations is used
with some functions
and external disturbances
W. In models of complex systems of this type, consisting of a huge number of components, it is customary to assume a large dimension
n of state space. But, on the one hand, the choice of the threshold for
n significantly limits the “upper bound” of complexity, and, on the other hand, does not allow to take possible “flexibility” of the system into account during its changing process. We will assume that the system consists of a continuum of elements
, parameterized by
, and the evolution of each of the elements is described by equation
Such setups can occur, for instance, in stochastic games with many players [
3] and mean-field games with almost infinitely many players [
4]. Additionally, we assume that the external influence
W for all its arbitrariness at each moment of time
k has a structure
of finite order. Finite structure of external influence after some transition process causes discretization of spatial elements (clustering) in the considered complex system
Discretization occurs due to the self-organization of groups of elements and their synchronization. For clusters
, a set of
variables
averaged over cluster
is naturally introduced. Set of
,
could be generalized as a set of some integrals over the clusters. Such approach is usually applied to simplify physical models (dimension reduction). The general integral characteristics of clusters of elements with similar properties are introduced and dynamic models in reduced state spaces are considered. Experiments show that such simplifications are justified and often give good results.
Typically, the process of clustering (self-organization) in a system is not “one-shot”, but is constantly reproduced due to changes in external influences and critical changes in internal states. But at the same time we will assume that a change in the structure of external influences does not occur permanently, but in some time instants This leads to the necessity to consider the dynamic processes under condition of state space structure change over time.
So, when the structure of external influence changes, the discretization of spatial elements may change. Let us assume that this transient process takes a duration of time no longer than some
(see
Figure 2). We will assume that
is many times smaller than the intervals between successive changes in the structure of external influences:
In addition to discretizing spatial variables we obtain time sampling, neglecting duration of transient process intervals. After such discretization in many practical applications the system of differential equations describes the dynamics of changes in the original complex system over the time interval from
to
:
where
is aggregated state
or
from cluster
,
,
is a finite set of current parameters at time interval
.
4. Parameter Identification Problem
Estimation of system parameters values can be formulated as an optimization problem. The discrepancy between estimated and real parameter values could be expressed in terms of some loss function value and thus to solve the system identification problem one has to minimize given loss function. To identify the system structure it is required to choose such control formation strategy , which minimizes some loss function.
Suppose that for a given parameter vector
the optimal control strategy
is known. After substituting it in (
8), we get the problem of minimizing the function
when observing its values against under the influence of noise
. Under the assumptions made, the minimum of the function
is reached when
. To solve the formulated problem, the method from [
5] can be used.
The distributed optimization task can be formulated in terms of finding the method of constructing cluster (meso-) control, in which the same control action is applied to all elements of the cluster. In this case, the discretized loss function can be represented as a distributed functional
This problem could be solved by appliying the stochastic optmimization type method from [
15].
The considered problem setting is a particular case of more general problem of minimizing a non-stationary differentiable function with respect to
. Let
be the
-algebra of all probabilistic events which happened during time interval
before start of time interval
n. Hereinafter
is a symbol of the conditional mathematical expectation with respect to the
-algebra
,
is a symbol of the mathematical expectation. The minimum point
of function
needs to be estimated.
More precisely, using the observations and inputs , construct an estimate of an unknown vector minimizing the time-varying mean-risk functional.
4.1. Assumptions
Let us formulate Assumptions about disturbances and functions .
For , the successive differences of observation noise are bounded: , or if a sequence is random, where , are observation noises occurred during the same time interval n but at different time instants.
Functions have unique minimum points and with a constant . Here and further is a scalar product of two vectors.
The gradient is uniformly bounded in the mean-squared sense at the minimum points
the gradient
satisfies the Lipschitz condition:
with a constant
.
random vector does not depend on , random vectors do not depend on ; if are random variables, then also do not depend on .
Using available observations, it is necessary to construct a sequence of estimates of the unknown vector that minimizes the function . To solve the problem, we will use an iterative algorithm with two measurements.
Let the trial simultaneous disturbance
be an observable (set or user-controlled) sequence of independent random vectors with known distribution functions
— and specified vector functions
, satisfy the conditions
4.2. Algorithm
Let’s choose an arbitrary initial estimate vector
and scalar parameters
,
for an iterative algorithm
4.3. Main Result
Let us introduce the following notation:
Here
,
could be chosen to satisfy inequation
and
and
chosen to satisfy following conditions:
Theorem 1.
Let Assumptions 1–5 and conditions for kernels (9)-(10) and α (12) be satisfied. Set , choose interval size parameter k
Proof.
To analyze the convergence of the algorithm (
11) estimates, a method from [
10] is used. Choose
as Lyapunov function. To prove the theorem (
13) is true it is sufficient to show the following six propositions are satisfied.
Proposition 1.
The iterative process , which defines the direction of the estimate change, is a Markov process, i.e. the distribution of the random vector depends only on and n:
For algorithm (
11) we have
where the right hand side depends only on
and n in the sense that
does not depend on any other random variables. Proposition is true.
Proposition 2.
, , has first-order derivative, and its gradient satisfies Lipshitz condition:
The proposition is valid due to the choice of Lyapunov function (
14).
Proposition 3.
Pseudo-gradient condition:
At first consider
. Due to (
11) pseudo-gradient (
15) after using Assumption 5 becomes
Consider the expression under mathematical expectation. Using Taylor series representation it could be written as:
Estimate absolute value of the sum of integral elements in the obtained expression. After using (
9), Assumption 4 we get
Substitute the estimate, elements containing gradients of
and (
14) into (
16), regard the relation
and Cauchy–Bunyakovsky–Schwarz inequality:
Apply Assumption 2 and estimate
Proposition is true for
.
Using (
10), Cauchy–Bunyakovsky–Schwarz inequality, Assumptions 1 and 3 it could be shown that
Proposition holds true with
,
.
Proposition 5.
Proposition is valid due to arbitrariness of initial approximation choice and an assumption regarding final order of the external disturbance W affecting the system and thus the final order of the system state vector.
Proposition 6.
;
The first inequality could be met by choice of α and the second one is true since ν is constant. Proposition is true.
The fulfillment of the given propositions allow to prove the theorem using the result in [
10]. □
Remark 1. After the algorithm converges the parameter estimates continue to fluctuate around the true parameter value until the new change of system structure and value of .
Remark 2.
The most common problem setup for unknown-but-bound disturbances in existing works is formulated as follows. It is required to minimize the value of the objective function with some adversarial deterministic noise such that and ):
The considered setup implies that external noise depends on system state which is not a direct function of θ but rather is affected by values of during some time period
Value is formed on basis of previous values of θ.
5. Application for Orientation Improvement of Radio Telescope’s Elements
For astronomers, an urgent task was to obtain images of objects that can’t be recorded using optical telescopes, which are situated on earth or in space. This problem was largely solved using radio telescopes [
11]. The main tasks of the telescope are: to collect radiation that falls on the mirror system with minimal losses, and also obtain the most accurate image of the object [
6]. There are various ways to solve this problem, consider for example, [
8]. One of them is improving the quality of the device [
12], which collects radiation for obtaining images. Another one is combining radio telescopes into systems [
9]. If the radiation is collected with significant errors, then the image will have disturbances [
27].
An important and time-consuming part of image acquisition is the precise tuning of the radio telescope antenna (or systems consisting of such antennas) [
13]. The quality of the image obtained on a radio telescope directly depends on the quality of the construction of a reflecting system of mirrors that focuses the radiation coming from outside. To improve the image quality, it is necessary to focus the radiation of the device in such a way that it works as accurately as possible, especially if it is located in space [
14]. Traditional antenna tuning algorithms are sufficient for the task. However, they lose their effectiveness under uncontrolled unpredictable external influences. We consider the case when these are deformations of the radio telescope shields that arise due to environmental influences such as temperature changes, wind and other influences.
In practice, radiation is subject to various distorting influences, and as a result the quality of the observed image decreases, despite the presence of various stabilizers and filters [
11]. One of the ways to solve such a problem is using randomized stochastic optimization algorithms [
15]. A method for improving image quality by improving the tuning characteristics of the radio telescope mirror system is considered in [
25].
In an ideal antenna system, the signal is reflected from different mirror plates and assembled at one point. Deformations of radio telescope structures, external temperature, wind, mechanical influences lead to deviation of the optical path lengths of the rays from the required ones. As a result, the focus point on the plane of the receiver shifts [
12]. To improve reflection accuracy on the surface of radio telescope mirrors, the following methods are used: autocollimation [
22], telescope calibration by spectral density radiation flux, synchronous calibration method [
21], laser geodetic measurements [
6], improvement of the kinematics of antenna elements [
19], radio holography physical method [
24]. The goal is to develop a stochastic optimization type algorithm to improve the quality of the settings of the radio telescope mirror system model which could be used in a system similar to Radioastron [
20,
26]. The main criterion for efficiency is the recording power of the desired signal and the time required for adjusting the parameters.
The antenna segments can be set to the optimal position to improve the quality of image recording. Consider an irradiator (radiation generator), a receiver and a mirror system of a radio telescope, consisting of identical plates that reflect the incoming signal ( in the RATAN-600 installation). Radiation is created in the irradiator, which, falling on the plates and focused in the receiver. Let’s divide the number axis into time intervals of duration , starting from some moment , where is the k-th time interval k is the index of the time interval. Assume that we know:
1) the position (orientation) of each i-th plate, which is specified by the vector of parameters , where is the rotation angle of the i-th reflective element horizontally; is the vertical rotation angle, is the forward horizontal displacement of the i-th reflecting element. Let be a vector that contains all parameters of the mirror system in a given time interval k,
2) radiation coming from each mirror (),
3) the common signal coming from all mirrors to the receiver ,
4) characteristics of the signal in the generator.
The front of a signal is the sum of harmonics that comes to us with different phases from a certain direction. The perfect placement of the plates brings all radiation from the objects into focus. We obtain the signal as a sum of sines with phases. Different
arrive at different times with different phases. Let us evaluate the difference between the signals from an ideal antenna and from a real one (with deformations). Signals reflected from ideal mirror segments will have the same phase
(
). The signals from segments with deformations will look like this:
The objective function of the problem
is the signal power) is defined as follows:
It is required to maximize the objective function:
We consider the problem of optimizing the position of mirrors in the limit over time (not over a specific time interval).
The reflective elements of the antenna are made exactly the same, so they provide equivalent observations in all directions.
At the same time, if one moves along an ideal reflective surface, its local characteristics change. Therefore, a real reflective surface composed of identical elements will repeat deviations from the ideal surface from element to element. These deviations will be greater, if the shape of the surface of the element differs from the shape of that portion of the ideal surface which this element should represent. If the size of the reflective elements increases, then the deviations naturally increase. These deviations are an error distributed over the reflector of a variable profile antenna, which, at large values, creates unacceptable distortions reducing the efficiency of the antenna. We will call the signal power measurement errors arising due to unknown and uncontrolled deformations in the reflective elements of the antenna caused by weather, wind, and temperature changes.
After conventional procedures for adjusting the inclination angles and positions of the radio telescope mirrors, we obtain an initial approximation
to the optimal tuning values. Then there is still the possibility of “tuning” in a certain neighborhood
T containing
and the optimal value
corresponding to the maximum power of the received signal. We use a stochastic optimization algorithm with two measurements per iteration, which allows us to reduce the negative impact of various disturbances on the power of the recorded signal [
7]:
Select
We sequentially generate vectors with components from +/-1, which are chosen with equal probability.
We measure the power values for two positions of the antenna system:
and
Measurements are obtained with noise
and
:
Next, we form the following estimate
according to the rule:
where
are the parameters of the algorithm,
is the projection onto the set
.
To construct the kernels and on the interval orthogonal Legendre polynomials could be used. In this case, for initial values (i.e. ) the type of kernels is as follows:
for (i.e. ):
and for both functions are equal to zero.
The test disturbance is formed in such a way that random vector does not depend on and
The main element that reflects the incoming signal is the segment of the mirror system. It is important to configure these shields so that the signal comes into focus. The system allows to customize the required dimensions of the reflective shield and its curvature. Stochastic optimisation algorithm (
11) can be applied for tuning a system which consists of a large number of telescopes in space under conditions of interference and also when individual elements of the system are deformed. About 3000 parameters of the mirror surfaces are adjusted, during focusing of the mirror system.The faster setting up of the mirrors is very important for the accuracy of observations. For acceleration, it is proposed to use the Nesterov acceleration method in distributed form [
16,
17,
18]. In
Figure 3 on page 11, the dependence of the signal power on the number of iterations of the algorithm is given.