Model orchestration refers to the assembly of models, data, configurations, and runtime environment that the current model according to the dependencies between model service packages. The open sharing of geospatial models significantly improves the efficiency of model integration and publication. However, owning to the large number of resources, wide sources and different structures, the discovery of high-quality resources is extremely challenging. In this section, we design a prioritization-based orchestration method , which consists of the pre-selection sub-method and priority-selection sub-method . The method can proactively discover optimal resources, thus significantly reducing labor costs.
3.2.1. Priority-selection of dependent resources
Most early orchestration methods employed weighted a sum of QoS attributes to evaluate and screen resources. These methods are simple and feasible but ignore the relationship between multi-dimensional attributes and the impact of attribute interactions on resource discovery. This may result in higher rankings for resources with a single better attribute. To this end, we propose an improved two-stage prioritization-based orchestration method
, as shown
Figure 5.
The proposed method initially retrieves resources (models, data storages) from the resource instance catalog using the CapID of dependent resources. The retrieval results are marked as candidate resource sets, including the candidate model service instance set and the candidate data storage instance set . Owning to the large number of candidate resource instances with varying performance, the following two-step screening was performed.
Step 1: Dependency pre-selection
. Abnormal instances in the candidate resource sets were eliminated using an isolation forest (iForest) [
24]. This can reduce the risk of participation of abnormal instances in orchestration. For model service instances, we construct an isolation forest using metrics such as service running time (SRT), number of service reboots (NSR), number of service migrations (NSM), and number of service updates (NSU). The feature space is given by Formula (1).
For data storage instances, we adopt metrics such as the storage quota (SQ), storage utilization (SU), number of host reboots (NHR), and number of storage reboots (NSTR).
The isolation forest acts as an aggregate of isolation trees (iTree). Different isolation trees play different roles as anomaly identification specialists, which identify resource instances with shorter paths as anomalies. Specifically, the isolated forest detects the abnormal resources by introducing an outlier function, as shown in Formula (2).
where
is the mathematical expectation of the depth of leaf node
in an isolated forest subtree.
is the average depth of isolated trees containing
sample data in an isolated forest,
is the Euler constant. When
, that is, when the evaluation score is close to 1, the resource is judged to be abnormal. We evaluate and mark the abnormal status of resources in the candidate resource sets using Formula (2), with “1” indicating normal and “-1” otherwise. Furthermore, the pre-selected service instance set
and the pre-selected data instance set
were obtained after removing abnormal resources.
Step 2: Dependency priority-selection
. Owing to the large number of pre-selected resources in
and
, prioritization is required to further filter out the optimal resources. We propose an improved multi-criteria decision-making method to achieve a more comprehensive resource evaluation. The method objectively assigns weights to the decision-making factors of TOPSIS [
25] by entropy weighting method, as shown in Formula (3). This can eliminate the influence of traditional subjective weights on the accuracy evaluation of resources.
where
denotes the resource decision matrix of dimension
and its row vector is the resource decision vector
.
is the number of pre-selected resources,
is the dimension of the resource decision vector.
is a decision weight vector with dimensions of
. Vector element
denotes the weight coefficients of the decision factors.
denotes the information entropy of decision factor
,
is the normalized probability,
is an element of vector
. c denotes the resource evaluation vector with dimensions
. Element of
represents the resource evaluation value.
In terms of service resources, we focus on four types of QoS attributes as decision factors, as shown in Formula (4).
where
denotes the average response time of model service
i under the concurrent access.
denotes the service throughput, that is, the number of service requests that are successfully processed. We define the service response time exceeding threshold
as a violation, the service violation rate
is expressed as the ratio of the violation time
to the service response time
, that is,
.
denotes the service error rate, that is, the ratio of error requests under concurrent access to the total number of requests. In terms of data resources, we focus on four types of QoS attributes as decision factors, as shown in Formula (5).
where
,
,
,
denote the storage capacity, number of queries per second, number of transactions per second, and number of disk I/O operations per second, respectively.
Finally, all pre-selected resource instances are prioritized using Formula (3). Resource instances with the highest priority rankings are selected for orchestration. The access addresses of these priority-selected resources are injected into the target geospatial models.