1. Introduction
Complex networks can effectively represent many real-world systems, such as the Internet, social networks, power grids, and so on. Most networks are beneficial to people and bring many positive effects. However, some networks also have negative effects, the most prominent examples being terrorism and disease transmission networks [1,2]. Whether beneficial or harmful, these networks substantially influence the functioning and development of our society. In recent decades, the study of diverse complex networks has gained significant attention from researchers across various fields such as computer science, statistical physics, systems engineering, and applied mathematics [3,4,5,6,7]. One prominent topic in these studies is the error and attack tolerance of complex networks [8,9,10,11,12,13,14,15,16], a concept referred to as robustness within the context of this paper.
The robustness of a network refers to its ability to keep functioning when some of its components, such as nodes or edges, malfunction due to random failures or malicious attacks [12,17,18]. The study of network robustness is valuable from two main perspectives. On the one hand, the failure of components can lead to the breakdown of beneficial networks and result in significant economic losses; a typical example is the Northeast blackout of 2003 [19,20]. Analyzing network robustness aids in developing methods to enhance it. On the other hand, for harmful networks, such as terrorist networks [21] or COVID-19 transmission networks [22], analyzing their robustness assists in developing effective attack strategies to dismantle them. Therefore, analyzing network robustness is of great importance.
To analyze the robustness of a network, it is necessary to choose a suitable metric to evaluate how robust the network is. Since almost all network applications are typically designed to operate in a connected environment [23], network connectivity is selected as the primary indicator to assess network robustness in this study.
The robustness of a network depends not only on its structural features but also on the mechanisms of random failures or malicious attacks. In random failures, nodes or edges are attacked with equal probability, while malicious attacks target nodes or edges in decreasing order of their importance. Typically, random failures are less severe than malicious attacks [24,25]. Evaluating the impact of node or edge removal under various malicious attack strategies is a crucial approach to analyzing network robustness. Determining the lower bound of network robustness is critical, as it allows for analysis of network robustness under worst-case scenarios, identification of the most vulnerable components, and development of robustness improvement methods. An effective approach to addressing this issue involves identifying an optimal attack strategy that inflicts maximum damage on the network [26].
Extensive research has been conducted on the robustness of complex networks. Albert et al. [8] studied the robustness of scale-free networks and found that while these networks are robust to random failures, they are extremely vulnerable to malicious attacks. Iyer et al. [9] conducted a systematic examination of the robustness of complex networks by employing simultaneous and sequential targeted attacks based on various centrality measures such as degree, betweenness, closeness, and eigenvector centrality. Fan et al. [10] proposed a deep reinforcement learning algorithm, FINDER, to effectively identify critical network nodes. Wang et al. [11] introduced region centrality and proposed an efficient network disintegration strategy based on this concept, which combines topological properties and geographic structure in complex networks. Ma et al. [12] studied the robustness of complex networks under incomplete information, employing link prediction methods to restore missing network topology information and identify critical nodes. Lou et al. [14] introduced LFR-CNN, a CNN-based approach that uses learned feature representations to predict network robustness and exhibits excellent predictive performance with notably smaller prediction errors.
However, the aforementioned research generally assumes that attacks on the network will always be successful, neglecting the important factor of attack success rate (ASR). In fact, an attack may not succeed in real-world scenarios. For example, even if the enemy forces launch an attack on a target within a military communication network, there is no guarantee of successfully destroying it.
Figure 1 illustrates the main process of network disintegration under varying ASR. Moreover, selecting an optimal attack strategy that inflicts maximal damage on the network is challenging due to the NP-hard nature of this problem [10]. Existing methods often struggle to achieve a desirable balance between effectiveness and computational efficiency.
Therefore, the purpose of this paper is to analyze network robustness when considering ASR under an optimal attack strategy. To achieve this purpose, a novel robustness measure called Robustness-ASR (RASR) is introduced, which utilizes mathematical expectations to evaluate network robustness when considering ASR. In addition, an efficient algorithm called PRQMC is proposed to calculate the RASR for large-scale networks. Furthermore, to assess the lower bound of network RASR, a new attack strategy, named HBnnsAGP, is proposed. The main contributions of this study are as follows:
We introduce and define a novel robustness measure called RASR, which utilizes mathematical expectations to assess network robustness when considering the ASR of each node.
To efficiently calculate the RASR for large-scale networks, we propose the PRQMC algorithm. PRQMC leverages randomized quasi-Monte Carlo (QMC) integration to approximate the RASR with a faster convergence rate and utilizes parallelization to speed up the calculation.
To assess the lower bound of network RASR, we present a new attack strategy, named HBnnsAGP. In HBnnsAGP, a novel centrality measure called BCnns is proposed to quantify the importance of a node.
The experimental results on 6 representative real-world networks demonstrate the effectiveness of the proposed methods compared with the baselines.
The rest of this paper is organized as follows. Section 2 provides an introduction to the preliminaries, including classical centrality measures, traditional network robustness measures, and the principles of Monte Carlo (MC) and QMC integration. Section 3 presents the proposed methods for analyzing network robustness when considering ASR, including the RASR, the PRQMC algorithm, and the HBnnsAGP attack strategy. The experiments and results are presented in Section 4. Finally, Section 5 concludes the paper.
2. Preliminaries
A complex network can be modeled as an unweighted undirected graph $G = (V, E)$, where $V$ and $E$ represent the set of nodes and the set of edges in the network $G$, respectively. The network $G$ can also be represented by an adjacency matrix $A = (a_{ij})_{N \times N}$: $a_{ij} = 1$ if node $i$ and node $j$ are connected, otherwise $a_{ij} = 0$.
2.1. Centrality Measures
The concept of a centrality measure attempts to quantify how important a node is [27]. Here we introduce two classical centrality measures: degree centrality and betweenness centrality.
2.1.1. Degree centrality (DC)
DC is the simplest measure of centrality. The DC of a node is defined by its degree, that is, its number of edges. The DC is formally defined as follows.
Definition 1.
Given a network $G = (V, E)$, let $A = (a_{ij})_{N \times N}$ be the adjacency matrix of the network $G$. The DC of node $i$ is defined as:
$$DC(i) = \sum_{j=1}^{N} a_{ij}.$$
The DC is frequently a reliable and effective measure of a node’s importance. A higher DC value typically signifies a more critical node.
2.1.2. Betweenness centrality (BC)
BC quantifies the number of shortest paths passing through a particular node in a network [28]. BC characterizes the extent to which a node acts as a mediator among all other nodes in a network [27]. Nodes that lie on numerous shortest paths are likely to play a crucial role in information transmission, exhibiting higher BC values. The BC is defined as follows.
Definition 2.
Given a network $G = (V, E)$. The BC of node $v$ in $G$ is defined as:
$$BC(v) = \sum_{s \neq v \neq t} \frac{\sigma_{st}(v)}{\sigma_{st}},$$
where $s, t, v \in V$, $\sigma_{st}$ is the total number of shortest paths from node $s$ to node $t$, and $\sigma_{st}(v)$ is the number of those paths that pass through node $v$. By convention, $\sigma_{st} = 1$ if $s = t$, and $\sigma_{st}(v) = 0$ if $v \in \{s, t\}$.
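As an illustration, both measures can be computed directly with the networkx library; the snippet below is a small sketch on an arbitrary built-in toy graph (not one of the datasets used later):

```python
import networkx as nx

# Build a small toy network (arbitrary example graph).
G = nx.karate_club_graph()

# Degree centrality as defined above: DC(i) = sum_j a_ij (the raw degree).
dc = dict(G.degree())

# Betweenness centrality: fraction of shortest paths through each node.
# networkx normalizes by the number of node pairs by default.
bc = nx.betweenness_centrality(G, normalized=True)

# Rank nodes by each measure (highest first).
top_dc = sorted(dc, key=dc.get, reverse=True)[:5]
top_bc = sorted(bc, key=bc.get, reverse=True)[:5]
print("Top-5 nodes by DC:", top_dc)
print("Top-5 nodes by BC:", top_bc)
```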
2.2. Accumulated Normalized Connectivity
Traditionally, network robustness has been evaluated by calculating the size of the giant connected component (GCC) after the network has endured attacks. The Accumulated Normalized Connectivity (ANC), also known as R, is a well-known measure of network robustness for node attacks [10,17,29]. The ANC is defined as follows.
Definition 3.
For a network $G = (V, E)$ with $|V| = N$. Given an attack sequence of nodes $\mathcal{A} = (v_1, v_2, \ldots, v_N)$, where $v_i$ indicates the $i$th node to be attacked, the ANC of $G$ under this attack sequence is defined as:
$$\mathrm{ANC}(\mathcal{A}) = \frac{1}{N} \sum_{k=1}^{N} \frac{\delta(k)}{\delta(0)},$$
here, $\delta(k)$ is the size of the GCC of the residual network after the sequential removal of the nodes $\{v_1, \ldots, v_k\}$ from $G$, and $\delta(0)$ is the initial size of the GCC of $G$ before any nodes are removed. The normalization factor $1/N$ ensures that the robustness of networks with different sizes can be compared.
A larger ANC value indicates a higher level of network robustness against attacks. Additionally, the ANC can be used to assess the destructiveness of attacks: lower ANC values correspond to more destructive attack strategies. The ANC value can be viewed as an estimate of the area beneath the ANC curve, which is plotted with $k/N$ on the horizontal axis and $\delta(k)/\delta(0)$ on the vertical axis.
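The following sketch computes the ANC of Definition 3 by simulating the sequential node removals; the attack sequence used here (static descending-degree order) is only an assumed example:

```python
import networkx as nx

def anc(G: nx.Graph, attack_sequence) -> float:
    """ANC of Definition 3: average GCC fraction over sequential node removals."""
    H = G.copy()
    gcc0 = max(len(c) for c in nx.connected_components(H))  # delta(0)
    total = 0.0
    for v in attack_sequence:
        H.remove_node(v)
        gcc_k = max((len(c) for c in nx.connected_components(H)), default=0)
        total += gcc_k / gcc0
    return total / G.number_of_nodes()

G = nx.karate_club_graph()
# Example attack sequence: nodes in descending degree order (static, non-adaptive).
seq = sorted(G.nodes(), key=lambda v: G.degree(v), reverse=True)
print("ANC under high-degree attack:", round(anc(G, seq), 4))
```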
2.3. Monte Carlo Integration
Monte Carlo (MC) integration is a numerical technique that is particularly useful for higher-dimensional integrals [30]. Caflisch [31] provides a comprehensive review of this method. The integral of a Lebesgue integrable function $f(x)$ can be expressed as the average or expectation of the function evaluated at random locations. Considering $x$ as a random variable uniformly distributed on the one-dimensional unit interval $I = [0, 1)$, the integration of $f$ over this interval can be represented as follows:
$$I[f] = \int_{0}^{1} f(x)\, dx,$$
in which $dx$ is the probability measure of $x$ on the interval $[0, 1)$; then
$$E[f(x)] = \int_{0}^{1} f(x)\, dx,$$
therefore
$$I[f] = E[f(x)].$$
Similarly, for an integral on the unit hypercube $I^N = [0, 1)^N$ in $N$ dimensions,
$$I[f] = \int_{I^N} f(x)\, dx = E[f(x)],$$
in which $x = (x^{(1)}, \ldots, x^{(N)})$ is a uniformly distributed vector in $I^N$, where $x^{(j)} \in [0, 1)$. Given that the hyper-volume of $I^N$ is equal to 1, $I^N$ can be viewed as the total probability space.
The MC integration method approximates definite integrals utilizing random sampling. It draws $K$ uniform samples from $I^N$, in turn generating a point set $\{x_1, x_2, \ldots, x_K\}$. The empirical approximation of the integral $I[f]$ is then obtained by computing the mean of the $K$ sample outcomes $f(x_k)$, which can be expressed as follows:
$$I_K[f] = \frac{1}{K} \sum_{k=1}^{K} f(x_k).$$
According to the Strong Law of Large Numbers [32], this approximation is convergent with probability 1, that is,
$$\lim_{K \to \infty} I_K[f] = I[f].$$
Figure 2 illustrates the application of the MC integration method in approximating definite integrals over a one-dimensional unit interval. As shown in Figure 2a, MC integration approximates the area under the curve of the integrand by summing the areas of the bars corresponding to the sampled points. The bars are rearranged sequentially to avoid overlap on the X-axis, as shown in Figure 2b.
The error of MC integration is
$$\epsilon_K = I[f] - I_K[f].$$
By the Central Limit Theorem [32], for any $a < b$, we have
$$\lim_{K \to \infty} P\!\left( a < \frac{\epsilon_K}{\sigma(f)\, K^{-1/2}} < b \right) = \int_{a}^{b} \frac{1}{\sqrt{2\pi}}\, e^{-\nu^2 / 2}\, d\nu,$$
where $\nu$ is a standard normal random variable and $\sigma(f)$ is the square root of the variance of $f$, given by
$$\sigma(f) = \left( \int_{I^N} \big( f(x) - I[f] \big)^2\, dx \right)^{1/2}.$$
When $K$ is sufficiently large, we have
$$\epsilon_K \approx \sigma(f)\, K^{-1/2}\, \nu.$$
This implies that the order of the error convergence rate of MC integration is $O(K^{-1/2})$ [33], which means that the integration error decreases at a rate proportional to $K^{-1/2}$ as the total number of samples $K$ increases. That is, “an additional factor of 4 increase in computational effort only provides an additional factor of 2 improvement in accuracy" [31].
In practical applications, the MC integration method draws $K$ uniform samples from an $N$-dimensional pseudo-random sequence (PRS) generated by a computer to obtain the point set $\{x_1, x_2, \ldots, x_K\}$.
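For concreteness, the sketch below estimates a simple one-dimensional integral with plain MC sampling and prints the roughly $O(K^{-1/2})$ error decay; the integrand $e^x$ is an arbitrary illustrative choice:

```python
import numpy as np

rng = np.random.default_rng(0)

def f(x):
    # Arbitrary smooth integrand with a known integral over [0, 1): exact value is e - 1.
    return np.exp(x)

exact = np.e - 1.0
for K in (100, 10_000, 1_000_000):
    x = rng.random(K)            # K pseudo-random samples on [0, 1)
    estimate = f(x).mean()       # I_K[f] = (1/K) * sum f(x_k)
    print(f"K={K:>9,d}  estimate={estimate:.6f}  abs error={abs(estimate - exact):.2e}")
```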
2.4. Quasi-Monte Carlo Integration
Quasi-Monte Carlo (QMC) integration is a method of numerical integration that operates in the same way as MC integration but instead uses a deterministic low-discrepancy sequence (LDS) [34] to approximate the integral. The advantage of using LDSs is a faster rate of convergence: QMC integration has a rate of convergence close to $O(1/K)$, which is much faster than the $O(1/\sqrt{K})$ rate of MC integration [35].
Using the QMC integration method to approximate definite integrals is similar to the MC integration method. This can be expressed as:
$$I_K[f] = \frac{1}{K} \sum_{k=1}^{K} f(x_k),$$
where $\{x_1, x_2, \ldots, x_K\}$ is a point set obtained by taking the first $K$ points from an $N$-dimensional LDS. Each $x_k$ is an $N$-dimensional point, with $x_k = (x_k^{(1)}, \ldots, x_k^{(N)})$ for $k = 1, \ldots, K$, and $x_k^{(j)} \in [0, 1)$ for $j = 1, \ldots, N$.
The error order of the QMC integration can be determined by the Koksma–Hlawka inequality [36,37], that is,
$$\big| I[f] - I_K[f] \big| \le V[f]\, D_K^{*},$$
where $V[f]$ is the Hardy–Krause variation of the function $f$, and $D_K^{*}$ is the star discrepancy of $\{x_1, \ldots, x_K\}$, defined as:
$$D_K^{*} = \sup_{Q} \left| \frac{\#\{x_k \in Q\}}{K} - \lambda(Q) \right|,$$
where $\#\{x_k \in Q\}$ is the number of points of $\{x_1, \ldots, x_K\}$ inside the region $Q$, $\lambda(Q)$ is the Lebesgue measure of region $Q$ in the unit hypercube $I^N$, and the supremum is taken over all axis-aligned boxes $Q \subseteq I^N$ anchored at the origin. For more detailed information, please refer to [31].
For an $N$-dimensional LDS comprising $K$ points, the star discrepancy of the sequence is $O\!\big((\log K)^N / K\big)$. Consequently, for a function $f$ with $V[f] < \infty$, a QMC approximation based on this sequence yields a worst-case error bound (via the Koksma–Hlawka inequality above) converging at a rate of $O\!\big((\log K)^N / K\big)$ [38]. Since $(\log K)^N$ grows much more slowly than $K$, the QMC integration convergence rate approaches $O(1/K)$ for low-dimensional cases [33], which is asymptotically superior to MC.
Figure 3 illustrates the clear differences between MC and QMC integration methods. The subfigures provide a visual representation of their respective point distributions and demonstrate their application for approximating definite integrals over a one-dimensional unit interval. The points generated from an LDS exhibit greater uniformity than the points generated by a PRS. Consequently, with the same number of sampling points, LDS has the ability to uniformly fill the integration space, resulting in a faster convergence rate.
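This behavior can be reproduced with off-the-shelf quasi-random generators; the sketch below compares pseudo-random (PRS) sampling against a Sobol LDS from scipy on the same toy integrand (the sample sizes and integrand are illustrative choices):

```python
import numpy as np
from scipy.stats import qmc

def f(x):
    # Same illustrative integrand as before; exact integral over [0, 1) is e - 1.
    return np.exp(x)

exact = np.e - 1.0
rng = np.random.default_rng(0)

for m in (7, 10, 13):                      # K = 2**m samples
    K = 2 ** m
    x_mc = rng.random(K)                   # pseudo-random sequence (PRS)
    sobol = qmc.Sobol(d=1, scramble=False)
    x_qmc = sobol.random(K).ravel()        # first K points of a 1-D Sobol LDS
    err_mc = abs(f(x_mc).mean() - exact)
    err_qmc = abs(f(x_qmc).mean() - exact)
    print(f"K={K:>5d}  MC error={err_mc:.2e}  QMC error={err_qmc:.2e}")
```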
3. Methods
In this section, we will first introduce the major problem we focus on in this paper. Then, we give the details of the proposed methods for analyzing network robustness when considering ASR, including the RASR, the PRQMC algorithm, and the HBnnsAGP attack strategy.
3.1. Problem Formalization
Typically, it is assumed that removing a node will also remove all of its connected edges. Therefore, in this paper, we only consider node attack strategies.
For a network $G = (V, E)$ with $|V| = N$, a node attack strategy can be represented as a sequence $\mathcal{A} = (v_1, v_2, \ldots, v_N)$, where $v_i$ indicates the $i$th node to be attacked. Given a predefined metric $R(\mathcal{A})$ that measures network robustness against attacks, the primary goal is to evaluate the lower bound of network robustness. Therefore, the objective is to minimize $R(\mathcal{A})$, as presented below:
$$\mathcal{A}^{*} = \arg\min_{\mathcal{A}} R(\mathcal{A}).$$
To achieve this objective, it is crucial to determine the optimal node attack strategy that minimizes $R(\mathcal{A})$.
3.2. The Proposed Robustness Measure RASR
The ANC, as defined in Definition 3, does not consider the ASR; equivalently, it corresponds to the special case where the ASR of each node is 100%. To this end, the proposed robustness measure RASR utilizes mathematical expectations to assess network robustness when considering ASR. Before introducing the RASR, we first present a weighted ANC (named ANCw), which takes into account both the state of the attack sequence and the associated attack cost.
For a network $G = (V, E)$ with $N$ nodes, $\mathcal{A} = (v_1, v_2, \ldots, v_N)$ is an attack sequence, where $v_i \in V$. The state of $\mathcal{A}$ is denoted as a random variable $X = (X_1, X_2, \ldots, X_N)$, where
$$X_i = \begin{cases} 1, & \text{if the attack on node } v_i \text{ succeeds},\\ 0, & \text{otherwise}. \end{cases}$$
Then, the ANCw is defined as follows.
Definition 4.
The ANCw of $G$ under an attack sequence $\mathcal{A}$ with state $X$ is defined as:
$$\mathrm{ANC}_w(X) = \frac{1}{N} \sum_{k=0}^{N} w(v_k)\, \frac{\delta_X(k)}{\delta(0)},$$
here $\delta_X(k)$ is the same as defined in Definition 3, evaluated on the residual network after the first $k$ attack attempts under state $X$. When $k = 0$, it indicates that no nodes have been attacked (and the corresponding weight is taken as 1). $w(\cdot)$ is a weight function, that is,
$$w(v_k) = \begin{cases} 0, & \text{if node } v_k \text{ is isolated in the residual network},\\ 1, & \text{otherwise}. \end{cases}$$
There are two main reasons for using the weighted function $w(\cdot)$. Firstly, it is important for an attacker to choose an optimal attack strategy at a minimum attack cost to efficiently disintegrate the network [11,26]. Secondly, as illustrated in Figure 1, with an increased number of nodes removed, the network will eventually fragment into isolated nodes, thereby losing its functionality as a network. Therefore, this paper sets the attack cost of an isolated node to 0.
Let $P = (p_1, p_2, \ldots, p_N)$ represent the ASR of each node corresponding to $\mathcal{A}$, where $p_i$ represents the ASR of node $v_i$. Assuming that attacks on different nodes are independent, the probability of a state $X = (x_1, \ldots, x_N)$ is
$$P(X) = \prod_{i=1}^{N} p_i^{x_i} (1 - p_i)^{1 - x_i},$$
where $x_i \in \{0, 1\}$.
Based on the above formulas, the proposed RASR can be defined as follows.
Definition 5.
Considering the ASR of each node, the robustness of a network $G$ against an attack sequence $\mathcal{A}$ can be quantified by the RASR, which is defined as:
$$\mathrm{RASR} = E\big[\mathrm{ANC}_w(X)\big] = \sum_{X \in \Omega} P(X)\, \mathrm{ANC}_w(X),$$
where $X$ is a random variable representing the state of $\mathcal{A}$, $\Omega$ is the sample space of $X$, and $E[\mathrm{ANC}_w(X)]$ is the expectation of the ANCw.
In theory, the value of RASR can be calculated exactly from Definition 5 once all the states of $X$ in the sample space $\Omega$ are enumerated. However, this confronts “the curse of dimensionality" [39] when applied to networks with a large number of nodes: the size of $\Omega$ grows exponentially as $2^N$. As a result, the analytical approach becomes infeasible when $N$ is significantly large.
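To make the combinatorial blow-up concrete, the sketch below enumerates all $2^N$ states for a tiny toy graph and accumulates the expectation of Definition 5. The `anc_w` function is a simplified stand-in for the weighted ANC of Definition 4 (it averages GCC fractions over the attack steps, removing a node only when its attack succeeds), so the numbers are illustrative rather than the paper's exact measure:

```python
from itertools import product
import networkx as nx

def anc_w(G: nx.Graph, attack_sequence, state) -> float:
    """Simplified stand-in for ANCw: average GCC fraction over the attack steps,
    removing a node only when its attack succeeds (state bit = 1)."""
    H = G.copy()
    gcc0 = max(len(c) for c in nx.connected_components(H))
    total, N = 0.0, len(attack_sequence)
    for v, success in zip(attack_sequence, state):
        if success:
            H.remove_node(v)
        gcc = max((len(c) for c in nx.connected_components(H)), default=0)
        total += gcc / gcc0
    return total / N

G = nx.path_graph(6)                      # tiny toy network, N = 6
seq = sorted(G.nodes(), key=G.degree, reverse=True)
p = {v: 0.7 for v in G.nodes()}           # assumed uniform ASR of 70%

rasr = 0.0
for state in product((0, 1), repeat=len(seq)):        # 2**N = 64 states
    prob = 1.0
    for v, s in zip(seq, state):
        prob *= p[v] if s else (1.0 - p[v])
    rasr += prob * anc_w(G, seq, state)
print("Exact RASR (toy measure) by enumerating 2^N states:", round(rasr, 4))
```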
3.3. The Proposed PRQMC Algorithm
To efficiently calculate the RASR for large-scale networks, the PRQMC algorithm is proposed, which leverages randomized QMC integration to approximate the RASR with a faster convergence rate and utilizes parallelization techniques to speed up the calculation. In the following, we first introduce the RASR calculation model based on QMC integration and then give the PRQMC algorithm.
3.3.1. RASR Calculation Model Based on QMC Integration
The RASR of a network $G$, as defined in Definition 5, can be expressed using Lebesgue integration based on the principle of MC integration (see Section 2), that is,
$$\mathrm{RASR} = E\big[\mathrm{ANC}_w(X)\big] = \int_{\Omega} \mathrm{ANC}_w(X)\, dP(X),$$
where $X$ denotes a random variable representing the state of an attack sequence $\mathcal{A}$, $\Omega$ is the sample space of $X$, and $P$ is the probability measure of $X$.
Let $P = (p_1, \ldots, p_N)$ represent the ASR of each node corresponding to $\mathcal{A}$, and let $u = (u_1, \ldots, u_N)$ be a uniformly distributed vector in $I^N$, where $u_i \in [0, 1)$. Then, $X$ can be represented as follows:
$$X = (x_1, \ldots, x_N), \qquad x_i = \begin{cases} 1, & \text{if } u_i < p_i,\\ 0, & \text{otherwise}, \end{cases}$$
where $i = 1, \ldots, N$. When the attack sequence $\mathcal{A}$ is determined, the $\mathrm{ANC}_w(X)$ can be represented as a function of $u$, that is,
$$F(u) = \mathrm{ANC}_w\big(X(u)\big).$$
By substituting this function into the expectation above and transforming the integration space from $\Omega$ to $I^N$, we obtain the following expression for RASR:
$$\mathrm{RASR} = \int_{I^N} F(u)\, du.$$
This equation represents the integration of $F(u)$ with respect to the uniform probability measure over the $N$-dimensional unit hypercube $I^N$.
For the given network $G$, the sample space $\Omega$ has a size of $2^N$. Let the states of $X$ be $X_1, X_2, \ldots, X_{2^N}$, where $X_j \in \Omega$. Based on this mapping, the unit hypercube $I^N$ can be divided into $2^N$ regions denoted by $D_1, D_2, \ldots, D_{2^N}$, where region $D_j$ corresponds to state $X_j$, $j = 1, 2, \ldots, 2^N$. Figure 4 illustrates this process for the case when $N = 2$. Then, the integral above can be transformed into:
$$\int_{I^N} F(u)\, du = \sum_{j=1}^{2^N} \int_{D_j} F(u_j)\, du_j,$$
where $u_j$ is a vector uniformly distributed within region $D_j$.
The Lebesgue measure of region $D_j$ in $I^N$, denoted by $\lambda(D_j)$, is equivalent to the probability measure of $X_j$, denoted as $P(X_j)$. Based on the principle of MC integration, we have:
$$\int_{D_j} F(u_j)\, du_j = \mathrm{ANC}_w(X_j)\, \lambda(D_j) = \mathrm{ANC}_w(X_j)\, P(X_j).$$
Combining the above expressions, we obtain:
$$\mathrm{RASR} = \int_{I^N} F(u)\, du = \sum_{j=1}^{2^N} \mathrm{ANC}_w(X_j)\, P(X_j) = E\big[\mathrm{ANC}_w(X)\big].$$
Following the QMC integration method of Section 2.4 and the expression above, the RASR of a network can be approximated. The approximation of RASR, denoted by $\widehat{\mathrm{RASR}}$, is defined as follows.
Definition 6.
Consider a network $G = (V, E)$ with $N$ nodes. Suppose a sequence of nodes $\mathcal{A} = (v_1, \ldots, v_N)$ is targeted for attack, and $P = (p_1, \ldots, p_N)$ signifies the ASR of each node. The RASR of the network $G$ can be approximated by $\widehat{\mathrm{RASR}}$, which is defined as:
$$\widehat{\mathrm{RASR}} = \frac{1}{K} \sum_{k=1}^{K} F(u_k).$$
Here, $\{u_1, \ldots, u_K\}$ represents a set of points obtained from an $N$-dimensional LDS, as in Section 2.4, $K$ is the total number of samples, and $F(\cdot)$ is the function defined by the mapping above.
The error bound of the QMC integral is determined by the star discrepancy of the chosen LDS, making the selection of LDSs important for improving the accuracy of approximations. Two frequently used LDSs are the Halton sequence and the Sobol sequence [40]. In this research, the Sobol sequence is adopted, as it demonstrates better performance in higher dimensions compared to the Halton sequence [41].
3.3.2. Parallel Randomized QMC (PRQMC) Algorithm
Despite the faster convergence rate of the QMC integration method compared to MC integration, it still necessitates a large number of samples to calculate the average value. Furthermore, the calculation of the function $F(\cdot)$, typically done through attack simulations, demands considerable computational resources, especially for large-scale networks [42]. Consequently, the computational process of obtaining $\widehat{\mathrm{RASR}}$ for large-scale networks remains time-consuming. Additionally, due to the deterministic nature of the LDS, the QMC integration method can be seen as a deterministic algorithm, which presents challenges in assessing the reliability of numerical integration results and can potentially lead to being stuck in local optima. In light of these issues, the PRQMC algorithm capitalizes on the benefits of the randomized QMC (RQMC) method and parallelization.
The PRQMC algorithm improves computational efficiency through parallelization. This is because the computational cost of sampling the attack sequence's state is significantly lower than that of computing the function $F(\cdot)$. Therefore, by initially sampling the attack sequence's states and obtaining a sufficient number of samples, it is possible to calculate $\widehat{\mathrm{RASR}}$ by parallelizing the computation of the function $F(\cdot)$ over those samples. This approach effectively accelerates the calculation process by distributing the task across multiple processors or computing nodes.
Additionally, the PRQMC algorithm enhances randomness by randomly sampling points from the LDS, providing unbiased estimation and improved variance-reduction capabilities. This is particularly advantageous in high-dimensional problems, where RQMC often outperforms QMC in terms of accuracy and efficiency [43].
The procedure of the PRQMC algorithm is presented in Algorithm 1, which consists of two main stages: the sampling stage and the parallel stage. In the sampling stage, we first randomly sample $K$ points $\{u_1, \ldots, u_K\}$ from an $N$-dimensional Sobol sequence, then determine the corresponding $K$ states of the attack sequence by comparing the value of each dimension of the sampled points with the ASR of the corresponding node. In the parallel stage, we parallelize the computation of the function $F(\cdot)$ over these states, then obtain $\widehat{\mathrm{RASR}}$ by calculating the average of the $K$ results.
Algorithm 1: PRQMC
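A minimal Python sketch of the two-stage procedure described above is given below. It makes several simplifying assumptions: `anc_w_from_state` is a stand-in for the weighted ANC of Definition 4, scipy's scrambled Sobol generator plays the role of the randomized LDS, and a process pool provides the parallel stage:

```python
import numpy as np
import networkx as nx
from scipy.stats import qmc
from concurrent.futures import ProcessPoolExecutor

def anc_w_from_state(G, attack_sequence, state):
    """Stand-in for ANCw (Definition 4): average GCC fraction over attack steps,
    removing node v_i only if its attack succeeded (state[i] == 1)."""
    H = G.copy()
    gcc0 = max(len(c) for c in nx.connected_components(H))
    total = 0.0
    for v, success in zip(attack_sequence, state):
        if success:
            H.remove_node(v)
        gcc = max((len(c) for c in nx.connected_components(H)), default=0)
        total += gcc / gcc0
    return total / len(attack_sequence)

def prqmc(G, attack_sequence, asr, K=2**10, workers=4, seed=0):
    """Approximate RASR: sample K scrambled Sobol points, map them to attack
    states (success iff u_i < p_i), and average ANCw over the states in parallel."""
    N = len(attack_sequence)
    p = np.array([asr[v] for v in attack_sequence])
    # Sampling stage: K points from an N-dimensional randomized (scrambled) Sobol LDS.
    u = qmc.Sobol(d=N, scramble=True, seed=seed).random(K)
    states = (u < p).astype(int)
    # Parallel stage: evaluate ANCw for each sampled state and average.
    with ProcessPoolExecutor(max_workers=workers) as pool:
        values = list(pool.map(anc_w_from_state, [G] * K,
                               [attack_sequence] * K, map(tuple, states)))
    return float(np.mean(values))

if __name__ == "__main__":
    G = nx.karate_club_graph()
    seq = sorted(G.nodes(), key=G.degree, reverse=True)
    asr = {v: 0.7 for v in G.nodes()}            # assumed uniform ASR
    print("Approximate RASR:", round(prqmc(G, seq, asr), 4))
```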
3.4. The Proposed HBnnsAGP Attack Strategy
To assess the lower bound of network RASR, a new attack strategy called High BCnns Adaptive GCC-Priority (HBnnsAGP) is presented. In HBnnsAGP, a novel centrality measure called BCnns is proposed to quantify the significance of a node, and a GCC-priority attack strategy is utilized to improve attack effectiveness. Algorithm 2 describes the procedure of HBnnsAGP, which contains two steps: obtaining the first part of the attack sequence and obtaining the second part of the attack sequence. In the first step, the algorithm obtains the first part of the attack sequence by iteratively removing the node with the highest BCnns in the GCC and recalculating BCnns for the remaining nodes until only isolated nodes remain in the residual network. In the second step, the algorithm arranges these isolated nodes in descending order according to their DC values in the initial network to obtain the second part of the attack sequence. This second step is aimed at improving the effectiveness of attacks when the ASR is below 100%: nodes that are isolated under the assumption of a 100% ASR may no longer be isolated, as shown in Figure 1. Additionally, previous research has shown that there is minimal difference in destructiveness between simultaneous attacks and sequential attacks based on DC [9]. Therefore, by sorting these isolated nodes in descending order based on their DC values from the initial network (similar to the approach used in simultaneous attacks), the second step further improves the effectiveness of attacks when the ASR is less than 100%.
In the following, we first introduce the BCnns and then give the GCC-priority attack strategy.
Algorithm 2: HBnnsAGP
3.4.1. Non-central Nodes Sampling Betweenness Centrality (BCnns)
In contrast to BC (see Definition 2), which evaluates a node's role as a mediator based on the number of shortest paths passing through it over all node pairs, BCnns quantifies the importance of nodes acting as bridges between different network communities by counting the number of shortest paths that pass through a node for specific pairs of non-central nodes (nodes located on the periphery of the network and of lesser importance). These bridge nodes typically serve as mediators for non-central nodes across different communities. The BCnns is defined as follows.
Definition 7.
For a network $G = (V, E)$ with $N$ nodes, the BCnns of node $v$ in network $G$ is:
$$\mathrm{BCnns}(v) = \sum_{s \in S,\; t \in T,\; s \neq t \neq v} \frac{\sigma_{st}(v)}{\sigma_{st}},$$
where $S$ and $T$ are sets of non-central nodes sampled from $V$, with $|S| \ll N$ and $|T| \ll N$. The $\sigma_{st}$ and $\sigma_{st}(v)$ have the same meaning as defined in Definition 2.
By selecting the appropriate pairs of non-central nodes, BCnns can more effectively measure the significance of nodes as bridges between different communities in a network. While these bridge nodes may not have the highest BC value, they are crucial for maintaining overall network connectivity and could potentially have the highest BCnns value.
The definition of BCnns highlights the importance of selecting suitable nodes for the sets $S$ and $T$. Thus, we propose an algorithm called Selection for node selection. Algorithm 3 describes the procedure of Selection. Initially, the nodes are sorted in ascending order based on their DC values, and the first $n_{nc}$ nodes with lower DC values are selected to create the non-central node set $V_{nc}$. This is because nodes with lower DC values typically have lower centrality and are considered non-central nodes. Next, in order to achieve a more balanced sampling, $V_{nc}$ is divided into two subsets: $V_{odd}$ containing nodes at odd indices and $V_{even}$ containing nodes at even indices. Lastly, $n_{st}$ nodes are randomly sampled from $V_{odd}$ to create set $S$, and $n_{st}$ nodes are similarly sampled from $V_{even}$ to form set $T$.
Algorithm 3: Selection
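A sketch of the node-selection step and the resulting BCnns computation is shown below. The parameter names `n_nc` and `n_st` mirror the two sample numbers discussed in the text, and the pair-restricted shortest-path counting reuses networkx's subset betweenness, so this is an illustrative rendering rather than the paper's exact implementation:

```python
import random
import networkx as nx

def selection(G: nx.Graph, n_nc: int, n_st: int, seed=0):
    """Pick non-central source/target sets S and T as described in Algorithm 3:
    take the n_nc lowest-degree nodes, split them by alternating index, and
    sample n_st nodes from each half."""
    rng = random.Random(seed)
    by_degree = sorted(G.nodes(), key=G.degree)       # ascending DC
    non_central = by_degree[:n_nc]
    odd, even = non_central[0::2], non_central[1::2]
    S = rng.sample(odd, min(n_st, len(odd)))
    T = rng.sample(even, min(n_st, len(even)))
    return S, T

def bcnns(G: nx.Graph, S, T):
    """BCnns: betweenness restricted to shortest paths between pairs (s, t),
    s in S and t in T, computed via networkx's subset betweenness."""
    return nx.betweenness_centrality_subset(G, sources=S, targets=T, normalized=False)

G = nx.karate_club_graph()
S, T = selection(G, n_nc=16, n_st=8)
scores = bcnns(G, S, T)
print("Node with highest BCnns:", max(scores, key=scores.get))
```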
The values of $n_{nc}$ and $n_{st}$ are chosen based on the size of the network and the node degree distribution. Typically, both $n_{nc}$ and $n_{st}$ are much smaller than the total number of nodes $N$. Therefore, BCnns has a higher computational efficiency than BC, especially for large-scale networks.
Figure 5 demonstrates the differences between BC and BCnns. Specifically, Figure 5a identifies the non-central nodes in red, Figure 5b scales node sizes by BC values, and Figure 5c scales node sizes by BCnns values. Notably, node 14 plays a critical bridging role between two communities, a role that BCnns captures more accurately than BC.
3.4.2. GCC-Priority Attack Strategy
As the attack progresses, the network fragments into connected components of varying sizes. The importance of these components varies within the residual network. The GCC refers to the largest connected component containing the most nodes. The destruction of the GCC accelerates the collapse of the network. The GCC-priority attack strategy enhances the attack’s effectiveness by targeting nodes within the GCC at each stage of the attack process.
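Putting the pieces together, a compact sketch of the HBnnsAGP strategy might look as follows. It reuses the `selection` and `bcnns` helpers from the previous sketch (paste the two sketches together to run) and simplifies Algorithm 2 to its core loop: recompute BCnns on the current GCC, remove the top node, and finally append the remaining isolated nodes by their initial degree:

```python
import networkx as nx

# Assumes `selection` and `bcnns` from the previous sketch are defined in this module.

def hbnns_agp(G: nx.Graph, n_nc: int, n_st: int, seed=0):
    """Sketch of HBnnsAGP: adaptively remove the highest-BCnns node of the current
    GCC until only isolated nodes remain, then append those nodes in descending
    order of their degree in the original network."""
    H = G.copy()
    sequence = []
    while True:
        gcc = max(nx.connected_components(H), key=len)
        if len(gcc) <= 1:                          # only isolated nodes remain
            break
        sub = H.subgraph(gcc)                      # GCC-priority: work inside the GCC
        S, T = selection(sub, min(n_nc, len(gcc)), n_st, seed=seed)
        scores = bcnns(sub, S, T)
        target = max(scores, key=scores.get)       # highest BCnns inside the GCC
        sequence.append(target)
        H.remove_node(target)
    leftovers = sorted(H.nodes(), key=lambda v: G.degree(v), reverse=True)
    return sequence + leftovers

G = nx.karate_club_graph()
print("First five targets:", hbnns_agp(G, n_nc=16, n_st=8)[:5])
```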
4. Experimental Studies
In this section, we present a series of experiments to verify the effectiveness of our proposed methods. Firstly, we introduce the experimental settings including network datasets and baselines. Next, we compare the proposed PRQMC method with the baselines. Additionally, we demonstrate the effectiveness of the proposed HBnnsAGP attack strategy. Finally, we present further discussions on network robustness when considering the ASR.
4.1. Experimental Settings
4.1.1. Datasets
In our experiments, we selected six classic real-world complex networks of different scales: Karate [44], Krebs [10], Airport [45], Crime [46], Power [47], and Oregon1 [48]. Table 1 provides a detailed summary of these networks, with $N$ and $M$ representing the number of nodes and edges, respectively, and $\langle k \rangle$ denoting the average degree of the network.
4.1.2. Comparison Methods
To show the effectiveness of the proposed PRQMC algorithm, we compare it with the MC and QMC methods.
MC: It calculates the estimated value of $\widehat{\mathrm{RASR}}$ using original MC integration and generates a set of points from a PRS.
QMC: It calculates the estimated value of $\widehat{\mathrm{RASR}}$ using original QMC integration and generates a set of points from an LDS.
To show the effectiveness of the proposed HBnnsAGP attack strategy, we compare it with three representative baseline attack strategies: HDA [49], HBA [28], and FINDER [10].
High Degree Adaptive (HDA): HDA is an adaptive version of the high degree method that ranks nodes based on their DC and sequentially removes the node with the highest DC. HDA recomputes the DC of the remaining nodes after each node removal and is recognized for its superior computational efficiency.
High Betweenness Adaptive (HBA): HBA is an adaptive version of the high betweenness method. It operates by iteratively removing the node with the highest BC and recomputing BC for the remaining nodes. HBA has long been considered the most effective strategy for the network dismantling problem in the node-unweighted scenario [50]. However, its high computational cost prohibits its use in medium and large-scale networks.
FINDER: FINDER is notable as an algorithm based on deep reinforcement learning, which achieves superior performances in terms of both effectiveness and efficiency.
We implemented the proposed algorithms and the baselines in the Python programming language. All experiments were performed on a server with an AMD EPYC 7742 64-core processor @ 2.25 GHz and 1024 GB of memory (RAM), running the Ubuntu 11.10 Linux operating system.
4.2. Comparison of the PRQMC with Baselines
This subsection presents the comparison results to demonstrate the effectiveness of the proposed algorithm, PRQMC, on six real-world complex networks. Specifically, we compare PRQMC with two baselines: MC and QMC. All experiments use the same attack strategy, and the ASR of each node is randomly generated.
We first compare PRQMC with the baselines on two small-scale networks (Karate and Krebs). This is because precise values of RASR can be calculated analytically for small-scale networks. Then, for large-scale networks (Airport, Crime, Power, and Oregon1), we utilize the standard deviation curve as the convergence criterion, as the analytical method is not applicable to large-scale networks.
Figure 6 and Figure 7 present the comparison of the convergence and error between PRQMC and the baselines. The figures clearly illustrate that PRQMC achieves faster convergence and better accuracy with fewer samples compared to the baselines.
Additionally, Table 2 presents a comparison of the computational efficiency of PRQMC and the baselines, each with 5000 sampling iterations. In the PRQMC method, the number of parallel computing processes is set based on the network size, assigning 25 processes to the Karate and Krebs networks and 100 processes to the other networks. The results in Table 2 indicate that the PRQMC method outperforms the baselines in terms of computational efficiency. Specifically, the PRQMC method runs nearly 50 times faster than the QMC and MC methods on the Oregon1 network.
4.3. Comparison of the HBnnsAGP with Baselines
In this subsection, we will demonstrate the effectiveness and efficiency of the proposed HBnnsAGP attack strategy. Specifically, we will compare HBnnsAGP with HDA, HBA, and FINDER on six real-world complex networks, while considering different ASR conditions. Initially, we will employ various attack strategies to generate corresponding attack sequences. Subsequently, we will utilize the PRQMC method to calculate the value under the following ASR distribution scenarios.
ASR = 100%: The ASR of each node is set to 100%.
ASR = 50%: The ASR of each node is set to 50%.
ASR = 50% for the first 30% of nodes: In the attack sequence generated by different attack strategies, the ASR of the first 30% of nodes is set to 50%.
Random ASR: The ASR of each node is randomly set between 50% and 100%. To obtain more reliable results, the average of 10 experimental outcomes is taken.
The sample numbers ($n_{nc}$ and $n_{st}$) for the different networks used in HBnnsAGP are presented in Table 3.
Table 4 presents the $\widehat{\mathrm{RASR}}$ values of the networks in the four specified scenarios. The data suggest that HBnnsAGP outperforms the other attack strategies in terms of destructiveness. On average, the destructiveness of HBnnsAGP is 6.76%, 4.03%, and 7.26% higher than that of the FINDER, HBA, and HDA strategies, respectively.
Table 5 presents a comparison of computation times for HBnnsAGP and the baselines. As the network size increases, the computation time for the HBA method becomes excessively long. In contrast, the HBnnsAGP method maintains commendable computational efficiency even for larger-scale networks. For the Oregon1 network, HBnnsAGP is approximately 28 times faster than HBA. While the computational efficiency of HBnnsAGP slightly lags behind that of FINDER and HDA for larger-scale networks, it surpasses them in terms of attack destructiveness.
Figure 8 presents the ANCw curves of the networks under various attack strategies when the ASR of each node is set to 100%. In this scenario, the state of the attack sequence is unique (every attack succeeds). The figure shows that HBnnsAGP excels at identifying critical nodes in the network, leading to more effective disruption of the network structure compared to the other methods. Hence, the effectiveness of the proposed HBnnsAGP attack strategy is verified.
4.4. Further Discussions About Network Robustness
The data presented in Table 4 indicate that reducing the ASR can significantly enhance network robustness. Generally, this reduction can be achieved by reinforcing node protection. Comparing Scenario 2 and Scenario 3, it is apparent that simply reducing the ASR of the first 30% of nodes in the attack sequence (Scenario 3) effectively enhances network robustness; this improvement is approximately 78.25% of that in Scenario 2. Therefore, strengthening the protection of a small subset of crucial nodes can effectively enhance the robustness of the network.
Figure 1.
An example of the network disintegration process under different ASR. Gray nodes indicate successful attacks, green nodes represent unsuccessful attacks, and blue nodes denote unattacked nodes.
Figure 2.
An example of the MC integration method for approximating a definite integral over a one-dimensional unit interval. (a) illustrates the approximation of the integral by summing the areas of bars that correspond to the sampled points. Each bar's height represents the value of $f(x_k)$ at the sampled point $x_k$ and its width is $1/K$, where $K$ denotes the total number of samples. (b) demonstrates the sequential rearrangement of the bars to prevent overlapping on the X-axis, ensuring a clear visualization of the areas.
Figure 3.
A comparison of MC and QMC integration methods. (a) and (d) show the two-dimensional projections of a PRS and an LDS (a Sobol sequence) respectively. (b) and (c) depict the MC integration for approximating a definite integral over a one-dimensional unit interval, while (e) and (f) present the QMC integration for approximating a definite integral over a one-dimensional unit interval.
Figure 4.
An example illustrating the division of the unit hypercube for the case $N = 2$. The unit hypercube is divided into $2^N = 4$ regions, $D_1, D_2, D_3, D_4$, where each region $D_j$ corresponds to a state $X_j$ of the attack sequence.
Figure 5.
An illustrative example of non-central nodes and comparison of BC and BCnns. In this figure, (a) highlights non-central nodes in red, (b) showcases node sizes based on BC, and (c) showcases node sizes based on BCnns.
Figure 6.
Comparison of the convergence and error of the PRQMC, QMC, and MC methods in assessing robustness for two smaller-scale networks.
Figure 7.
Comparison of the convergence and standard deviation of the PRQMC, QMC, and MC methods in assessing robustness for four larger-scale networks.
Figure 8.
The ANCw curves of networks under different attack strategies.
Table 1.
Basic Information of 6 Real-World Networks. $N$ and $M$ Represent the Number of Nodes and Edges, Respectively, and $\langle k \rangle$ Denotes the Average Degree of the Network.
| Network | Description | N | M | $\langle k \rangle$ |
|---|---|---|---|---|
| Karate [44] | Karate club network | 34 | 78 | 4.59 |
| Krebs [10] | Terrorist network | 62 | 159 | 5.13 |
| Airport [45] | Aviation network | 332 | 2126 | 12.81 |
| Crime [46] | Criminal network | 829 | 1473 | 3.55 |
| Power [47] | Power grid | 4941 | 6594 | 2.67 |
| Oregon1 [48] | AS peering network | 10670 | 22002 | 4.12 |
Table 2.
Computational Time Comparison of PRQMC, QMC, and MC Methods (s)
| Network | MC | QMC | PRQMC |
|---|---|---|---|
| Karate | 1.8 | 1.7 | 0.4 |
| Krebs | 4.5 | 4.3 | 0.6 |
| Airport | 106.7 | 104.9 | 3.0 |
| Crime | 518.2 | 520.6 | 7.4 |
| Power | 20,525.7 | 20,529.1 | 343.7 |
| Oregon1 | 213,748.2 | 213,758.1 | 4,262.4 |
Table 3.
The Sample Numbers ($n_{nc}$ and $n_{st}$) for Different Networks Used in HBnnsAGP
| Network | $n_{nc}$ | $n_{st}$ |
|---|---|---|
| Karate | 16 | 8 |
| Krebs | 30 | 16 |
| Airport | 100 | 60 |
| Crime | 120 | 80 |
| Power | 1300 | 80 |
| Oregon1 | 2300 | 80 |
Table 4.
The Robustness of Networks Under Different ASR. All Values Are Multiplied by 100
| Network | HBnnsAGP | FINDER | HBA | HDA |
|---|---|---|---|---|
| 1. ASR = 100% | | | | |
| Karate | 12.77 | 14.12 | 15.04 | 15.04 |
| Krebs | 12.26 | 16.26 | 14.21 | 17.23 |
| Airport | 7.53 | 10.25 | 7.93 | 11.10 |
| Crime | 9.90 | 11.04 | 10.14 | 11.54 |
| Power | 0.91 | 5.02 | 1.01 | 5.23 |
| Oregon1 | 0.68 | 1.06 | 0.73 | 1.01 |
| Avg score | 7.34 | 9.63 | 8.18 | 10.19 |
| 2. ASR = 50% | | | | |
| Karate | 59.90 | 60.55 | 60.88 | 61.41 |
| Krebs | 57.61 | 58.38 | 57.74 | 58.33 |
| Airport | 59.45 | 60.32 | 60.39 | 60.52 |
| Crime | 57.59 | 59.50 | 59.50 | 59.08 |
| Power | 16.54 | 19.73 | 17.31 | 19.93 |
| Oregon1 | 49.68 | 51.58 | 51.28 | 51.33 |
| Avg score | 50.13 | 51.69 | 51.18 | 51.77 |
| 3. ASR = 50% for the first 30% of nodes | | | | |
| Karate | 48.61 | 50.38 | 49.76 | 50.57 |
| Krebs | 45.51 | 47.16 | 45.78 | 47.38 |
| Airport | 48.78 | 50.68 | 51.00 | 50.47 |
| Crime | 41.84 | 48.17 | 46.90 | 47.12 |
| Power | 14.87 | 17.79 | 16.32 | 17.86 |
| Oregon1 | 41.26 | 42.91 | 42.83 | 43.14 |
| Avg score | 40.15 | 42.91 | 42.10 | 42.76 |
| 4. Random ASR | | | | |
| Karate | 35.12 | 36.29 | 36.50 | 37.57 |
| Krebs | 30.39 | 32.99 | 31.02 | 33.36 |
| Airport | 36.95 | 38.58 | 38.67 | 38.99 |
| Crime | 27.90 | 30.96 | 30.48 | 30.42 |
| Power | 5.17 | 8.46 | 5.18 | 8.80 |
| Oregon1 | 21.61 | 24.23 | 23.52 | 23.86 |
| Avg score | 26.19 | 28.56 | 27.55 | 28.79 |
Table 5.
The Computation Time of Different Attack Strategies (ms)
| Network | HBnnsAGP | FINDER | HBA | HDA |
|---|---|---|---|---|
| Karate | 1.6 | 16.3 | 1.9 | 0.5 |
| Krebs | 3.6 | 36.6 | 4.6 | 2.3 |
| Airport | 82.3 | 218.3 | 211.0 | 11.1 |
| Crime | 552.1 | 369.3 | 4,434.6 | 49.1 |
| Power | 6,760.7 | 1,397.9 | 78,119.9 | 1,796.8 |
| Oregon1 | 15,799.1 | 8,641.5 | 477,802.8 | 2,065.9 |