Appendix A
(i) Ollivier-Ricci (OR)
Ollivier's discretization of classical Ricci curvature has been widely used in the analysis of graphs or networks. In spaces of positive curvature, the average distance between balls is smaller than their center distance, while in negative curvature space, the average distance between balls is larger than their center distance. Based on the above principle, Ollivier extends the classical Ricci curvature to graphs and networks. Taking the edge
between nodes
and
as an example, the OR curvature is defined as:
In Eq. (1),
and
represent discrete probability distribution functions at nodes
and
, respectively;
represents the Wasserstein distance between
and
, which is used to measure the transportation cost between them. The specific calculation method is shown in Eq. (2), where
represents the path distance (Euclidean distance) between nodes
and
in the graph.
Eq. (3) represents all transfer possibilities that transform the discrete probability distribution function into , where is the set of all nodes in the graph. For , its probability distribution must be explicit and consistent with the neighbor nodes.
Ollivier-Ricci curvature realizes discretization based on classical Ricci curvature by comparing the optimal average distance between the neighbor nodes of and (based on edge ) and the Euclidean distance between and . During this process, the key is to solve for the optimal average distance, which is regarded as an optimal transportation problem, namely, minimizing the average distance.
(ii) Forman-Ricci (FR)
Forman's Ricci curvature discretization method is mainly based on the relationship between the Laplacian and Ricci curvature, which is more algebraic in nature. This method was originally used for discrete geometric objects much larger than weighted graphs, and was subsequently introduced into undirected networks by Areejit Samal et al.(2016)[
19], and further extended to directed networks. To calculate the Forman-Ricci curvature
of edge e in an undirected network, it is necessary to determine the weights of all edges and nodes related to the edge
in advance, and the specific calculation process is shown in Eq. (4).
In Eq. (4), represents the edge connecting nodes and ; represents the weight of edge ; and represent the weights associated with nodes and respectively; and 和 denotes the set of edges incident to nodes and respectively after excluding edge .
From a geometric point of view, FR curvature can quantify the amount of information propagated by the ends of edges in the network. The higher the degree of information diffusion at the end of the edge, the greater the absolute value of FR curvature. Specifically, an edge with a high negative FR curvature may have multiple adjacent edges connected to two nodes of the edge, that is, both ends are funnel-shaped and can be connected to a large number of neighbor nodes. This makes it very likely that the shortest paths formed between other nodes (including nodes that are far away in the network) pass through this edge. Therefore, edges with high negative FR curvature have high betweenness centrality.
(iii) Menger-Ricci (MR)
Among the concepts of metric space and discrete curvature, the simplest definition was first proposed by Menger. He defined the curvature of a metric triangle
composed of three points in the space as the reciprocal of
, which is the circumcircle radius of the triangle
, that is,
. Areejit Samal et al.[
20,
21] extended Menger's definition to the network. Let
be a metric space, and
represents a triangle that sides are
, then the Menger-Ricci curvature of
is
.
In Eq. (5),
, it can be seen from the equation that the MR curvature value is constantly positive. According to the differential geometry method, the MR curvature of edge
in the network can be defined as
.
In Eq. (6), represents the triangle adjacent to side . If an edge is part of several triangles in the network, it will have a high positive MR curvature.
(iv) Haantjes-Ricci (HR)
The basic principle of Haantjes-Ricci curvature is similar to that of Menger-Ricci curvature. The main difference lies in the fact that MR curvature only considers triangles formed by the two nodes of edge or simple paths of length 2, whereas HR curvature takes longer paths between two nodes into account. In special cases, these two curvature concepts can coincide through a universal constant.
Haantjes defined the curvature of a curve as the ratio of the arc length of the curve to its corresponding chord length. Specifically, given a curve
in a metric space
, and three points
on
(with
located between
and
), the HR curvature at point
is defined as
.
In Eq. (7),
represents the length of the arc
. In the network,
represents the path
, and the edge
represents the chord corresponding to
. On this basis, Areejit Samal et al.[
20,
21] defined the HR curvature of a simple path
as
.
Thus, the HR curvature of edge
can be defined as
.
In Eq. (9), represents the path connecting two nodes of edge . Due to calculation limitations, we only consider simple paths with length less than or equal to 4 when using this formula to calculate HR curvature in this paper.