Altmetrics
Downloads
85
Views
20
Comments
0
A peer-reviewed article of this preprint also exists.
Submitted:
10 October 2023
Posted:
11 October 2023
You are already at the latest version
Algorithm 1:APSP_PDijkstra |
Algorithm 2:The Pruned Dijkstra |
Algorithm 3: Check distance from source to target nodes according a 2-hop cover
|
Algorithm 4:Sketches_DIS-C: offline sampling |
Algorithm 5:Sketches_DIS-C: k sketches offline |
Algorithm 6:Sketches_DIS-C: k sketches online |
Algorithm 7:Sketches_DIS-C: k sketches online APSP |
1 | From now on, n is the number of nodes, and m is the number of edges in the graph |
2 | A path is a locally shortest path (LSP) if the path obtained by removing its first edge and the path obtained by removing its last edge are both the shortest paths. |
3 | Where is the degree of entry of node a, is the exit degree of node a; and are the costs of entering and leaving node a, respectively; is the generality of node a; and are the nodes and edges respectively of the graph , which is the graph corresponding to the conceptualization K in the j-th iteration. Thus, is the cost of the edge that goes from node a to node b; is a geometric weighting factor. Moreover, is the APSP distance matrix, and is the set of edges representing a relation . Finally, is the convergence threshold of the algorithm. |
4 | A detailed analysis of this is presented in [71]. |
5 | The strategy for choosing the seed nodes consists of biasing those with the highest value of generality in the conceptual graph; this implies that the most general concepts will be chosen. |
Reference | Expression | Description |
---|---|---|
Rada et al. (1989) [14] | length of path from a to b | |
Wu and Palmer (1994) [15] | a and b are concepts within the hierarchy, c is a less common super concept of a and b. is the number of nodes in the path from a to c, is the number of nodes in the path from b to c and is the number of nodes in the path from c to the root of the hierarchy. | |
Hirst and Stonge (1995) [16] | C and K are constants, is the length of the shortest path between a and b, and is the number of times the path changes direction. | |
Li et al. (2003) [18] | is the length of the shortest path between a and b, h is the minimum depth of LCS (the more specific concept that is an ancestor of a and b) in the hierarchy, and are parameters that scale the contribution of the shortest path length and depth, respectively. | |
Shenoy et al. (2012) [19] | L is the shortest distance between a and b calculated taking into account the direction of the edges. Each vertical direction is given a value of 1; each time it changes direction, one more is added. N is the depth of the entire tree. and are the distances from the root to the concept a and b respectively. is 1 for neighboring concepts and 0 for the others |
Reference | Expression | Description |
---|---|---|
Resnik (1995) [24] | Based on the notion of the LCS (Least Common Subsumer), if the terms share an LCS, then the IC is calculated from it. | |
Jiang and Conrath (1997) [26] | It focuses on determining the link strength of an edge connecting a parent node with a child node. These taxonomic links between concepts are reinforced by the difference between the IC of a concept and its LCS. | |
Gao et al. (2015) [27] | Where is a constant and is the set of senses of the concept x. | |
Jiang et al. (2017) [28] | Where is the set of hyponyms for a, is the set of pages in category c and is the set of categories. | |
The second proposed approach is the combination of the IC by using the category structure and the extension of ontology-based methods. | ||
Generalization of the Zhou et al. (2008) [29] approach. Where is an adjustment factor for the weight of the two characteristics involved in the IC calculation, is the depth of the leaf a, and is the maximum depth of a leaf. | ||
Generalization of the Sanchez et al. (2011) [30] approach. Where is the set of leaves of a in the category hierarchy, is the set of hypernyms, and is the maximum number of leaves in the hierarchy. |
Reference | Year | |
---|---|---|
Dijkstra/Floyd–Warshall | 1959/1962 | |
Fredman (1976) [50] | 1976 | |
Takaoka (1992) [51] | 1991 | |
Dobosiewicz (1990) [52] | 1990 | |
Han (2004) [53] | 2004 | |
Takaoka (2004) [54] | 2004 | |
Takaoka (2005) [55] | 2005 | |
Zwick (2004) [56] | 2004 | |
Chan (2008) [57] | 2005 | |
Han (2006) [58] | 2006 | |
Chan (2010) [41] | 2007 | |
Williams (2018) [44] | 2014 |
Dataset | Content | Type | Word pairs |
---|---|---|---|
MC30 [74] | Nouns | Similarity | 30 |
RG65 [75] | Nouns | Similarity | 65 |
PS [76] | Nouns | Similarity | 65 |
Agirre201 [77] | Nouns | Similarity | 201 |
SimLex665 [78] | Nouns | Similarity | 665 |
MTurk771 [79] | Nouns | Relation | 771 |
MTurk287 [80] | Nouns | Relation | 287 |
WS245Rel [81] | Nouns | Relation | 245 |
Rel122 [82] | Nouns | Relation | 122 |
SCWS [83] | Nouns | Relation | 1994 |
Dataset | Word pairs | Nodes | Edges |
---|---|---|---|
Dijkstra/Floyd–Warshall | 1959/1962 | ||
MC30 | 30 | 5,496 | 9583 |
RG65 | 65 | 5,080 | 9271 |
PS | 65 | 5,080 | 9271 |
Agirre201 | 201 | 29,738 | 75,120 |
SimLex665 | 665 | 44,798 | 138,122 |
MTurk771 | 771 | 55,403 | 185,523 |
MTurk287 | 287 | 25,576 | 58,637 |
WS245Rel | 245 | 24,029 | 56,983 |
Rel122 | 122 | 23,775 | 58,322 |
SCWS | 1,994 | 53,052 | 183,366 |
All | 4,445 | 119,034 | 818,788 |
Algorithm | MC30 | RG | PS | Agirre | SimLex | MTurk | MTurk | WSRel | Rel | SCWS | All | Avg. |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Dijkstra | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
Pruned Dijkstra | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.997 | 1.000 |
Sketches k=1 | 0.940 | 0.853 | 0.862 | 0.733 | 0.762 | 0.605 | 0.864 | 0.720 | 0.841 | 0.815 | 0.745 | 0.795 |
Sketches k=2 | 0.977 | 0.947 | 0.961 | 0.870 | 0.813 | 0.731 | 0.888 | 0.842 | 0.922 | 0.898 | 0.802 | 0.877 |
Sketches k=5 | 0.985 | 0.984 | 0.978 | 0.923 | 0.897 | 0.881 | 0.937 | 0.921 | 0.953 | 0.944 | 0.899 | 0.937 |
Sketches k=10 | 0.991 | 0.979 | 0.995 | 0.961 | 0.955 | 0.931 | 0.967 | 0.959 | 0.976 | 0.971 | 0.942 | 0.966 |
Sketches k=15 | 0.993 | 0.996 | 0.991 | 0.980 | 0.972 | 0.959 | 0.971 | 0.971 | 0.983 | 0.982 | 0.957 | 0.978 |
Sketches k=20 | 0.996 | 0.992 | 0.994 | 0.983 | 0.974 | 0.971 | 0.979 | 0.975 | 0.989 | 0.986 | 0.972 | 0.983 |
Algorithm | MC | RG | PS | Agirre | SimLex | MTurk | MTurk | WSRel | Rel | SCWS | All | Avg. |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Dijkstra | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
Sketches k=1 | 0.897 | 0.777 | 0.877 | 0.734 | 0.722 | 0.562 | 0.862 | 0.757 | 0.857 | 0.793 | 0.733 | 0.779 |
Sketches k=2 | 0.932 | 0.961 | 0.956 | 0.893 | 0.806 | 0.725 | 0.890 | 0.827 | 0.879 | 0.896 | 0.795 | 0.869 |
Sketches k=5 | 0.979 | 0.984 | 0.981 | 0.909 | 0.893 | 0.865 | 0.946 | 0.936 | 0.947 | 0.945 | 0.897 | 0.935 |
Sketches k=10 | 0.987 | 0.991 | 0.992 | 0.963 | 0.952 | 0.944 | 0.960 | 0.945 | 0.979 | 0.967 | 0.939 | 0.965 |
Sketches k=15 | 0.992 | 0.998 | 0.994 | 0.968 | 0.971 | 0.958 | 0.975 | 0.966 | 0.978 | 0.980 | 0.960 | 0.976 |
Sketches k=20 | 0.995 | 0.996 | 0.997 | 0.979 | 0.971 | 0.972 | 0.980 | 0.983 | 0.984 | 0.988 | 0.972 | 0.983 |
Algorithm | MC | RG | PS | Agirre | SimLex | MTurk | MTurk | WSRel | Rel | SCWS | All | Avg. |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Dijkstra | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
Pruned Dijkstra | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.968 | 0.997 |
Sketches k=1 | 0.938 | 0.876 | 0.858 | 0.702 | 0.742 | 0.605 | 0.843 | 0.671 | 0.799 | 0.831 | 0.736 | 0.782 |
Sketches k=2 | 0.977 | 0.944 | 0.959 | 0.823 | 0.804 | 0.729 | 0.855 | 0.821 | 0.900 | 0.906 | 0.807 | 0.866 |
Sketches k=5 | 0.978 | 0.978 | 0.984 | 0.911 | 0.887 | 0.876 | 0.925 | 0.907 | 0.950 | 0.954 | 0.897 | 0.931 |
Sketches k=10 | 0.987 | 0.981 | 0.991 | 0.941 | 0.955 | 0.931 | 0.963 | 0.946 | 0.973 | 0.978 | 0.942 | 0.962 |
Sketches k=15 | 0.987 | 0.993 | 0.985 | 0.968 | 0.971 | 0.969 | 0.963 | 0.960 | 0.980 | 0.986 | 0.954 | 0.974 |
Sketches k=20 | 0.989 | 0.988 | 0.991 | 0.973 | 0.979 | 0.978 | 0.976 | 0.961 | 0.985 | 0.990 | 0.965 | 0.980 |
Algorithm | MC | RG | PS | Agirre | SimLex | MTurk | MTurk | WSRel | Rel | SCWS | All | Avg. |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Dijkstra | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
Pruned Dijkstra | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.969 | 0.997 |
Sketches k=1 | 0.875 | 0.826 | 0.843 | 0.735 | 0.700 | 0.573 | 0.823 | 0.693 | 0.817 | 0.807 | 0.732 | 0.766 |
Sketches k=2 | 0.925 | 0.921 | 0.932 | 0.859 | 0.816 | 0.733 | 0.876 | 0.764 | 0.835 | 0.902 | 0.803 | 0.851 |
Sketches k=5 | 0.972 | 0.970 | 0.961 | 0.898 | 0.884 | 0.866 | 0.920 | 0.906 | 0.936 | 0.953 | 0.896 | 0.924 |
Sketches k=10 | 0.991 | 0.980 | 0.985 | 0.953 | 0.950 | 0.947 | 0.945 | 0.928 | 0.969 | 0.974 | 0.938 | 0.960 |
Sketches k=15 | 0.991 | 0.990 | 0.981 | 0.959 | 0.974 | 0.960 | 0.961 | 0.952 | 0.969 | 0.984 | 0.958 | 0.971 |
Sketches k=20 | 0.996 | 0.985 | 0.990 | 0.973 | 0.971 | 0.977 | 0.971 | 0.975 | 0.982 | 0.991 | 0.966 | 0.980 |
Algorithm | MC | RG | PS | Agirre | SimLex | MTurk | MTurk | WSRel | Rel | SCWS | All | Avg. |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Dijkstra | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.0 | 1.00 |
Pruned Dijkstra | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.0 | 1.00 |
Sketches k=1 | 0.94 | 0.86 | 0.86 | 0.72 | 0.75 | 0.61 | 0.85 | 0.69 | 0.82 | 0.82 | 0.0 | 0.79 |
Sketches k=2 | 0.98 | 0.95 | 0.96 | 0.85 | 0.81 | 0.73 | 0.87 | 0.83 | 0.91 | 0.90 | 0.0 | 0.88 |
Sketches k=5 | 0.98 | 0.98 | 0.98 | 0.92 | 0.89 | 0.88 | 0.93 | 0.91 | 0.95 | 0.95 | 0.0 | 0.94 |
Sketches k=10 | 0.99 | 0.98 | 0.99 | 0.95 | 0.95 | 0.93 | 0.97 | 0.95 | 0.97 | 0.97 | 0.0 | 0.97 |
Sketches k=15 | 0.99 | 0.99 | 0.99 | 0.97 | 0.97 | 0.96 | 0.97 | 0.97 | 0.98 | 0.98 | 0.0 | 0.98 |
Sketches k=20 | 0.99 | 0.99 | 0.99 | 0.98 | 0.98 | 0.97 | 0.98 | 0.97 | 0.99 | 0.99 | 0.0 | 0.98 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 MDPI (Basel, Switzerland) unless otherwise stated