Node Ranking in Complex Networks Based on LeaderRank and Modes Similaritya

GU Yi-ran; ZHU Zi-yan

doi:10.3969/j.issn.1001-0548.2017.02.020

The effect of important nodes in complex networks on the structure and function of the networks causes widespread concern. This paper presents a SRank algorithm based on LeaderRank and nodes similarity which is used to measure the interaction between nodes. The simulation of SIR model and Spearman's correlation coefficient on real social networks show that the SRankalgorithm preforms better on identifying influential nodes both in directed and undirected networks, compared with the other four classical algorithms.

HTML

随着科学技术的发展，世界变得越来越小，也越来越复杂，涌现了大量难以用经典概念解释的问题，如微博中谣言通过少量节点快速传播到整个网络，传染病通过交通网络中的少量节点快速传播扩散^[1]等。借助网络科学的发展与进步，学者们对这些问题进行了定量化的描述和研究，将这些可以对整个网络的结构和功能产生巨大作用的少量节点，称之为重要节点。挖掘这些重要节点并研究其内在动力学机理成为当前的研究热点之一。

近年来，学者在重要节点排序上提出了很多指标和算法，主要有以下4个方面^[2]：1) 基于节点邻居节点的排序方法，如度中心性 (degree centrality)^[3]，即节点的邻居节点越多其影响力越大，其缺点是仅仅考虑了节点的局部信息。2) 基于路径的排序方法，如接近中心性 (closeness centrality)^[4]，通过计算节点与网络中其他所有节点距离平均值来衡量节点的重要性，缺点是时间复杂度比较高。3) 基于特征向量的排序方法，如PageRank算法^[5]，该算法认为每一个节点的重要性取决于指向它的其他节点的数量和质量，缺点是容易陷入悬挂节点。LeaderRank^[6-7]算法在PageRank算法的基础上，通过加入一个背景节点，该节点与网络的所有节点双向连接，来替代PageRank算法中跳转概率s，从而提高了算法的收敛速度和鲁棒性。4) 基于节点的移除和收缩的排序方法^[8-9]，通过移除节点对网络的破坏性来衡量节点的重要性，但计算复杂度高。

这些算法大多考虑节点在全局中的重要性或者局部重要性，忽略了节点间相互作用对整个网络的影响。本文考虑在现实生活中，人们更信任那些和自己关系更为相似的人，即节点之间的相似度越高则节点间的影响力越大。共同邻居节点相似度^[10]表明，如果两个节点的共同邻居越多，这两个节点的相似度越高。因此，本文引入节点相似度指标作为节点局部重要性的衡量指标，在LeaderRank的基础上综合考虑节点的局部重要性和全局重要性，提出SRank算法。将该算法和其他几种经典算法进行比较，仿真结果表明，该算法在有向网络和无向网络的重要节点排序结果均优于其他算法。

3. 结束语

本文主要工作是在LeaderRank算法的基础上引入了节点相似度的概念，综合考虑节点的局部特征和全局特征，提出了SRank节点重要性排序算法。对四种真实网络进行重要性排序，在通过分析网络结构来说明本文算法在无向网络的节点重要度排序的合理性同时，利用传播仿真成功验证了本文算法在有向网络节点重要度排序的有效性。

本文研究工作还存在一些不足。提出的算法主要针对于连通网络，对不连通网络还存在一定的不确定性。今后的工作需要更全面、科学地考虑不连通网络节点重要性的排序和验证方法。

Reference (12)

[1]	周涛, 韩筱璞, 闫小勇. 人类行为时空特性的统计力学[J]. 电子科技大学学报, 2013, 42(4): 481-540.	ZHOU Tao, HAN Xiao-pu, YAN Xiao-yong. Statistical mechanics on temporal and spatial activities of human[J]. Journal of University of Electronic Science and Technology of China, 2013, 42(4): 481-540.
[2]	任晓龙, 吕琳媛. 网络重要节点排序方法综述[J]. 科学通报, 2014, 59(13): 1175-1197. doi: 10.1360/972013-1280	REN Xiao-long, LÜ Lin-yuan. Review of ranking nodes in complex networks[J]. Chin Sci Bull, 2014, 59(13): 1175-1197. doi: 10.1360/972013-1280
[3]	BONACICH P. Factoring and weighting approaches to status scores and clique identification[J]. Journal of Mathematical Sociology, 1972, 2(1): 113-120. doi: 10.1080/0022250X.1972.9989806
[4]	FREEMAN L C. Centrality in social networks conceptual clarification[J]. Social Networks, 1978, 1(3): 215-239. doi: 10.1016/0378-8733(78)90021-7
[5]	BRIN S, PAGE L. The anatomy of a large-scale hypertextual web search engine[J]. Computer Networks & Isdn Systems, 1998, 30(98): 107-117.
[6]	LÜ L, ZHANG Y C, YEUNG C H. Leaders in social networks, the delicious case[J]. Plos One, 2011, 6(6): e21202-. doi: 10.1371/journal.pone.0021202
[7]	LI Q, ZHOU T, LÜ L. Identifying influential spreaders by weighted LeaderRank[J]. Physica A Statistical Mechanics & Its Applications, 2014, 404(24): 47-55.
[8]	李鹏翔, 任玉晴, 席酉民. 网络节点 (集) 重要性的一种度量指标[J]. 系统工程, 2004, 22(4): 13-20.	LI Peng-xiang, REN Yu-qing, XI You-min. An importance measure of actors (set) within a network[J]. Systems Engineering, 2004, 22(4): 13-20.
[9]	DANGALCHEV C. Residual closeness in networks[J]. Physica A, 2006, 365(): 556-564. doi: 10.1016/j.physa.2005.12.020
[10]	LIU Z, ZHANG Q M, LÜ L. Link prediction in complex networks:a local naive Bayes model[J]. Epl, 2011, 96(4): 48007-. doi: 10.1209/0295-5075/96/48007
[11]	BONACICH P. Factoring and weighting approaches to status scores and clique identification[J]. J Math Sociol, 1972, 2(): 113-120. doi: 10.1080/0022250X.1972.9989806
[12]	马俊, 周刚, 许斌. 一种基于话题传播的微博用户影响力分析方法[J]. 信息工程大学学报, 2013, 14(6): 735-742.	MA Jun, ZHOU Gang, XU Bin. User influence analysis in microblog based on topic diffusion[J]. Journal of Information Engineering University, 2013, 14(6): 735-742.

网络	节点数	边数	有向	平均度	平均路径	直径	聚类系数
Zachary	34	78	无向	4.59	2.41	5	0.570 6
Dolphins	62	159	无向	5	3.37	8	0.265 2
Neural	239	1912	有向	8	3.99	5	0.196 5
Polblogs	793	15 783	有向	19.90	3.19	8	0.288 1

排名	度中心性		接近中心性		PageRank		LeaderRank		SRank
排名	节点	DC	节点	CC	节点	PR	节点	LR	节点	SR
1	34	17	34	23.25	34	0.098 3	34	0.080 4	1	0.018 7
2	1	16	1	23.17	1	0.094 6	1	0.075 9	34	0.017 8
3	33	12	3	21	33	0.070 1	33	0.058 0	33	0.010 6
4	3	10	33	20.92	3	0.055 1	3	0.049 1	3	0.007 65
5	2	9	32	19.33	2	0.051 5	2	0.044 6	2	0.007 23
6	32	6	2	19.17	32	0.036 7	32	0.031 3	4	0.004 15
7	4	6	14	18.5	4	0.035 2	4	0.031 3	14	0.002 373
8	24	5	9	18.5	24	0.031 4	14	0.026 8	9	0.002 09
9	14	5	4	17.67	7	0.029 7	9	0.026 8	8	0.001 91
10	9	5	20	17.5	6	0.029 7	24	0.026 8	32	0.001 91

排名	度中心性		接近中心性		PageRank		LeaderRank		SRank
排名	节点	DC	节点	CC	节点	PR	节点	LR	节点	SR
1	15	12	38	30.4	18	0.031 7	15	0.029 4	15	0.003 44
2	46	11	15	29.98	52	0.031 3	38	0.027 1	46	0.002 99
3	38	11	37	29.28	15	0.030 5	46	0.027 1	38	0.002 46
4	52	10	41	29.28	58	0.029 7	52	0.024 9	34	0.002 46
5	34	10	21	29.15	38	0.029 1	34	0.024 9	58	0.001 97
6	58	9	46	28.58	46	0.028 4	58	0.022 6	52	0.001 97
7	30	9	34	28.48	34	0.027 6	18	0.022 6	14	0.001 84
8	21	9	2	27.37	30	0.025 8	21	0.022 6	30	0.001 63
9	18	9	51	27.33	14	0.025 6	30	0.022 6	18	0.001 41
10	41	8	52	26.83	2	0.024 1	14	0.020 4	10	0.001 34

算法	Polblogs			neural
算法	全部	前10%	前20%	全部	前10%	前20%
DC	0.823 1	0.993 6	0.977 4	0.444 7	0.950 1	0.876 0
CC	0.429 3	0.944 4	0.888 3	-0.181 8	0.828 2	0.690 0
PR	0.937 6	0.998 6	0.996 4	0.847 5	0.987 2	0.972 7
LR	0.960 9	0.997 7	0.993 6	0.878 4	0.977 6	0.956 3
SR	0.963 9	0.999 3	0.997 9	0.879 5	0.996 7	0.989 4

Node Ranking in Complex Networks Based on LeaderRank and Modes Similaritya

doi: 10.3969/j.issn.1001-0548.2017.02.020

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Related

Proportional views