Modeling of Input-output Performance of Scientists Based on the Analysis of Citation

GUO Qiang; CHEN Qing-wen; LIU Jian-guo

doi:10.12178/1001-0548.2018236

Volume 49 Issue 5

Sep. 2020


GUO Qiang, CHEN Qing-wen, LIU Jian-guo. Modeling of Input-output Performance of Scientists Based on the Analysis of Citation[J]. Journal of University of Electronic Science and Technology of China, 2020, 49(5): 774-779. doi: 10.12178/1001-0548.2018236


1. Complex Systems Science Research Center, University of Shanghai for Science and Technology, Yangpu, Shanghai 200093
2. School of Accountancy, Shanghai University of Finance and Economics, Yangpu, Shanghai 200433

Received Date: 2018-09-06
Rev Recd Date: 2019-10-09

Available Online: 2020-09-29

Publish Date: 2020-09-23

Abstract

This paper presents a model for evaluating the input-output performance of scientists. To account for the communication and time costs that scientists invest, the model takes the number of co-authors and the number of institutions of the target scientists in scientific papers as input variables, and the number of co-published articles and their citation counts as output variables. The experimental results show that scientists who won the Nobel Prize are ranked higher than scientists who did not. The AUC value of the input-output performance model reaches 0.7957 on the APS data set, which is better than the results obtained with the h-index, the i10-index, the total number of papers, and the total number of citations. Furthermore, for most scientists in both the APS data set and the Web of Science data set, input-output performance before winning an award is higher than input-output performance after winning it. The proposed model also gives policy makers an effective tool for quantifying the input-output performance of scientists.
- citation analysis,
- H-index,
- input-output performance,
- scientific cooperation,
- total number of citations

References

[1]	HICKS D, WOUTERS P, WALTMAN L, et al. The Leiden manifesto for research metrics[J]. Nature, 2015, 520(7548): 429. doi: 10.1038/520429a
[2]	刘浏, 王东波. 引用内容分析研究综述[J]. 情报学报, 2017, 36(6): 637-643. doi: 10.3772/j.issn.1000-0135.2017.06.012 LIU Liu, WANG Dong-bo. A review of citation content analysis research[J]. Journal of Information, 2017, 36(6): 637-643. doi: 10.3772/j.issn.1000-0135.2017.06.012
[3]	胡小军, 郭强, 杨凯, 等. 基于相对熵的多属性作者学术影响力排名研究[J]. 电子科技大学学报, 2018, 47(2): 281-285. HU Xiao-jun, GUO Qiang, YANG Kai, et al. Multi-attribute researcher academic influence ranking based on relative entropy[J]. Journal of University of Electronic Science and Technology of China, 2018, 47(2): 281-285.
[4]	VAN H B A, PHELPS J, BARNES M, et al. Evaluating scientific impact[J]. Environmental Health Perspectives, 2000, 108(9): A392. doi: 10.1289/ehp.108-a392
[5]	FITZPATRICK R B. Essential science indicators[J]. Medical Reference Services Quarterly, 2005, 24(4): 67. doi: 10.1300/J115v24n04_05
[6]	曹志梅, 刘伟辉, 杨光. 高校 ESI 潜势学科排名提升策略探讨[J]. 情报探索, 2017(4): 44-47. doi: 10.3969/j.issn.1005-8095.2017.04.009 CAO Zhi-mei, LIU Wei-hui, YANG Guang. Discussion on the strategy of improving the ESI potential discipline in colleges and universities[J]. Information Research, 2017(4): 44-47. doi: 10.3969/j.issn.1005-8095.2017.04.009
[7]	CSAJBOK E, BERHIDI A, VASAS L, et al. Hirsch-index for countries based on essential science indicators data[J]. Scientometrics, 2007, 73(1): 91-117. doi: 10.1007/s11192-007-1859-9
[8]	HIRSCH J E. An index to quantify an individual's scientific research output[J]. Proceedings of the National Academy of Sciences of the United States of America, 2005, 102(46): 16569. doi: 10.1073/pnas.0507655102
[9]	EGGHE L. Theory and practise of the G-index[J]. Scientometrics, 2006, 69(1): 131-152. doi: 10.1007/s11192-006-0144-7
[10]	DELGADO L C E, ROBINSON G N, TORRES S D. The Google scholar experiment: How to index false papers and manipulate bibliometric indicators[J]. Journal of the Association for Information Science and Technology, 2014, 65(3): 446-454. doi: 10.1002/asi.23056
[11]	BRIN S, PAGE L. The anatomy of a large-scale hypertextual web search engine[J]. Computer Networks and ISDN Systems, 1998, 30(1-7): 107-117. doi: 10.1016/S0169-7552(98)00110-X
[12]	王露, 郭强, 刘建国. 基于加权方法的节点重要性度量[J]. 计算机应用研究, 2018(5): 1426-1428. doi: 10.3969/j.issn.1001-3695.2018.05.031 WANG Lu, GUO Qiang, LIU Jian-guo. Node importance measure based on weighting method[J]. Journal of Computer Applications, 2018(5): 1426-1428. doi: 10.3969/j.issn.1001-3695.2018.05.031
[13]	顾亦然, 许梦馨. 基于PageRank的新闻关键词提取算法[J]. 电子科技大学学报, 2017, 46(5): 777-783. doi: 10.3969/j.issn.1001-0548.2017.05.021 GU Yi-ran, XU Meng-xin. News keyword extraction algorithm based on PageRank[J]. Journal of University of Electronic Science and Technology of China, 2017, 46(5): 777-783. doi: 10.3969/j.issn.1001-0548.2017.05.021
[14]	陈仕吉, 史丽文, 左文革. 科学合作网络中节点合作效果评测与分析[J]. 图书情报工作, 2012, 56(10): 61-143. CHEN Shi-ji, SHI Li-wen, ZUO Wen-ge. Evaluation and analysis of node cooperation effect in scientific cooperation network[J]. Library and Information Service, 2012, 56(10): 61-143.
[15]	CHARNES A, COOPER W W, RHODES E. Measuring the efficiency of decision making units[J]. European Journal of Operational Research, 1978, 2(6): 429-444. doi: 10.1016/0377-2217(78)90138-8
[16]	DE S P D J, BEAVER D. Collaboration in an invisible college[J]. American Psychologist, 1966, 21(11): 1011. doi: 10.1037/h0024051
[17]	苏芳荔. 科研合作对期刊论文被引频次的影响[J]. 图书情报工作, 2011, 55(10): 144-148. SU Fang-li. The influence of scientific research cooperation on the citation frequency of journal papers[J]. Library and Information Service, 2011, 55(10): 144-148.
[18]	何海燕, 李芳. 高校科研合作对论文产出质量的影响—基于国家重点实验室分析[J]. 北京理工大学学报(社会科学版), 2017, 19(5): 162-167. HE Hai-yan, LI Fang. The influence of scientific research cooperation on the output quality of papers-Based on the analysis of national key laboratories[J]. Journal of Beijing Institute of Technology (Social Science Edition), 2017, 19(5): 162-167.
[19]	王卫, 史锐涵, 潘京华. 基于期刊论文的作者学术合作与科研产出关系研究—以图书情报领域为例[J]. 情报杂志, 2017, 36(3): 191-195. doi: 10.3969/j.issn.1002-1965.2017.03.033 WANG Wei, SHI Rui-han, PAN Jing-hua. Research on the relationship between academic cooperation and scientific research output based on journal papers—Taking the field of library and information as an example[J]. Journal of Information, 2017, 36(3): 191-195. doi: 10.3969/j.issn.1002-1965.2017.03.033
[20]	BROWN S A, DENNIS A R, VENKATESH V. Predicting collaboration technology use: Integrating technology adoption and collaboration research[J]. Journal of Management Information Systems, 2010, 27(2): 9-54. doi: 10.2753/MIS0742-1222270201
[21]	SHEN H W, BARABASI A L. Collective credit allocation in science[J]. Proceedings of the National Academy of Sciences, 2014, 111(34): 12325-12330. doi: 10.1073/pnas.1401992111
[22]	刘静, 马建霞. 我国管理科学研究进展分析—以国家自然科学基金立项项目及论文产出为分析数据[J]. 科技管理研究, 2015, 35(326): 249-258. LIU Jing, MA Jian-xia. Analysis of the progress of management science research in China—Analysis of national natural science foundation projects and paper outputs as analysis data[J]. Science and Technology Management Research, 2015, 35(326): 249-258.
[23]	ZHANG Song-tao, GUAN Zhong-cheng. Education experience of scientific workforce—A case study on the winners of NSFDYS in CAS[J]. Forum on Science and Technology in China, 2015(12): 132-137.


Figures(4) / Tables(1)


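Modeling and analysis of citation networks have been widely used to evaluate the academic influence of scientists, research institutions, and even regions or countries. The number of times a paper is cited matters to scientists themselves and to decisions such as professional title evaluation and research awards^[1-3]. The results of citation network analysis have also been applied to research management policy and research incentive measures, and they are important for the development of disciplines^[4].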

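At present, methods for analyzing scientific citation networks fall into two categories: statistics-based evaluation methods and network-structure-based evaluation methods. Statistics-based methods include the Essential Science Indicators (ESI) database^[5-7], the total number of citations, the total number of published papers, the H-index^[8], the G-index^[9], and the i10-index^[10]. In 2001, the Institute for Scientific Information (ISI) in the United States proposed the ESI indicators to measure scientific research performance^[5-6]. ESI assesses the research level of countries and regions, the academic reputation of institutions, and the academic standing of journals from several aspects, including the total number of papers published, citation counts, and the average number of citations per paper. However, ESI only considers books and articles published in journals indexed by Thomson Reuters^[7], which limits its objectivity. Reference [8] proposed the H-index, which considers both citation counts and the number of papers. The H-index of a person or organization is the largest number H such that at least H of their published papers have each been cited at least H times. The higher a scientist's H-index, the greater the influence of their papers. However, the H-index cannot properly evaluate the work of scientists who have published only a few important papers. Reference [9] improved on the H-index and proposed the G-index, a method for assessing a scientist's influence based on the scholar's past contributions. Other indicators that refine or optimize the H-index have also been proposed. In 2011, Google proposed the i10-index^[10], the number of a scientist's papers that have been cited at least 10 times.

Network-structure-based methods include PageRank algorithms based on scientist collaboration networks^[11-12] and on citation networks^[13-14]. The PageRank algorithm on a collaboration network evaluates scientists through the collaborative relationships among them, and it mainly reflects a scientist's influence within the collaboration network. The PageRank algorithm on a citation network assesses a scientist's academic level using the citation relationships between papers and the citation network among scientists.

However, all of the methods above consider only the number of papers a scientist has published and the citations those papers received. They do not consider input costs such as communication and time. This paper therefore proposes an input-output model that considers both input and output variables to evaluate the performance of scientists comprehensively.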
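The three citation-count indices mentioned above can be sketched in a few lines. This is a minimal illustration, not code from the paper: the function names and the sample citation lists are hypothetical, and the G-index uses the standard definition (the largest g such that the g most-cited papers together have at least g² citations), which this section does not spell out.

```python
from typing import Sequence


def h_index(citations: Sequence[int]) -> int:
    """Largest h such that at least h papers have >= h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank
        else:
            break  # later papers have even fewer citations
    return h


def g_index(citations: Sequence[int]) -> int:
    """Largest g such that the g most-cited papers total >= g*g citations.

    Assumption: g is capped at the number of papers (a common convention).
    """
    ranked = sorted(citations, reverse=True)
    total = 0
    g = 0
    for rank, cites in enumerate(ranked, start=1):
        total += cites
        if total >= rank * rank:
            g = rank
    return g


def i10_index(citations: Sequence[int]) -> int:
    """Number of papers cited at least 10 times."""
    return sum(1 for cites in citations if cites >= 10)


# Hypothetical citation records, invented for illustration only.
steady = [12, 11, 10, 10, 9, 3, 1]   # many moderately cited papers
landmark = [500, 300]                # two very highly cited papers

print(h_index(steady), g_index(steady), i10_index(steady))
print(h_index(landmark), g_index(landmark), i10_index(landmark))
```

The second record shows the limitation noted in the text: a scientist with only two papers cannot exceed an H-index of 2, however often those papers are cited.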

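Suppose there are two scientists, A and B. Scientist A published one paper in collaboration with several scientists, while scientist B published one paper with a single collaborator, and the two papers have the same number of citations. Measured by indicators such as the H-index, A and B perform equally. However, A invested more than B and consumed more social resources. If B had the same social resources as A, B might produce more output. Taking both input and output factors into account, the main contribution of this paper is a performance algorithm for scientists that considers input as well as output. Alongside a scientist's research output, the algorithm accounts for input costs such as communication and time, and it models and evaluates performance from an input-output perspective. Experimental results on the APS empirical data set show that the proposed method identifies Nobel Prize winners more accurately: its AUC value is 0.7957, an accuracy improvement of 8.77% over an evaluation method that considers only total citations. In addition, for the APS data set, 64.29% of the scientists had higher input-output performance before winning the Nobel Prize than after winning it. For the Web of Science data set, 81.25% of the scientists had higher input-output performance before receiving the National Science Fund for Distinguished Young Scholars (杰青) than after receiving it.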
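The comparison of scientists A and B can be made concrete with a toy ratio. This section does not give the paper's actual model, so the sketch below is only an assumption-laden illustration: it divides a weighted sum of outputs (papers, citations) by a weighted sum of inputs (co-authors, institutions) with all weights fixed at 1. The `Record` class, the `toy_performance` function, the weights, and the numbers are all hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Record:
    """Hypothetical per-scientist totals; field names are illustrative."""
    coauthors: int      # input: number of co-authors
    institutions: int   # input: number of institutions involved
    papers: int         # output: number of co-published papers
    citations: int      # output: citations received by those papers


def toy_performance(
    record: Record,
    input_weights: Tuple[float, float] = (1.0, 1.0),
    output_weights: Tuple[float, float] = (1.0, 1.0),
) -> float:
    """Weighted outputs divided by weighted inputs.

    Assumption: fixed unit weights. This is NOT the paper's model, only a
    generic efficiency ratio used to illustrate the A-versus-B argument.
    """
    weighted_input = (input_weights[0] * record.coauthors
                      + input_weights[1] * record.institutions)
    weighted_output = (output_weights[0] * record.papers
                       + output_weights[1] * record.citations)
    if weighted_input <= 0:
        raise ValueError("weighted input must be positive")
    return weighted_output / weighted_input


# Same output (one paper, 20 citations) but different inputs.
scientist_a = Record(coauthors=5, institutions=3, papers=1, citations=20)
scientist_b = Record(coauthors=1, institutions=1, papers=1, citations=20)

print(toy_performance(scientist_a))  # 21 / 8
print(toy_performance(scientist_b))  # 21 / 2
```

Under these assumed weights B scores higher than A, although any index based only on papers and citations would rate them the same.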

3. Conclusion

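This paper proposes a performance evaluation algorithm that considers both the input and the output of scientists. When evaluating a scientist's performance, it is necessary to consider not only output, such as published papers and their influence, but also the effort the scientist invests. For example, scientists need to spend a great deal of time communicating and negotiating before they can collaborate. The paper therefore takes input factors such as the number of collaborating scientists and the number of collaborating institutions into account to evaluate input-output performance comprehensively. Experimental results on the American Physical Society (APS) data set, which contains nearly a century of data, show that the AUC value of the proposed method is 0.7957, an accuracy improvement of 8.77% over ranking by total citations. In addition, experiments on input-output performance before and after winning an award show that most scientists had higher input-output performance before the award than after it.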
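For readers unfamiliar with the AUC figure, the sketch below shows one standard way to compute it for a ranking task: the fraction of (award winner, non-winner) pairs in which the winner receives the higher score, with ties counted as one half. The text does not say how the authors computed AUC, so this is an assumption; the function name and the scores are hypothetical.

```python
from typing import Sequence


def pairwise_auc(winner_scores: Sequence[float],
                 other_scores: Sequence[float]) -> float:
    """Probability that a random winner outscores a random non-winner.

    Ties contribute 0.5. Runs in O(len(winner_scores) * len(other_scores)),
    which is fine for a small illustration.
    """
    if not winner_scores or not other_scores:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for w in winner_scores:
        for o in other_scores:
            if w > o:
                wins += 1.0
            elif w == o:
                wins += 0.5
    return wins / (len(winner_scores) * len(other_scores))


# Invented performance scores, for illustration only.
winners = [0.9, 0.6, 0.4]
others = [0.7, 0.5, 0.4, 0.2]

print(pairwise_auc(winners, others))
```

A value of 0.5 corresponds to a ranking no better than chance, and 1.0 means every winner is ranked above every non-winner.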

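The input-output performance algorithm depends on the choice of input and output factors, so more input factors could be studied to make the ranking of scientists more accurate. For example, a scientist's input-output performance depends to some extent on the research topic, yet the method in this paper does not consider the research topic as an input variable. Meaningful research topics may also lead to more output (number of published papers), so future work will consider adding research topics to the study of input-output performance^[22-23]. Beyond this, there are many possible reasons for the difference in performance before and after an award, for example that the scientist was already old when receiving the award, or that the value of papers published after the award has not yet been fully realized. The method in this paper does not account for these factors.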

