Journal Article Details
EPJ Data Science
Investigating the contribution of author- and publication-specific features to scholars’ h-index prediction
Regular Article
Fakhri Momeni [1], Philipp Mayr [1], Stefan Dietze [2]
[1] GESIS – Leibniz Institute for the Social Sciences, Unter Sachsenhausen 6-8, 50667, Cologne, Germany; [2] Heinrich-Heine-University, Universitätsstr. 1, 40225, Düsseldorf, Germany
Keywords: h-index prediction; Feature importance; Academic mobility; Machine learning; Open access publishing
DOI: 10.1140/epjds/s13688-023-00421-6
Received 2022-07-08, accepted 2023-09-23, published 2023
Source: Springer
【 Abstract 】

Evaluating researchers’ output is vital for hiring committees and funding bodies, and it is usually measured via scientific productivity, citations, or a combined metric such as the h-index. Assessing young researchers is more critical because citations, and with them the h-index, take time to accumulate. Hence, predicting the h-index can help to gauge researchers’ scientific impact early. In addition, identifying the factors that influence this prediction helps researchers and their organizations find ways to improve their impact. This study investigates the effect of author-, paper-, and venue-specific features on the future h-index. For this purpose, we used a machine learning approach to predict the h-index and feature analysis techniques to advance the understanding of feature impact. Using bibliometric data from Scopus, we defined and extracted two main groups of features. The first, ‘prior impact-based features’, relates to prior scientific impact and includes the number of publications, received citations, and h-index. The second, ‘non-prior impact-based features’, contains features related to author, co-authorship, paper, and venue characteristics. We explored their importance in predicting researchers’ h-index in three career phases. We also examined the temporal dimension of prediction performance for the different feature categories to find out which features are more reliable for long- and short-term prediction. We included the authors’ gender to examine the role of this author characteristic in the prediction task. Our findings showed that gender has only a very slight effect on predicting the h-index. Although models containing prior impact-based features perform better for all researcher groups in the near future, we found that non-prior impact-based features are more robust predictors for younger scholars in the long term. Moreover, prior impact-based features lose more of their predictive power in the long term than other features do.
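
For readers unfamiliar with the prediction target, the following is a minimal sketch of how an h-index can be computed from per-paper citation counts, using the standard definition (the largest h such that at least h papers have at least h citations each). It is not taken from the paper; the function name `h_index` and the sample citation counts are illustrative assumptions.

```python
def h_index(citations):
    """Return the largest h such that at least h papers have >= h citations each.

    `citations` is any iterable of non-negative per-paper citation counts.
    This helper is a hypothetical illustration, not the authors' code.
    """
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            # The paper at this rank still has at least `rank` citations.
            h = rank
        else:
            # Counts only decrease from here, so h cannot grow further.
            break
    return h


# Made-up citation counts for one author's five papers.
example_citations = [10, 8, 5, 4, 3]
print(h_index(example_citations))  # prints 4
```

In the example, four papers have at least four citations each, but there are not five papers with at least five, so the h-index is 4. This also shows why the metric grows slowly for early-career authors: it is capped both by the number of papers and by the citations each has had time to collect.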

【 License 】

CC BY   
© Springer-Verlag GmbH, DE 2023
