Journal Article Details
Brazilian Computer Society. Journal
Detecting referential inconsistencies in electronic CV datasets
Vanessa Braganholo [1]; Ivison C. Rubim [2]
[1] Institute of Computing, Fluminense Federal University (UFF), Niterói, Brazil
[2] NCE, Federal University of Rio de Janeiro (UFRJ), Rio de Janeiro, Brazil
Keywords: Electronic curricula; Lattes; Inconsistency; Similarity
DOI  :  10.1186/s13173-017-0052-0
Subject classification: Agricultural Sciences (General)
Source: Springer UK
【 Abstract 】

One way to measure a country's scientific progress is to evaluate the curricula vitae (CVs) of its researchers, and Brazil is no exception. The Lattes Platform is an information system whose primary objective is to provide a single repository for the CVs of Brazilian researchers. It is steadily gaining prominence as the main source of information about the Brazilian community of researchers, students, managers, and other actors in the national system of science, technology, and innovation. However, the integrity of this important tool for gauging national bibliographic production may be compromised by ambiguities or referential inconsistencies in coauthorship citations. A first step towards solving this problem is to identify such inconsistencies. To that end, we propose a heuristic-based approach that uses similarity search to match papers across the CVs of coauthors. We then use this technique to analyze over 2000 CVs of researchers from a given institution, retrieved from the Lattes Platform. The results indicate that 18.98% of the analyzed publications present referential inconsistencies, a significant share for a dataset that is supposed to be correct and trustworthy.
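
To make the matching idea concrete, the following is a minimal sketch of similarity-based matching between the CVs of coauthors. It is not the authors' implementation. The abstract does not name the similarity measure, the threshold, or the inconsistency categories, so everything below is an assumption made for illustration: the standard-library `difflib.SequenceMatcher` ratio stands in for the similarity function, `0.85` is an arbitrary threshold, the labels `consistent`, `divergent`, and `missing` are hypothetical, and the `cvs` structure (researcher id mapped to a list of title and coauthor-id pairs) is invented here.

```python
from difflib import SequenceMatcher
import unicodedata


def normalize(title):
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", title)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    cleaned = "".join(ch if ch.isalnum() else " " for ch in no_accents.lower())
    return " ".join(cleaned.split())


def similarity(a, b):
    """Similarity in [0, 1] between two normalized titles (assumed measure)."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def check_reference(title, coauthor_titles, threshold=0.85):
    """Classify how a paper from one CV appears in a coauthor's CV.

    Returns a (status, best_matching_title) pair. The status labels are
    hypothetical: 'consistent' (identical after normalization), 'divergent'
    (similar but not identical), or 'missing' (no candidate above threshold).
    """
    best_title, best_score = None, 0.0
    for candidate in coauthor_titles:
        score = similarity(title, candidate)
        if score > best_score:
            best_title, best_score = candidate, score
    if best_score == 1.0:
        return "consistent", best_title
    if best_score >= threshold:
        return "divergent", best_title
    return "missing", None


def find_inconsistencies(cvs, threshold=0.85):
    """cvs maps a researcher id to a list of (title, coauthor_ids) pairs."""
    report = []
    for owner, papers in cvs.items():
        for title, coauthors in papers:
            for coauthor in coauthors:
                if coauthor not in cvs:
                    continue  # coauthor has no CV in the dataset
                their_titles = [t for t, _ in cvs[coauthor]]
                status, match = check_reference(title, their_titles, threshold)
                if status != "consistent":
                    report.append((owner, coauthor, title, status, match))
    return report
```

Under these assumptions, a paper listed in researcher A's CV with coauthor B is looked up in B's CV. A close but non-identical title is flagged as divergent, and a paper with no sufficiently similar counterpart is flagged as missing. A real pipeline would likely also compare year, venue, and author lists, which this sketch omits.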

【 License 】

CC BY   
