This item is licensed under a Creative Commons License (CC BY).

Title
The Impact of Name Ambiguity on Properties of Coauthorship Networks
Author(s)
Jinseok Kim; Heejun Kim; Jana Diesner
Publication Year
2014-06-30
Abstract
Initial-based disambiguation of author names is a common data pre-processing step in bibliometrics. It is widely accepted that this procedure can introduce errors into network data and any subsequent analytical results. What is not sufficiently understood is the precise impact of this step on the data and findings. We present an empirical answer to this question by comparing the impact of two commonly used initial-based disambiguation methods against a reasonable proxy for ground truth data. We use DBLP, a database covering major journals and conferences in computer science and information science, as a source. We find that initial-based disambiguation induces strong distortions in network metrics at the graph and node levels: authors become embedded in ties for which there is no empirical support, thus increasing their sphere of influence and diversity of involvement. Consequently, networks generated with initial-based disambiguation are more coherent and interconnected than the actual underlying networks, and individual authors appear to be more productive and more strongly embedded than they actually are.
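The merging effect described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy paper list, the helper names, the choice of first-initial and all-initials keys as the two variants (the abstract does not name them), and the use of full name strings as the reference identities. It is not the authors' procedure or their DBLP data.

```python
from itertools import combinations

def first_initial_key(name):
    """Hypothetical key: surname plus first initial ('Wei Min Wang' -> 'Wang, W')."""
    parts = name.split()
    return f"{parts[-1]}, {parts[0][0]}"

def all_initials_key(name):
    """Hypothetical key: surname plus all given-name initials ('Wei Min Wang' -> 'Wang, WM')."""
    parts = name.split()
    return f"{parts[-1]}, {''.join(p[0] for p in parts[:-1])}"

def coauthor_edges(papers, key=lambda n: n):
    """Build the set of undirected coauthorship edges after mapping names through `key`."""
    edges = set()
    for authors in papers:
        nodes = sorted({key(a) for a in authors})
        edges.update(combinations(nodes, 2))
    return edges

def degree(edges):
    """Count the number of distinct coauthors per node."""
    deg = {}
    for u, v in edges:
        deg[u] = deg.get(u, 0) + 1
        deg[v] = deg.get(v, 0) + 1
    return deg

# Toy data (invented): three distinct authors who share a surname and first initial.
papers = [
    ["Wei Wang", "Alice Smith"],
    ["Wen Wang", "Bob Jones"],
    ["Wei Min Wang", "Carol Lee"],
]

full_deg = degree(coauthor_edges(papers))
first_deg = degree(coauthor_edges(papers, first_initial_key))
all_deg = degree(coauthor_edges(papers, all_initials_key))

print(len(full_deg), max(full_deg.values()))    # node count and max degree, full names
print(len(first_deg), max(first_deg.values()))  # same, first-initial keys
print(len(all_deg), max(all_deg.values()))      # same, all-initials keys
```

In this toy case the three Wang authors each have one coauthor under full names, but first-initial keying collapses them into a single node with three coauthors, and all-initials keying still merges two of them. That is the kind of unsupported tie and inflated embeddedness the abstract reports.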
Keyword
bibliometrics; name ambiguity; initial-based disambiguation; coauthorship networks; collaboration networks
Journal Title
Journal of Information Science Theory and Practice
Citation Volume
2
ISSN
2287-4577
DOI
10.1633/JISTaP.2014.2.2.1
Files in This Item:
E1JSCH_2014_v2n2_6.pdf (428.15 kB)
Appears in Collections:
8. KISTI Publications > JISTaP > Vol. 2 - No. 2
Type
Article
URI
https://repository.kisti.re.kr/handle/10580/8651