ANDez: An open-source tool for author name disambiguation using machine learning
•ANDez consolidates multiple ML techniques for disambiguation.•Built using Python and popular ML libraries.•Provides a unified platform to evaluate and refine ML methods.•Assists scholars with limited ML expertise in bibliographic data analysis. Author name disambiguation in bibliographic data is ch...
Saved in:
Published in | SoftwareX Vol. 26; p. 101719 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Elsevier B.V
01.05.2024
Elsevier |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | •ANDez consolidates multiple ML techniques for disambiguation.•Built using Python and popular ML libraries.•Provides a unified platform to evaluate and refine ML methods.•Assists scholars with limited ML expertise in bibliographic data analysis.
Author name disambiguation in bibliographic data is challenging due to the same names of different authors and name variations of authors. Various machine learning (ML) methods address this, but a unified framework for comparing them is lacking. This study introduces ANDez, an open-source tool that integrates top-performing ML techniques for author name disambiguation. Developed in Python using popular ML libraries, ANDez provides a transparent system, merging complex procedures from different ML approaches. This promotes the assessment, modification, and benchmarking of ML techniques in author name disambiguation. ANDez's user-friendly design also helps researchers analyze ambiguous bibliographic data without needing advanced ML coding expertise. |
---|---|
ISSN: | 2352-7110 2352-7110 |
DOI: | 10.1016/j.softx.2024.101719 |