Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades

Hemophilia is an inherited blood coagulation disorder caused by mutations on the coagulation factors VIII or IX genes. Although it is a relatively rare disease, the research community is actively working on this topic, producing almost 6000 manuscripts in the last 5 years. Given that the scientific...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics and biology insights Vol. 16; p. 11779322221125604
Main Authors Lopes, Tiago JS, Rios, Ricardo, Nogueira, Tatiane
Format Journal Article
LanguageEnglish
Published London, England SAGE Publications 01.09.2022
Sage Publications Ltd
SAGE Publishing
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Hemophilia is an inherited blood coagulation disorder caused by mutations on the coagulation factors VIII or IX genes. Although it is a relatively rare disease, the research community is actively working on this topic, producing almost 6000 manuscripts in the last 5 years. Given that the scientific literature is increasing so rapidly, even the most avid reader will find it difficult to follow it closely. In this study, we used sophisticated computational techniques to map the hemophilia literature of the last 60 years. We created a network structure to represent authorship collaborations, where the nodes are the researchers and 2 nodes are connected if they co-authored a manuscript. We accurately identified author clusters, namely, researchers who have collaborated systematically for several years, and used text mining techniques to automatically synthesize their research specialties. Overall, this study serves as a historical appreciation of the effort of thousands of hemophilia researchers and demonstrates that a computational framework is able to automatically identify collaboration networks and their research specialties. Importantly, we made all datasets and source code available for the community, and we anticipate that the methods introduced here will pave the way for the development of systems that generate compelling hypothesis based on patterns that are imperceptible to human researchers.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1177-9322
1177-9322
DOI:10.1177/11779322221125604