Developing a Prediction Model for Author Collaboration in Bioinformatics Research Using Graph Mining Techniques and Big Data Applications

Nowadays, scientific collaboration has dramatically increased due to web-based technologies, advanced communication systems, and information and scientific databases. The present study aims to provide a predictive model for author collaborations in bioinformatics research output using graph mining t...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of information science and management Vol. 19; no. 2; p. 1
Main Authors Ebrahimi, Fezzeh, Asemi, Asefeh, Shabani, Ahmad, Nezarat, Amin
Format Journal Article
LanguageEnglish
Published Shiraz Regional Information Center for Science and Technology 01.01.2021
Subjects
Online AccessGet full text
ISSN2008-8302
2008-8310

Cover

Loading…
Abstract Nowadays, scientific collaboration has dramatically increased due to web-based technologies, advanced communication systems, and information and scientific databases. The present study aims to provide a predictive model for author collaborations in bioinformatics research output using graph mining techniques and big data applications. The study is applied-developmental research adopting a mixed-method approach, i.e., a mix of quantitative and qualitative measures. The research population consisted of all bioinformatics research documents indexed in PubMed (n=699160). The correlations of bioinformatics articles were examined in terms of weight and strength based on article sections including title, abstract, keywords, journal title, and author affiliation using graph mining techniques and big data applications. Eventually, the prediction model of author collaboration in bioinformatics research was developed using the abovementioned tools and expert-assigned weights. The calculations and data analysis were carried out using Expert Choice, Excel, Spark, and Scala, and Python programming languages in a big data server. Accordingly, the research was conducted in three phases: 1) identifying and weighting the factors contributing to authors' similarity measurement; 2) implementing co-authorship prediction model; and 3) integrating the first and second phases (i.e., integrating the weights obtained in the previous phases). The results showed that journal title, citation, article title, author affiliation, keywords, and abstract scored 0.374, 0.374, 0.091, 0.075, 0.055, and 0.031. Moreover, the journal title achieved the highest score in the model for the co-author recommender system. As the data in bibliometric information networks is static, it was proved remarkably effective to use content-based features for similarity measures. So that the recommender system can offer the most suitable collaboration suggestions. It is expected that the model works efficiently in other databases and provides suitable recommendations for author collaborations in other subject areas. By integrating expert opinion and systemic weights, the model can help alleviate the current information overload and facilitate collaborator lookup by authors.
AbstractList Nowadays, scientific collaboration has dramatically increased due to web-based technologies, advanced communication systems, and information and scientific databases. The present study aims to provide a predictive model for author collaborations in bioinformatics research output using graph mining techniques and big data applications. The study is applied-developmental research adopting a mixed-method approach, i.e., a mix of quantitative and qualitative measures. The research population consisted of all bioinformatics research documents indexed in PubMed (n=699160). The correlations of bioinformatics articles were examined in terms of weight and strength based on article sections including title, abstract, keywords, journal title, and author affiliation using graph mining techniques and big data applications. Eventually, the prediction model of author collaboration in bioinformatics research was developed using the abovementioned tools and expert-assigned weights. The calculations and data analysis were carried out using Expert Choice, Excel, Spark, and Scala, and Python programming languages in a big data server. Accordingly, the research was conducted in three phases: 1) identifying and weighting the factors contributing to authors' similarity measurement; 2) implementing co-authorship prediction model; and 3) integrating the first and second phases (i.e., integrating the weights obtained in the previous phases). The results showed that journal title, citation, article title, author affiliation, keywords, and abstract scored 0.374, 0.374, 0.091, 0.075, 0.055, and 0.031. Moreover, the journal title achieved the highest score in the model for the co-author recommender system. As the data in bibliometric information networks is static, it was proved remarkably effective to use content-based features for similarity measures. So that the recommender system can offer the most suitable collaboration suggestions. It is expected that the model works efficiently in other databases and provides suitable recommendations for author collaborations in other subject areas. By integrating expert opinion and systemic weights, the model can help alleviate the current information overload and facilitate collaborator lookup by authors.
Author Ebrahimi, Fezzeh
Asemi, Asefeh
Nezarat, Amin
Shabani, Ahmad
Author_xml – sequence: 1
  givenname: Fezzeh
  surname: Ebrahimi
  fullname: Ebrahimi, Fezzeh
– sequence: 2
  givenname: Asefeh
  surname: Asemi
  fullname: Asemi, Asefeh
– sequence: 3
  givenname: Ahmad
  surname: Shabani
  fullname: Shabani, Ahmad
– sequence: 4
  givenname: Amin
  surname: Nezarat
  fullname: Nezarat, Amin
BookMark eNo9jk1OwzAQhS1UJErpHSyxjmTHTeIsSwulUisQKutqbE8aV8EOdsIduDVJQczmzc-bb-aWTJx3eEWmKWMykYKzyX_O0hsyj_HMGONiwWWeTcn3Gr-w8a11Jwr0NaCxurPe0b032NDKB7rsu3qQlW8aUD7AZWwdfbDeusHwMXR0pG8YEYKu6XscYZsAbU331o3FAXXt7GePkYIzw-aJrqEDumzbxuoLMd6R6wqaiPM_nZHD0-Nh9ZzsXjbb1XKXtKXskkUFILAoU6FKxnJQIHVpBEeZIShdCJkvVCa1MlipggnGcxCgVGl0wc3gnJH7X2wb_PhQdzz7Prjh4jHNZJHyMcQPLV5kBQ
ContentType Journal Article
Copyright Copyright Regional Information Center for Science and Technology 2021
Copyright_xml – notice: Copyright Regional Information Center for Science and Technology 2021
DBID E3H
F2A
DatabaseName Library & Information Sciences Abstracts (LISA)
Library & Information Science Abstracts (LISA)
DatabaseTitle Library and Information Science Abstracts (LISA)
DatabaseTitleList Library and Information Science Abstracts (LISA)
DeliveryMethod fulltext_linktorsrc
Discipline Library & Information Science
EISSN 2008-8310
GroupedDBID .4I
5VS
ABDBF
ALMA_UNASSIGNED_HOLDINGS
E3H
ELW
EOJEC
F2A
GROUPED_DOAJ
KQ8
OBODZ
OK1
RNS
ID FETCH-LOGICAL-p98t-4faa3e7923b9006aba8c9d31e85eabc73864b58cbdefb703016a3abb9dc71d9d3
ISSN 2008-8302
IngestDate Mon Jun 30 04:25:29 EDT 2025
IsPeerReviewed false
IsScholarly true
Issue 2
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-p98t-4faa3e7923b9006aba8c9d31e85eabc73864b58cbdefb703016a3abb9dc71d9d3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
PQID 2587211111
PQPubID 2035601
ParticipantIDs proquest_journals_2587211111
PublicationCentury 2000
PublicationDate 20210101
PublicationDateYYYYMMDD 2021-01-01
PublicationDate_xml – month: 01
  year: 2021
  text: 20210101
  day: 01
PublicationDecade 2020
PublicationPlace Shiraz
PublicationPlace_xml – name: Shiraz
PublicationTitle International journal of information science and management
PublicationYear 2021
Publisher Regional Information Center for Science and Technology
Publisher_xml – name: Regional Information Center for Science and Technology
SSID ssj0001341865
Score 2.2067997
Snippet Nowadays, scientific collaboration has dramatically increased due to web-based technologies, advanced communication systems, and information and scientific...
SourceID proquest
SourceType Aggregation Database
StartPage 1
SubjectTerms Bibliometrics
Big Data
Bioinformatics
Co authorship
Collaboration
Recommender systems
Title Developing a Prediction Model for Author Collaboration in Bioinformatics Research Using Graph Mining Techniques and Big Data Applications
URI https://www.proquest.com/docview/2587211111
Volume 19
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Jb9NAFB6lPXFBrGJp0TsgLpaRvMWeYygJBbVFoq6UWzSbEyPFoNa95B_0R_W_8WZxZtIiBOTgWM_beN6nmTef30LI25RlaHdkKmZZSuO8HPOYUVSIFKwqFS2UMB_aT8_Gxxf5l3kxH41uA6-l656_F5vfxpX8j1ZRhnrVUbL_oNntTVGA-6hf3KKGcftXOv7oI56YdqaQrS38rQucmbjESDNg-He0o-220yUoXcpUk6Z58L-LrAfBJ53FOjo1tSMs-a6zvNpszh_aJUKlZ9Z-DQm_794p3nOMO5kptpGS0RBNpO-4vueBM8Ul_KpdG0eDmdps1JaznlwpK8adxovPV4zb4lTRZLVm0lPcG53a3MjXLse4ozjS5A7F8U0tbZM_B-3U5Le6NF15HjT5zgcJM5AaFw-d58zOeaHMudQOMwENEJ8Gw3riZ8vBQ-Ds62J2cXKyqKfzeo_sZUkerOcNwYf2QWVKmW6ff2--N0ZM_Yg8dKsPmFgoPSYj1T0hhy52Bd5B8Org3vcpufEwAwYeZmBgBngFWJjBDsyg7WAXZjDADAzMwMAMLMzAwwywh_HKJWiYQQizZ6SeTeuj49hV8Ih_0qqP84axTOkMlZzi6I5IqASVWaKqQjEudL3ZnBeV4FI1XE89yZhljHMqRZlIPPM52e9-dOoFgUKVCS1EzkSDa2yZVpyPsxTt2ZKlTSmrl-Rg6NqFQ_bVIi0qTXDg79WfD78mDzzuDsh-f3mtDtHY7Pkbo9RfcjGRBQ
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Developing+a+Prediction+Model+for+Author+Collaboration+in+Bioinformatics+Research+Using+Graph+Mining+Techniques+and+Big+Data+Applications&rft.jtitle=International+journal+of+information+science+and+management&rft.au=Ebrahimi%2C+Fezzeh&rft.au=Asemi%2C+Asefeh&rft.au=Shabani%2C+Ahmad&rft.au=Nezarat%2C+Amin&rft.date=2021-01-01&rft.pub=Regional+Information+Center+for+Science+and+Technology&rft.issn=2008-8302&rft.eissn=2008-8310&rft.volume=19&rft.issue=2&rft.spage=1&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2008-8302&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2008-8302&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2008-8302&client=summon