Developing a Prediction Model for Author Collaboration in Bioinformatics Research Using Graph Mining Techniques and Big Data Applications
Nowadays, scientific collaboration has dramatically increased due to web-based technologies, advanced communication systems, and information and scientific databases. The present study aims to provide a predictive model for author collaborations in bioinformatics research output using graph mining t...
Saved in:
Published in | International journal of information science and management Vol. 19; no. 2; p. 1 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Shiraz
Regional Information Center for Science and Technology
01.01.2021
|
Subjects | |
Online Access | Get full text |
ISSN | 2008-8302 2008-8310 |
Cover
Loading…
Abstract | Nowadays, scientific collaboration has dramatically increased due to web-based technologies, advanced communication systems, and information and scientific databases. The present study aims to provide a predictive model for author collaborations in bioinformatics research output using graph mining techniques and big data applications. The study is applied-developmental research adopting a mixed-method approach, i.e., a mix of quantitative and qualitative measures. The research population consisted of all bioinformatics research documents indexed in PubMed (n=699160). The correlations of bioinformatics articles were examined in terms of weight and strength based on article sections including title, abstract, keywords, journal title, and author affiliation using graph mining techniques and big data applications. Eventually, the prediction model of author collaboration in bioinformatics research was developed using the abovementioned tools and expert-assigned weights. The calculations and data analysis were carried out using Expert Choice, Excel, Spark, and Scala, and Python programming languages in a big data server. Accordingly, the research was conducted in three phases: 1) identifying and weighting the factors contributing to authors' similarity measurement; 2) implementing co-authorship prediction model; and 3) integrating the first and second phases (i.e., integrating the weights obtained in the previous phases). The results showed that journal title, citation, article title, author affiliation, keywords, and abstract scored 0.374, 0.374, 0.091, 0.075, 0.055, and 0.031. Moreover, the journal title achieved the highest score in the model for the co-author recommender system. As the data in bibliometric information networks is static, it was proved remarkably effective to use content-based features for similarity measures. So that the recommender system can offer the most suitable collaboration suggestions. It is expected that the model works efficiently in other databases and provides suitable recommendations for author collaborations in other subject areas. By integrating expert opinion and systemic weights, the model can help alleviate the current information overload and facilitate collaborator lookup by authors. |
---|---|
AbstractList | Nowadays, scientific collaboration has dramatically increased due to web-based technologies, advanced communication systems, and information and scientific databases. The present study aims to provide a predictive model for author collaborations in bioinformatics research output using graph mining techniques and big data applications. The study is applied-developmental research adopting a mixed-method approach, i.e., a mix of quantitative and qualitative measures. The research population consisted of all bioinformatics research documents indexed in PubMed (n=699160). The correlations of bioinformatics articles were examined in terms of weight and strength based on article sections including title, abstract, keywords, journal title, and author affiliation using graph mining techniques and big data applications. Eventually, the prediction model of author collaboration in bioinformatics research was developed using the abovementioned tools and expert-assigned weights. The calculations and data analysis were carried out using Expert Choice, Excel, Spark, and Scala, and Python programming languages in a big data server. Accordingly, the research was conducted in three phases: 1) identifying and weighting the factors contributing to authors' similarity measurement; 2) implementing co-authorship prediction model; and 3) integrating the first and second phases (i.e., integrating the weights obtained in the previous phases). The results showed that journal title, citation, article title, author affiliation, keywords, and abstract scored 0.374, 0.374, 0.091, 0.075, 0.055, and 0.031. Moreover, the journal title achieved the highest score in the model for the co-author recommender system. As the data in bibliometric information networks is static, it was proved remarkably effective to use content-based features for similarity measures. So that the recommender system can offer the most suitable collaboration suggestions. It is expected that the model works efficiently in other databases and provides suitable recommendations for author collaborations in other subject areas. By integrating expert opinion and systemic weights, the model can help alleviate the current information overload and facilitate collaborator lookup by authors. |
Author | Ebrahimi, Fezzeh Asemi, Asefeh Nezarat, Amin Shabani, Ahmad |
Author_xml | – sequence: 1 givenname: Fezzeh surname: Ebrahimi fullname: Ebrahimi, Fezzeh – sequence: 2 givenname: Asefeh surname: Asemi fullname: Asemi, Asefeh – sequence: 3 givenname: Ahmad surname: Shabani fullname: Shabani, Ahmad – sequence: 4 givenname: Amin surname: Nezarat fullname: Nezarat, Amin |
BookMark | eNo9jk1OwzAQhS1UJErpHSyxjmTHTeIsSwulUisQKutqbE8aV8EOdsIduDVJQczmzc-bb-aWTJx3eEWmKWMykYKzyX_O0hsyj_HMGONiwWWeTcn3Gr-w8a11Jwr0NaCxurPe0b032NDKB7rsu3qQlW8aUD7AZWwdfbDeusHwMXR0pG8YEYKu6XscYZsAbU331o3FAXXt7GePkYIzw-aJrqEDumzbxuoLMd6R6wqaiPM_nZHD0-Nh9ZzsXjbb1XKXtKXskkUFILAoU6FKxnJQIHVpBEeZIShdCJkvVCa1MlipggnGcxCgVGl0wc3gnJH7X2wb_PhQdzz7Prjh4jHNZJHyMcQPLV5kBQ |
ContentType | Journal Article |
Copyright | Copyright Regional Information Center for Science and Technology 2021 |
Copyright_xml | – notice: Copyright Regional Information Center for Science and Technology 2021 |
DBID | E3H F2A |
DatabaseName | Library & Information Sciences Abstracts (LISA) Library & Information Science Abstracts (LISA) |
DatabaseTitle | Library and Information Science Abstracts (LISA) |
DatabaseTitleList | Library and Information Science Abstracts (LISA) |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Library & Information Science |
EISSN | 2008-8310 |
GroupedDBID | .4I 5VS ABDBF ALMA_UNASSIGNED_HOLDINGS E3H ELW EOJEC F2A GROUPED_DOAJ KQ8 OBODZ OK1 RNS |
ID | FETCH-LOGICAL-p98t-4faa3e7923b9006aba8c9d31e85eabc73864b58cbdefb703016a3abb9dc71d9d3 |
ISSN | 2008-8302 |
IngestDate | Mon Jun 30 04:25:29 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | true |
Issue | 2 |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-p98t-4faa3e7923b9006aba8c9d31e85eabc73864b58cbdefb703016a3abb9dc71d9d3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
PQID | 2587211111 |
PQPubID | 2035601 |
ParticipantIDs | proquest_journals_2587211111 |
PublicationCentury | 2000 |
PublicationDate | 20210101 |
PublicationDateYYYYMMDD | 2021-01-01 |
PublicationDate_xml | – month: 01 year: 2021 text: 20210101 day: 01 |
PublicationDecade | 2020 |
PublicationPlace | Shiraz |
PublicationPlace_xml | – name: Shiraz |
PublicationTitle | International journal of information science and management |
PublicationYear | 2021 |
Publisher | Regional Information Center for Science and Technology |
Publisher_xml | – name: Regional Information Center for Science and Technology |
SSID | ssj0001341865 |
Score | 2.2067997 |
Snippet | Nowadays, scientific collaboration has dramatically increased due to web-based technologies, advanced communication systems, and information and scientific... |
SourceID | proquest |
SourceType | Aggregation Database |
StartPage | 1 |
SubjectTerms | Bibliometrics Big Data Bioinformatics Co authorship Collaboration Recommender systems |
Title | Developing a Prediction Model for Author Collaboration in Bioinformatics Research Using Graph Mining Techniques and Big Data Applications |
URI | https://www.proquest.com/docview/2587211111 |
Volume | 19 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Jb9NAFB6lPXFBrGJp0TsgLpaRvMWeYygJBbVFoq6UWzSbEyPFoNa95B_0R_W_8WZxZtIiBOTgWM_beN6nmTef30LI25RlaHdkKmZZSuO8HPOYUVSIFKwqFS2UMB_aT8_Gxxf5l3kxH41uA6-l656_F5vfxpX8j1ZRhnrVUbL_oNntTVGA-6hf3KKGcftXOv7oI56YdqaQrS38rQucmbjESDNg-He0o-220yUoXcpUk6Z58L-LrAfBJ53FOjo1tSMs-a6zvNpszh_aJUKlZ9Z-DQm_794p3nOMO5kptpGS0RBNpO-4vueBM8Ul_KpdG0eDmdps1JaznlwpK8adxovPV4zb4lTRZLVm0lPcG53a3MjXLse4ozjS5A7F8U0tbZM_B-3U5Le6NF15HjT5zgcJM5AaFw-d58zOeaHMudQOMwENEJ8Gw3riZ8vBQ-Ds62J2cXKyqKfzeo_sZUkerOcNwYf2QWVKmW6ff2--N0ZM_Yg8dKsPmFgoPSYj1T0hhy52Bd5B8Org3vcpufEwAwYeZmBgBngFWJjBDsyg7WAXZjDADAzMwMAMLMzAwwywh_HKJWiYQQizZ6SeTeuj49hV8Ih_0qqP84axTOkMlZzi6I5IqASVWaKqQjEudL3ZnBeV4FI1XE89yZhljHMqRZlIPPM52e9-dOoFgUKVCS1EzkSDa2yZVpyPsxTt2ZKlTSmrl-Rg6NqFQ_bVIi0qTXDg79WfD78mDzzuDsh-f3mtDtHY7Pkbo9RfcjGRBQ |
linkProvider | Directory of Open Access Journals |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Developing+a+Prediction+Model+for+Author+Collaboration+in+Bioinformatics+Research+Using+Graph+Mining+Techniques+and+Big+Data+Applications&rft.jtitle=International+journal+of+information+science+and+management&rft.au=Ebrahimi%2C+Fezzeh&rft.au=Asemi%2C+Asefeh&rft.au=Shabani%2C+Ahmad&rft.au=Nezarat%2C+Amin&rft.date=2021-01-01&rft.pub=Regional+Information+Center+for+Science+and+Technology&rft.issn=2008-8302&rft.eissn=2008-8310&rft.volume=19&rft.issue=2&rft.spage=1&rft.externalDBID=NO_FULL_TEXT |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2008-8302&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2008-8302&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2008-8302&client=summon |