Revisiting subject classification in academic databases: A comparison of the classification accuracy of Web of Science, Scopus & Dimensions

Classification of research articles into different subject areas is an extremely important task in bibliometric analysis and information retrieval. There are primarily two kinds of subject classification approaches used in different academic databases: journal-based (aka source-level) and article-ba...

Full description

Saved in:
Bibliographic Details
Published inJournal of intelligent & fuzzy systems Vol. 39; no. 2; pp. 2471 - 2476
Main Authors Singh, Prashasti, Piryani, Rajesh, Singh, Vivek Kumar, Pinto, David
Format Journal Article
LanguageEnglish
Published London, England SAGE Publications 01.01.2020
Sage Publications Ltd
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Classification of research articles into different subject areas is an extremely important task in bibliometric analysis and information retrieval. There are primarily two kinds of subject classification approaches used in different academic databases: journal-based (aka source-level) and article-based (aka publication-level). The two popular academic databases- Web of Science and Scopus- use journal-based subject classification scheme for articles, which assigns articles into a subject based on the subject category assigned to the journal in which they are published. On the other hand, the recently introduced Dimensions database is the first large academic database that uses article-based subject classification scheme that assigns the article to a subject category based on its contents. Though the subject classification schemes of Web of Science have been compared in several studies, no research studies have been done on comparison of the article-based and journal-based subject classification systems in different academic databases. This paper aims to compare the accuracy of subject classification system of the three popular academic databases: Web of Science, Scopus and Dimensions through a large-scale user-based study. Results show that the commonly held belief of superiority of article-based subject classification over the journal-based subject classification scheme does not hold at least at the moment, as Web of Science appears to have the most accurate subject classification.
AbstractList Classification of research articles into different subject areas is an extremely important task in bibliometric analysis and information retrieval. There are primarily two kinds of subject classification approaches used in different academic databases: journal-based (aka source-level) and article-based (aka publication-level). The two popular academic databases- Web of Science and Scopus- use journal-based subject classification scheme for articles, which assigns articles into a subject based on the subject category assigned to the journal in which they are published. On the other hand, the recently introduced Dimensions database is the first large academic database that uses article-based subject classification scheme that assigns the article to a subject category based on its contents. Though the subject classification schemes of Web of Science have been compared in several studies, no research studies have been done on comparison of the article-based and journal-based subject classification systems in different academic databases. This paper aims to compare the accuracy of subject classification system of the three popular academic databases: Web of Science, Scopus and Dimensions through a large-scale user-based study. Results show that the commonly held belief of superiority of article-based subject classification over the journal-based subject classification scheme does not hold at least at the moment, as Web of Science appears to have the most accurate subject classification.
Author Singh, Prashasti
Singh, Vivek Kumar
Pinto, David
Piryani, Rajesh
Author_xml – sequence: 1
  givenname: Prashasti
  surname: Singh
  fullname: Singh, Prashasti
  organization: Faculty of Computer Science
– sequence: 2
  givenname: Rajesh
  surname: Piryani
  fullname: Piryani, Rajesh
  organization: Faculty of Computer Science
– sequence: 3
  givenname: Vivek Kumar
  surname: Singh
  fullname: Singh, Vivek Kumar
  organization: Faculty of Computer Science
– sequence: 4
  givenname: David
  surname: Pinto
  fullname: Pinto, David
  organization: Faculty of Computer Science
BookMark eNp1kMlKBDEQhoMouJ58gYAggrZm683b4C6C4ILHppJONMNM95hKCz6DL23GEQTRSy3U91cV_zpZ7vrOErLN2aEUUh5dX53fZ7ysa1YskTVelXlW1UW5nGpWqIwLVaySdcQxY7zMBVsjH3f2zaOPvnumOOixNZGaCSB65w1E33fUdxQMtHbqDW0hgga0eExH1PTTGQSPiekdjS_2txKMGQKY9_n4yep5ujfedsYepKKfDUh36amf2g4TjptkxcEE7dZ33iCP52cPJ5fZze3F1cnoJjOiVjED6UCISgPXlRa5sbpyvC1zXWue69aJvCxb6UpZK8fyQrXS1MKy1ChVOC3lBtlZ7J2F_nWwGJtxP4QunWyEkjVTKRaJ2l9QJvSIwbpmFvwUwnvDWTN3u5m73SzcTjT_RRsfv1yIAfzkH83eQoPwbH9--Av9BJ5Mkoc
CitedBy_id crossref_primary_10_1007_s11192_021_03948_5
crossref_primary_10_21834_e_bpj_v9iSI18_5473
crossref_primary_10_3390_publications11040051
crossref_primary_10_1016_j_joi_2021_101226
crossref_primary_10_1007_s11192_022_04576_3
crossref_primary_10_1111_jop_13575
crossref_primary_10_1007_s11192_023_04899_9
crossref_primary_10_1016_j_heliyon_2023_e13726
crossref_primary_10_1007_s10668_024_05667_2
crossref_primary_10_1177_01655515231191351
crossref_primary_10_1109_ACCESS_2025_3531778
crossref_primary_10_2478_nispa_2024_0018
crossref_primary_10_3390_su17020525
Cites_doi 10.5865/IJKCT.2012.2.1.051
10.1002/asi.20322
10.1002/asi.21086
10.1371/journal.pone.0018209
10.1023/A:1022378804087
10.1007/s11192-018-2854-z
10.1016/j.joi.2016.02.003
10.1016/j.joi.2016.02.007
10.1371/journal.pone.0039464
10.1016/j.joi.2014.11.010
10.1007/BF02458488
10.1007/BF02018480
10.1007/s11192-018-2855-y
10.1007/s11192-010-0180-1
10.1016/j.joi.2018.12.005
10.1007/BF02018475
10.1002/asi.22748
10.1002/asi.20967
10.1016/j.ipm.2014.10.011
ContentType Journal Article
Copyright 2020 – IOS Press and the authors. All rights reserved
Copyright IOS Press BV 2020
Copyright_xml – notice: 2020 – IOS Press and the authors. All rights reserved
– notice: Copyright IOS Press BV 2020
DBID AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.3233/JIFS-179906
DatabaseName CrossRef
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList
Computer and Information Systems Abstracts
CrossRef
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISSN 1875-8967
EndPage 2476
ExternalDocumentID 10_3233_JIFS_179906
10.3233_JIFS-179906
GroupedDBID .4S
.DC
4.4
5GY
8VB
AAGLT
ABCQX
ABDBF
ABJNI
ABUJY
ACGFS
ACPQW
ACUHS
ADMLS
ADZMO
AEMOZ
AENEX
AFRHK
AHDMH
AHQJS
AJNRN
AKVCP
ALMA_UNASSIGNED_HOLDINGS
AMVHM
ARCSS
ARTOV
ASPBG
AVWKF
DU5
EAD
EAP
EBA
EBR
EBS
EBU
EDO
EMK
EPL
EST
ESX
H13
HZ~
I-F
IOS
K1G
L7B
MET
MIO
MK~
MV1
NGNOM
O9-
P2P
QWB
TH9
TUS
ZL0
AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c294t-a3fa228ba1b8b25ceb8f1d75b9b15bdf2577d3f7394f0564d3c92e04f0446fb33
ISSN 1064-1246
IngestDate Fri Jul 25 10:09:47 EDT 2025
Sun Jul 06 05:02:55 EDT 2025
Thu Apr 24 23:08:22 EDT 2025
Sun Jul 13 06:01:30 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords subject classification
research category
Academic databases
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c294t-a3fa228ba1b8b25ceb8f1d75b9b15bdf2577d3f7394f0564d3c92e04f0446fb33
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
PQID 2439042436
PQPubID 2046407
PageCount 6
ParticipantIDs proquest_journals_2439042436
crossref_primary_10_3233_JIFS_179906
crossref_citationtrail_10_3233_JIFS_179906
sage_journals_10_3233_JIFS_179906
PublicationCentury 2000
PublicationDate 2020-01-01
PublicationDateYYYYMMDD 2020-01-01
PublicationDate_xml – month: 01
  year: 2020
  text: 2020-01-01
  day: 01
PublicationDecade 2020
PublicationPlace London, England
PublicationPlace_xml – name: London, England
– name: London
PublicationTitle Journal of intelligent & fuzzy systems
PublicationYear 2020
Publisher SAGE Publications
Sage Publications Ltd
Publisher_xml – name: SAGE Publications
– name: Sage Publications Ltd
References 2012; 2
2018; 117
2006; 57
2015; 51
2009; 60
2019; 13
2016; 10
1999; 44
2012; 7
1996; 35
2015; 9
2011; 6
2012; 63
2003; 56
2010; 82
e_1_3_2_9_2
e_1_3_2_15_2
e_1_3_2_8_2
e_1_3_2_16_2
e_1_3_2_7_2
e_1_3_2_17_2
e_1_3_2_6_2
e_1_3_2_18_2
e_1_3_2_19_2
e_1_3_2_20_2
e_1_3_2_10_2
e_1_3_2_21_2
e_1_3_2_5_2
e_1_3_2_11_2
e_1_3_2_4_2
e_1_3_2_12_2
e_1_3_2_3_2
e_1_3_2_13_2
e_1_3_2_2_2
e_1_3_2_14_2
References_xml – volume: 2
  start-page: 51
  issue: 1
  year: 2012
  end-page: 65
  article-title: A Preliminary Study on the Multiple Mapping Structure of Classification Systems for Heterogeneous Databases
  publication-title: International Journal of Knowledge Content Development & Technology
– volume: 10
  start-page: 347
  issue: 2
  year: 2016
  end-page: 364
  article-title: Large-scale analysis of the accuracy of the journal classification systems of Web of Science and Scopus
  publication-title: Journal of Informetrics
– volume: 13
  start-page: 202
  issue: 1
  year: 2019
  end-page: 225
  article-title: Comparing journal and paper level classifications of science
  publication-title: Journal of Informetrics
– volume: 56
  start-page: 357
  issue: 3
  year: 2003
  end-page: 367
  article-title: A new classification scheme of science fields and subfields designed for scientometric evaluation purposes
  publication-title: Scientometrics
– volume: 10
  start-page: 365
  issue: 2
  year: 2016
  end-page: 391
  article-title: A review of the literature on citation impact indicators
  publication-title: Journal of Informetrics
– volume: 9
  start-page: 102
  issue: 1
  year: 2015
  end-page: 117
  article-title: Field-normalized citation impact indicators using algorithmically constructed classification systems of science
  publication-title: Journal of Informetrics
– volume: 51
  start-page: 50
  issue: 2
  year: 2015
  end-page: 61
  article-title: New journal classification methods based on the global h-index
  publication-title: Information Processing & Management
– volume: 35
  start-page: 223
  issue: 2
  year: 1996
  end-page: 235
  article-title: Coping with the problem of subject classification diversity
  publication-title: Scientometrics
– volume: 60
  start-page: 348
  issue: 2
  year: 2009
  end-page: 362
  article-title: A global map of science based on the ISI subject categories
  publication-title: Journal of the American Society for Information Science and Technology
– volume: 57
  start-page: 601
  issue: 5
  year: 2006
  end-page: 613
  article-title: Can scientific journals be classified in terms of aggregated journal-journal citation relations using the Journal Citation Reports?
  publication-title: Journal of the American Society for Information Science and Technology
– volume: 35
  start-page: 167
  issue: 2
  year: 1996
  end-page: 176
  article-title: The need for standards in bibliometric research and technology
  publication-title: Scientometrics
– volume: 6
  start-page: e18209
  issue: 4
  year: 2011
  article-title: Multilevel compression of random walks on networks reveals hierarchical organization in large integrated systems
  publication-title: PLoS One
– volume: 117
  start-page: 637
  issue: 1
  year: 2018
  end-page: 640
  article-title: “Field classification of publications in Dimensions: a first case study testing its reliability and validity,”
  publication-title: Scientometrics
– volume: 44
  start-page: 427
  issue: 3
  year: 1999
  end-page: 439
  article-title: An item-by-item subject classification of papers published in multidisciplinary and general journals using reference analysis
  publication-title: Scientometrics
– volume: 60
  start-page: 1823
  issue: 9
  year: 2009
  end-page: 1835
  article-title: Content-based and algorithmic classifications of journals: Perspectives on the dynamics of scientific communication and indexer effects
  publication-title: Journal of the American Society for Information Science and Technology
– volume: 7
  start-page: e39464
  issue: 7
  year: 2012
  article-title: “Design and Update of a Classification System: The UCSD Map of Science,”
  publication-title: PLoS ONE
– volume: 63
  start-page: 2378
  issue: 12
  year: 2012
  end-page: 2392
  article-title: A new methodology for constructing a publication-level classification system of science
  publication-title: Journal of the American Society for Information Science and Technology
– volume: 82
  start-page: 687
  issue: 3
  year: 2010
  end-page: 706
  article-title: Journal cross-citation analysis for validation and improvement of journal-based subject classification in bibliometric research
  publication-title: Scientometrics
– volume: 117
  start-page: 641
  issue: 1
  year: 2018
  end-page: 645
  article-title: Response to the letter ‘Field classification of publications in Dimensions: a first case study testing its reliability and validity
  publication-title: Scientometrics
– ident: e_1_3_2_9_2
  doi: 10.5865/IJKCT.2012.2.1.051
– ident: e_1_3_2_11_2
  doi: 10.1002/asi.20322
– ident: e_1_3_2_12_2
  doi: 10.1002/asi.21086
– ident: e_1_3_2_20_2
  doi: 10.1371/journal.pone.0018209
– ident: e_1_3_2_4_2
  doi: 10.1023/A:1022378804087
– ident: e_1_3_2_8_2
  doi: 10.1007/s11192-018-2854-z
– ident: e_1_3_2_17_2
  doi: 10.1016/j.joi.2016.02.003
– ident: e_1_3_2_16_2
  doi: 10.1016/j.joi.2016.02.007
– ident: e_1_3_2_2_2
  doi: 10.1371/journal.pone.0039464
– ident: e_1_3_2_13_2
  doi: 10.1016/j.joi.2014.11.010
– ident: e_1_3_2_6_2
  doi: 10.1007/BF02458488
– ident: e_1_3_2_7_2
  doi: 10.1007/BF02018480
– ident: e_1_3_2_3_2
  doi: 10.1007/s11192-018-2855-y
– ident: e_1_3_2_19_2
  doi: 10.1007/s11192-010-0180-1
– ident: e_1_3_2_14_2
  doi: 10.1016/j.joi.2018.12.005
– ident: e_1_3_2_5_2
  doi: 10.1007/BF02018475
– ident: e_1_3_2_15_2
  doi: 10.1002/asi.22748
– ident: e_1_3_2_21_2
– ident: e_1_3_2_10_2
  doi: 10.1002/asi.20967
– ident: e_1_3_2_18_2
  doi: 10.1016/j.ipm.2014.10.011
SSID ssj0017520
Score 2.304896
Snippet Classification of research articles into different subject areas is an extremely important task in bibliometric analysis and information retrieval. There are...
SourceID proquest
crossref
sage
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 2471
SubjectTerms Bibliometrics
Classification
Classification schemes
Indexing
Information retrieval
Science
Title Revisiting subject classification in academic databases: A comparison of the classification accuracy of Web of Science, Scopus & Dimensions
URI https://journals.sagepub.com/doi/full/10.3233/JIFS-179906
https://www.proquest.com/docview/2439042436
Volume 39
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Nb9NAEF2F9gIHxKcoLWiRKg4EQ7Jex1luETQqVVqkJoHcLO96Vw1USRXbh-Yv8KP4a8x41x9NIwRcnMTrXSV5L-PZyZsZQg4NuN3GCO2JRMMGRQMWfdkznuEmNDCgOhIThU_PesdTfjILZq3Wr4ZqKc_kO7XemlfyP6jCOcAVs2T_AdlqUTgBzwFfOALCcPwrjM-L1PBCuJzmEiMqbYXeMMp_4lLFGJcKeBSD4k0rtdnoqtmBsPA_N-bGSuUr7AYPw9-0LNJbrCUoQqZqeZWnljnYISCtAn-3Xd15VfczKyaYfL2-dkWkK59-DB_DxnhWcXoRg-mpzPZ8dW1bT7XP4-86vbg15SvY7B_tQi5eT1oUHaIaqn0X3GCdjeBGU7dkhYG11gmtNfhTHjgorpa2PQcbMK8vbI-P0sTbekmOyqxpr7ltALN5I_EZBrqHJ5-HYyzgKjpbynWffYmG09EomhzNJnfILoN9Chja3cGn09G4-iMrDJgtiOHeqU0RxeXfNxa_6RTVO52GuLDwdyYPyH2HHh1Y1j0kLb14RO41ylc-Jj9r_lHHP3qTQ3S-oCX_aMW_D3RAa_bRpaHAvs2ZJftwGNiHD459b6nlHn1Na-Y9IdPh0eTjseeae3iKCZ55sW9ixvoy7sq-ZIHSsm-6SRhIIbuBTAzcSsLEN6EvuAEnnSe-Ekx34AXnPSN9_ynZWSwX-hmhXSzowLWKe4Jz2O6IUCktdMh1z2dKiT3ypvx2I-Uq32MDlssIdsAIRYRQRBaKPXJYXXxlC75sv-yghClyFiGNGHj3KCXwYfgVQlcPbVni-Z-X2Cd361_EAdnJVrl-AQ5wJl86kv0Gj4a3CQ
linkProvider EBSCOhost
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Revisiting+subject+classification+in+academic+databases%3A+A+comparison+of+the+classification+accuracy+of+Web+of+Science%2C+Scopus+%26+Dimensions&rft.jtitle=Journal+of+intelligent+%26+fuzzy+systems&rft.au=Singh%2C+Prashasti&rft.au=Piryani%2C+Rajesh&rft.au=Singh%2C+Vivek+Kumar&rft.au=Pinto%2C+David&rft.date=2020-01-01&rft.pub=Sage+Publications+Ltd&rft.issn=1064-1246&rft.eissn=1875-8967&rft.volume=39&rft.issue=2&rft.spage=2471&rft_id=info:doi/10.3233%2FJIFS-179906&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1064-1246&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1064-1246&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1064-1246&client=summon