Leveraging the global research infrastructure to characterize the impact of National Science Foundation research

The global research infrastructure (GRI) is made up of the repositories and organizations that provide persistent identifiers (PIDs) and metadata for many kinds of research objects and connect these objects to funders, research institutions, researchers, and one another using PIDs. The INFORMATE Pro...

Full description

Saved in:
Bibliographic Details
Published inInformation services & use Vol. 45; no. 1-2; pp. 30 - 47
Main Authors Jones, Jamaica, Habermann, Ted
Format Journal Article
LanguageEnglish
Published London, England SAGE Publications 01.05.2025
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The global research infrastructure (GRI) is made up of the repositories and organizations that provide persistent identifiers (PIDs) and metadata for many kinds of research objects and connect these objects to funders, research institutions, researchers, and one another using PIDs. The INFORMATE Project has combined three data sources to focus on understanding how the global research infrastructure might help the US National Science Foundation (NSF) and other federal agencies identify and characterize the impact of their support. In this paper we present INFORMATE observations of three data systems. The NSF Award database represents NSF funding while the NSF Public Access Repository (PAR) and CHORUS, as a proxy for the GRI, represent two different views of results of that funding. We compare the first at the level of awards and the second two at the level of published research articles. Our findings demonstrate that CHORUS datasets include significantly more NSF awards and more related papers than does PAR. Our findings also suggest that time plays a significant role in the inclusion of award metadata across the sources analyzed. Data in those sources travel very different journeys, each presenting different obstacles to metadata completeness and suggesting necessary actions on the parts of authors and publishers to ensure that publication and funding metadata are captured. We discuss these actions, as well as the implications that our findings have for emergent technologies such as artificial intelligence and natural language processing.
AbstractList The global research infrastructure (GRI) is made up of the repositories and organizations that provide persistent identifiers (PIDs) and metadata for many kinds of research objects and connect these objects to funders, research institutions, researchers, and one another using PIDs. The INFORMATE Project has combined three data sources to focus on understanding how the global research infrastructure might help the US National Science Foundation (NSF) and other federal agencies identify and characterize the impact of their support. In this paper we present INFORMATE observations of three data systems. The NSF Award database represents NSF funding while the NSF Public Access Repository (PAR) and CHORUS, as a proxy for the GRI, represent two different views of results of that funding. We compare the first at the level of awards and the second two at the level of published research articles. Our findings demonstrate that CHORUS datasets include significantly more NSF awards and more related papers than does PAR. Our findings also suggest that time plays a significant role in the inclusion of award metadata across the sources analyzed. Data in those sources travel very different journeys, each presenting different obstacles to metadata completeness and suggesting necessary actions on the parts of authors and publishers to ensure that publication and funding metadata are captured. We discuss these actions, as well as the implications that our findings have for emergent technologies such as artificial intelligence and natural language processing.
Author Jones, Jamaica
Habermann, Ted
Author_xml – sequence: 1
  givenname: Jamaica
  surname: Jones
  fullname: Jones, Jamaica
  organization: , Boulder, CO, USA
– sequence: 2
  givenname: Ted
  surname: Habermann
  fullname: Habermann, Ted
  organization: , Boulder, CO, USA
BookMark eNplkMFOwzAQRC1UJNrCB3DzD6R47dhOjqiiUKmCA3COHGedpip2ZScc-HqSgrhwGu2TZrQzCzLzwSMht8BWAFrfQaFloYuSSxBCMV1ekPnEsgnOyJyB0pnkSl6RRUoHxlguBMzJaYefGE3b-Zb2e6TtMdTmSCMmNNHuaeddNKmPg-2HiLQP1O5NNLbH2H3h2dJ9nMabBkefTd8FP9pfbYfeIt2EwTdn-Jd4TS6dOSa8-dUled88vK2fst3L43Z9v8ssSF5mtdaFa0Ap04yVXC65VALLOpcl1NYwy-vCYaN0niNHEEZYrQEay9HVGkqxJKuf3GRarA5hiONjqQJWTXtV__YS32A0YTg
ContentType Journal Article
Copyright The Author(s) 2025
Copyright_xml – notice: The Author(s) 2025
DBID AFRWT
DOI 10.1177/18758789251336079
DatabaseName Sage Journals GOLD Open Access 2024
DatabaseTitleList
Database_xml – sequence: 1
  dbid: AFRWT
  name: Sage Journals GOLD Open Access 2024
  url: http://journals.sagepub.com/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Library & Information Science
EISSN 1875-8789
EndPage 47
ExternalDocumentID 10.1177_18758789251336079
GrantInformation_xml – fundername: National Science Foundation
  grantid: 2334426
  funderid: https://doi.org/10.13039/100000001
GroupedDBID -W8
-~X
.4I
.4S
.DC
.GO
0R~
29I
4.4
5GY
77K
8VB
AAFNC
AAFWJ
AAGLT
AAHSB
AAQXI
ABDBF
ABJNI
ABUBZ
ABUJY
ACGFS
ACPQW
ACUHS
ADMLS
ADZMO
AEJQA
AEMOZ
AFRHK
AFRWT
AFYTF
AGIAB
AHDMH
AHQJS
AJNRN
AKVCP
ALMA_UNASSIGNED_HOLDINGS
APPIZ
ARCSS
ARTOV
CAG
COF
DU5
EAD
EAP
EAS
EBA
EBE
EBR
EBS
EBU
EDJ
EDO
EJD
ELW
EMK
EPL
EST
ESX
F5P
H13
HZ~
I-F
IL9
IOS
J8X
K1G
LPU
MET
MIO
MK~
ML~
MV1
NGNOM
NIF
O9-
P2P
Q1R
QWB
SAUOL
SCNPE
SFC
TH9
TN5
TUS
ZL0
~02
ID FETCH-LOGICAL-c1529-b778fd166ad251f452563e9b4591bca0c2b8fed6744e2e13a3c7711dc2efb7193
IEDL.DBID AFRWT
ISSN 0167-5265
IngestDate Tue Jun 17 22:26:39 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1-2
Keywords persistent identifiers
metadata
NSF Public Access Repository
research impact
CHORUS
global research infrastructure
PIDs
GRI
Language English
License This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c1529-b778fd166ad251f452563e9b4591bca0c2b8fed6744e2e13a3c7711dc2efb7193
OpenAccessLink https://journals.sagepub.com/doi/full/10.1177/18758789251336079?utm_source=summon&utm_medium=discovery-provider
PageCount 18
ParticipantIDs sage_journals_10_1177_18758789251336079
PublicationCentury 2000
PublicationDate 20250500
PublicationDateYYYYMMDD 2025-05-01
PublicationDate_xml – month: 5
  year: 2025
  text: 20250500
PublicationDecade 2020
PublicationPlace London, England
PublicationPlace_xml – name: London, England
PublicationTitle Information services & use
PublicationYear 2025
Publisher SAGE Publications
Publisher_xml – name: SAGE Publications
References Bates, Lin, Goodale 2016; 3
Dumanis, Ratan, McIntosh 2023; 19
Gerasimov, Kc, Mehrabian 2024; 129
Kramer, de Jonge 2022; 3
Habermann, Jones, Packer 2023
Schares 2023; 4
References_xml – year: 2023
  article-title: INFORMATE: metadata game changers and CHORUS collaborate to make the invisible visible
  publication-title: Metadata Game Changers
– volume: 4
  start-page: 1
  issue: 1
  year: 2023
  end-page: 21
  article-title: Impact of the 2022 OSTP memo: a bibliometric analysis of U.S. federally funded publications, 2017–2021
  publication-title: Quant Sci Stud
– volume: 3
  start-page: 2053951716654502
  issue: 2
  year: 2016
  article-title: Data journeys: capturing the socio-material constitution of data objects and flows
  publication-title: Big Data Soc
– volume: 3
  start-page: 583
  issue: 3
  year: 2022
  end-page: 599
  article-title: The availability and completeness of open funder metadata: case study for publications funded by the Dutch Research Council
  publication-title: Quant Sci Stud
– volume: 19
  start-page: e1011626
  issue: 12
  year: 2023
  article-title: From policy to practice: Lessons learned from an open science funding initiative
  publication-title: PLoS Comput Biol
– volume: 129
  start-page: 3681
  issue: 7
  year: 2024
  end-page: 3704
  article-title: Comparison of datasets citation coverage in Google Scholar, web of science, Scopus, Crossref, and DataCite
  publication-title: Scientometrics
SSID ssj0004331
Score 2.3241184
Snippet The global research infrastructure (GRI) is made up of the repositories and organizations that provide persistent identifiers (PIDs) and metadata for many...
SourceID sage
SourceType Publisher
StartPage 30
Title Leveraging the global research infrastructure to characterize the impact of National Science Foundation research
URI https://journals.sagepub.com/doi/full/10.1177/18758789251336079
Volume 45
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwELVKu8CAoID4KvKAYAqNndiOJ1QhqgoBA2pFt8p2zmKgDSrtAL8eO3FaEAysUWI7Z1988d17D6FzKmJhpHPAPFbS_aCINFLSeibMOGcJI5CWQNqHRz4YpXdjNm6gosbCBAu-X_myKjei8mPtvdufRndDkrFLXJSdiUxSr07CYyGvl4vppDrurlU1_BWfn15OfWrb-ILIj6iGt22gFhWcOU9u9fpPz8M1lLKWMHTD98zxIRH6Z6c_ir_K_ai_g7ZDIIl71czvogbM2mjrG71gG3UCKAFf4IA68rOAgzvvobd7cMMtZYqwiwNxRQ6CA__PC3aLb64qgtnlHPCiwGZF7_wJ5SMVyBIXFgeC7de6ebwWbFq1uI9G_dvhzSAKCgyRcfu6jLQQmc0J5yp3b219DpQnIHXKJNFGxYbqzELORZoCBZKoxAhBSG4oWC1cbHiAmrNiBocIkzxxVsskWAOpYDqjoJQBS5jhOgZ5hC69MSf1ApiQQET-y-zH_77zBG1SL9db1ieeoqYzGHRcDLHQZ2HevwAJrsNw
linkProvider SAGE Publications
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwED1BOwADggLiq-ABwRQUJ44djxWiKtB2QK3oVtnOWQxVi0pZ-PXYiduCYGBPTs7zne8i33sHcJWIWBjpArCIlXQ_KIJFSlqvhBkXWZpRZCWRttfnnSF7HGWj0FXpuTABwfdb31blVlQe1qvo9jxxV2DnIpeJH0zCYyE3oc581qpBvdV-fhmsWZHLaYRuJV4EPtxp_mnkRx9XmVrae7AbakLSqjZxHzZw2oCdb0qBDWgGfgG5JoFA5AElITIP4K2LzifLiUPElXSk0vkgQcrnlTg_mqtKK_ZjjmQxI2al1PyJ5SsVX5LMLAla2ZOlebKevbSyeAjD9v3grhOFYQqRcSlaRlqI3BaUc1W4r7b-OpOnKDXLJNVGxSbRucWCC8YwQZqq1AhBaWEStFq4Mu8IatPZFI-B0CJ1qOUSrUEmMp0nqJRBSzPDdYzyBG48mOPlXo5p0BT_Bfvpv5-8hK3OoNcddx_6T2ewnfgpvGXb4TnUHHjYdKXBQl8EH_gCLcmw2Q
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwED1BKyEYEBQQXwUPCKZAnDhxPFZAVKBUCLWiW-XYZzGgtipl4ddjJ24LgoE9OTnPd7lLfPcewFnEQ66EDUAdSmE_UDgLpDCOCTPUSZxQZOUg7WM3bffZ_SAZ-B9ubhbGI_h-6dqq7IrKl7WL7ok2V_6M8YraIjvjmYicOEkacrEKdcZsbqxBvZU_v_SWk5FzRUK7GkcE7881_zTyo5erTC_5Fmz6upC0qo3chhUcNWDjG1tgA5p-xoCcEz9E5EAlPjp3YNJB65el6hCxZR2puD6Ip_N5JdaXprLii_2YIpmNiVqwNX9ieUs1M0nGhni-7Le5ebLUX1pY3IV-ftu7bgdeUCFQNk2LoOA8M5qmqdT2qY070kxjFAVLBC2UDFVUZAZ1yhnDCGksY8U5pVpFaApuS709qI3GI9wHQnVsUcsEGoWMJ0UWoZQKDU1UWoQoDuDCgTmc7-eQel7xX7Af_vvKU1h7usmHnbvuwxGsR06It-w8PIaaxQ6btjqYFSfeBb4A8xqx6Q
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Leveraging+the+global+research+infrastructure+to+characterize+the+impact+of+National+Science+Foundation+research&rft.jtitle=Information+services+%26+use&rft.au=Jones%2C+Jamaica&rft.au=Habermann%2C+Ted&rft.date=2025-05-01&rft.pub=SAGE+Publications&rft.issn=0167-5265&rft.eissn=1875-8789&rft.volume=45&rft.issue=1-2&rft.spage=30&rft.epage=47&rft_id=info:doi/10.1177%2F18758789251336079&rft.externalDocID=10.1177_18758789251336079
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-5265&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-5265&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-5265&client=summon