Automated indexing using NLM's Medical Text Indexer (MTI) compared to human indexing in Medline: a pilot study

Bibliographic Details
Published in Journal of the Medical Library Association Vol. 111; no. 3; pp. 684-695
Main Authors Chen, Yin Yin; Bullard, Julia; Giustini, Dean
Format Journal Article
Language English
Published United States Medical Library Association 10.07.2023
University Library System, University of Pittsburgh
ISSN 1536-5050
1558-9439
DOI 10.5195/jmla.2023.1588

Abstract Objective: In 2002, the National Library of Medicine (NLM) introduced semi-automated indexing of Medline using the Medical Text Indexer (MTI). In 2021, NLM announced that it would fully automate its indexing in Medline with an improved MTI by mid-2022. This pilot study examines indexing using a sample of records in Medline from 2000, and how an early, public version of MTI's outputs compares to records created by human indexers. Methods: This pilot study examines twenty Medline records from 2000, a year before the MTI was introduced as a MeSH term recommender. We identified twenty higher- and lower-impact biomedical journals based on Journal Impact Factor (JIF) and examined the indexing of papers by feeding their PubMed records into the Interactive MTI tool. Results: In the sample, we found key differences between automated and human-indexed Medline records: MTI assigned more terms and used them more accurately for citations in the higher JIF group, and MTI tended to rank the Male check tag more highly than the Female check tag and to omit Aged check tags. Sometimes MTI chose more specific terms than human indexers but was inconsistent in applying specificity principles. Conclusion: NLM’s transition to fully automated indexing of the biomedical literature could introduce or perpetuate inconsistencies and biases in Medline. Librarians and searchers should assess changes to index terms, and their impact on PubMed’s mapping features for a range of topics. Future research should evaluate automated indexing as it pertains to finding clinical information effectively, and in performing systematic searches.
Audience Academic
Author Giustini, Dean
Bullard, Julia
Chen, Yin Yin
Author_xml – sequence: 1
  givenname: Yin Yin
  orcidid: 0000-0003-0350-0513
  surname: Chen
  fullname: Chen, Yin Yin
– sequence: 2
  givenname: Julia
  orcidid: 0000-0002-6815-0609
  surname: Bullard
  fullname: Bullard, Julia
– sequence: 3
  givenname: Dean
  orcidid: 0000-0002-6197-8788
  surname: Giustini
  fullname: Giustini, Dean
BackLink https://www.ncbi.nlm.nih.gov/pubmed/37483360$$D View this record in MEDLINE/PubMed
CitedBy_id crossref_primary_10_1038_s41597_023_02869_7
crossref_primary_10_1080_03007995_2024_2350612
crossref_primary_10_18438_eblip30415
crossref_primary_10_1186_s13643_024_02551_y
crossref_primary_10_1093_ijpp_riae042
crossref_primary_10_1016_j_sapharm_2024_06_003
crossref_primary_10_34133_hds_0125
ContentType Journal Article
Copyright Copyright © 2023 Eileen Chen, Julia Bullard, Dean Giustini.
COPYRIGHT 2023 Medical Library Association
Copyright © 2023 Eileen Chen, Julia Bullard, Dean Giustini 2023
DOI 10.5195/jmla.2023.1588
DatabaseName CrossRef
MEDLINE
MEDLINE (Ovid)
PubMed
MEDLINE - Academic
PubMed Central (Full Participant titles)
Directory of Open Access Journals (DOAJ)
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Library & Information Science
EISSN 1558-9439
EndPage 695
ExternalDocumentID oai_doaj_org_article_24f8eb6f21544f8bad141a406237a882
PMC10361558
A758821372
37483360
10_5195_jmla_2023_1588
Genre Journal Article
GeographicLocations United States
GeographicLocations_xml – name: United States
ISSN 1536-5050
1558-9439
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 3
Keywords Automated indexing
Medline
PubMed
Medical Text Indexer (MTI)
information retrieval
human indexers
Language English
License https://creativecommons.org/licenses/by/4.0
Copyright © 2023 Eileen Chen, Julia Bullard, Dean Giustini.
This work is licensed under a Creative Commons Attribution 4.0 International License.
ORCID 0000-0003-0350-0513
0000-0002-6815-0609
0000-0002-6197-8788
OpenAccessLink https://doaj.org/article/24f8eb6f21544f8bad141a406237a882
PMID 37483360
PageCount 12
PublicationCentury 2000
PublicationDate 20230710
PublicationDateYYYYMMDD 2023-07-10
PublicationDate_xml – month: 7
  year: 2023
  text: 20230710
  day: 10
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Journal of the Medical Library Association
PublicationTitleAlternate J Med Libr Assoc
PublicationYear 2023
Publisher Medical Library Association
University Library System, University of Pittsburgh
Publisher_xml – name: Medical Library Association
– name: University Library System, University of Pittsburgh
StartPage 684
SubjectTerms Abstracting and Indexing - methods
Abstracting and Indexing - standards
Automated indexing
human indexers
Indexing
information retrieval
Libraries
Medical Subject Headings
Medical Text Indexer (MTI)
MEDLINE
National Library of Medicine (U.S.)
Original Investigation
Pilot Projects
PubMed
Rankings
Technology application
United States
Title Automated indexing using NLM's Medical Text Indexer (MTI) compared to human indexing in Medline: a pilot study
URI https://www.ncbi.nlm.nih.gov/pubmed/37483360
https://www.proquest.com/docview/2841401582
https://pubmed.ncbi.nlm.nih.gov/PMC10361558
https://doaj.org/article/24f8eb6f21544f8bad141a406237a882
Volume 111