metID: an R package for automatable compound annotation for LC−MS-based data

Abstract Summary Accurate and efficient compound annotation is a long-standing challenge for LC–MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information use...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 38; no. 2; pp. 568 - 569
Main Authors Shen, Xiaotao, Wu, Si, Liang, Liang, Chen, Songjie, Contrepois, Kévin, Zhu, Zheng-Jiang, Snyder, Michael
Format Journal Article
LanguageEnglish
Published England Oxford University Press 03.01.2022
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Abstract Summary Accurate and efficient compound annotation is a long-standing challenge for LC–MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials. Availability and implementation https://jaspershen.github.io/metID. Supplementary information Supplementary data are available at Bioinformatics online.
AbstractList Accurate and efficient compound annotation is a long-standing challenge for LC-MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials.SUMMARYAccurate and efficient compound annotation is a long-standing challenge for LC-MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials.https://jaspershen.github.io/metID.AVAILABILITY AND IMPLEMENTATIONhttps://jaspershen.github.io/metID.Supplementary data are available at Bioinformatics online.SUPPLEMENTARY INFORMATIONSupplementary data are available at Bioinformatics online.
Abstract Summary Accurate and efficient compound annotation is a long-standing challenge for LC–MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials. Availability and implementation https://jaspershen.github.io/metID. Supplementary information Supplementary data are available at Bioinformatics online.
Accurate and efficient compound annotation is a long-standing challenge for LC-MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials. https://jaspershen.github.io/metID. Supplementary data are available at Bioinformatics online.
Author Liang, Liang
Wu, Si
Shen, Xiaotao
Contrepois, Kévin
Chen, Songjie
Snyder, Michael
Zhu, Zheng-Jiang
AuthorAffiliation 2 Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences , Shanghai 200032, China
1 Department of Genetics, Stanford University School of Medicine , Stanford, CA 94304, USA
AuthorAffiliation_xml – name: 2 Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences , Shanghai 200032, China
– name: 1 Department of Genetics, Stanford University School of Medicine , Stanford, CA 94304, USA
Author_xml – sequence: 1
  givenname: Xiaotao
  orcidid: 0000-0002-9608-9964
  surname: Shen
  fullname: Shen, Xiaotao
  organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA
– sequence: 2
  givenname: Si
  surname: Wu
  fullname: Wu, Si
  organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA
– sequence: 3
  givenname: Liang
  surname: Liang
  fullname: Liang, Liang
  organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA
– sequence: 4
  givenname: Songjie
  surname: Chen
  fullname: Chen, Songjie
  organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA
– sequence: 5
  givenname: Kévin
  surname: Contrepois
  fullname: Contrepois, Kévin
  organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA
– sequence: 6
  givenname: Zheng-Jiang
  surname: Zhu
  fullname: Zhu, Zheng-Jiang
  email: jiangzhu@sioc.ac.cn
  organization: Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai 200032, China
– sequence: 7
  givenname: Michael
  surname: Snyder
  fullname: Snyder, Michael
  email: mpsnyder@stanford.edu
  organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA
BackLink https://www.ncbi.nlm.nih.gov/pubmed/34432001$$D View this record in MEDLINE/PubMed
BookMark eNqNkctO3TAURS0E4lV-AWXYSYqfiYMqJHTLS7qA1MfYOnYcMCR2GieV-APGfCJfguFeEHTSjmzLa-99dPYWWvXBW4R2Cf5CcMX2tAvON2HoYHQm7ukRtJBsBW0SXuCcYlGtpjsrypxLzDbQVow3GAvCOV9HG4xzRjEmm-iis-PZt_0MfPY968HcwpXNkm8G0xiSOejWZiZ0fZh8nSgfxpQY_Asznz3eP5z_yDVEW2d1oj-htQbaaHeW5zb6dXz0c3aazy9PzmaH89xwUYy5gbKyVELRNJJLDcxqqXmFSwKMaW4kJpzKwkrgRKS3FnUphWkEpdTikrJtdLDw7Sfd2dpYPw7Qqn5wHQx3KoBTH3-8u1ZX4Y-SJaWlqJLB56XBEH5PNo6qc9HYtgVvwxQVFQXnspKyTOju-6y3kNclJqBYAGYIMQ62eUMIVs9tqY9tqWVbSfj1L6Fxi_WmmV37bzlZyMPU_2_kE1wxtns
CitedBy_id crossref_primary_10_1093_bioinformatics_btac546
crossref_primary_10_1016_j_envpol_2024_124460
crossref_primary_10_1038_s41467_023_42659_8
crossref_primary_10_1038_s41467_024_51433_3
crossref_primary_10_3168_jds_2024_25167
crossref_primary_10_1016_j_isci_2023_106881
crossref_primary_10_1038_s41467_022_32155_w
crossref_primary_10_1016_j_talanta_2025_127812
crossref_primary_10_1038_s41551_022_00999_8
crossref_primary_10_1038_s43587_024_00692_2
crossref_primary_10_1021_acs_chemrev_4c00567
crossref_primary_10_1186_s12859_024_06022_y
crossref_primary_10_1007_s00204_023_03620_2
crossref_primary_10_1021_acs_est_4c01156
crossref_primary_10_1021_acsestwater_4c00977
crossref_primary_10_1016_j_tifs_2023_104298
crossref_primary_10_1016_j_tifs_2023_104213
crossref_primary_10_1101_gr_276521_121
crossref_primary_10_1111_acel_13813
crossref_primary_10_1111_jipb_13774
crossref_primary_10_1016_j_scitotenv_2024_171876
crossref_primary_10_1016_j_scitotenv_2024_174715
crossref_primary_10_3390_metabo12020173
Cites_doi 10.1016/j.cell.2020.04.043
10.1007/s11306-007-0082-2
10.1038/nmeth.3393
10.1038/nrd.2016.32
10.1038/s41467-019-09550-x
10.1016/j.copbio.2018.07.010
ContentType Journal Article
Copyright The Author(s) 2021. Published by Oxford University Press. 2021
The Author(s) 2021. Published by Oxford University Press.
Copyright_xml – notice: The Author(s) 2021. Published by Oxford University Press. 2021
– notice: The Author(s) 2021. Published by Oxford University Press.
DBID TOX
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
5PM
DOI 10.1093/bioinformatics/btab583
DatabaseName Oxford Journals Open Access Collection
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
PubMed Central (Full Participant titles)
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE - Academic

MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 3
  dbid: TOX
  name: Oxford Journals Open Access Collection
  url: https://academic.oup.com/journals/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1460-2059
1367-4811
EndPage 569
ExternalDocumentID PMC8722759
34432001
10_1093_bioinformatics_btab583
10.1093/bioinformatics/btab583
Genre Journal Article
GroupedDBID -~X
.2P
.I3
482
48X
53G
5GY
6.Y
AAIMJ
AAJKP
AAKPC
AAMVS
AAPQZ
AAPXW
AARHZ
AAVAP
ABEFU
ABNKS
ABPTD
ABSAR
ABSMQ
ABWST
ABXVV
ABZBJ
ACGFS
ACMRT
ACPQN
ACUFI
ACYTK
ADEYI
ADFTL
ADGZP
ADHKW
ADOCK
ADRIX
ADRTK
ADYVW
ADZTZ
ADZXQ
AECKG
AEGPL
AEJOX
AEKKA
AEKPW
AEKSI
AELWJ
AEPUE
AETBJ
AFFNX
AFFZL
AFOFC
AFSHK
AFXEN
AGINJ
AGKRT
AGQXC
AI.
ALMA_UNASSIGNED_HOLDINGS
ALTZX
AQDSO
ARIXL
ASAOO
ATDFG
ATTQO
AXUDD
AYOIW
AZFZN
AZVOD
BCRHZ
BHONS
CXTWN
CZ4
DFGAJ
EE~
ELUNK
F5P
F9B
FEDTE
H5~
HAR
HVGLF
HW0
IOX
KOP
KSI
KSN
MBTAY
MVM
NGC
PB-
Q1.
Q5Y
QBD
RD5
RIG
ROL
ROX
ROZ
RXO
TCN
TLC
TN5
TOX
TR2
VH1
WH7
XJT
ZGI
~91
---
-E4
.DC
0R~
23N
2WC
4.4
5WA
70D
AAIJN
AAMDB
AAOGV
AAVLN
AAYXX
ABEJV
ABEUO
ABGNP
ABIXL
ABPQP
ABQLI
ACIWK
ACPRK
ACUXJ
ADBBV
ADEZT
ADGKP
ADHZD
ADMLS
ADPDF
ADRDM
ADVEK
AEMDU
AENEX
AENZO
AEWNT
AFGWE
AFIYH
AFRAH
AGKEF
AGSYK
AHMBA
AHXPO
AIJHB
AJEEA
AJEUX
AKHUL
AKWXX
ALUQC
AMNDL
APIBT
APWMN
ASPBG
AVWKF
BAWUL
BAYMD
BQDIO
BQUQU
BSWAC
BTQHN
C45
CDBKE
CITATION
CS3
DAKXR
DIK
DILTD
DU5
D~K
EBD
EBS
EMOBN
FHSFR
FLIZI
FLUFQ
FOEOM
FQBLK
GAUVT
GJXCC
GROUPED_DOAJ
GX1
H13
HZ~
J21
JXSIZ
KAQDR
KQ8
M-Z
MK~
ML0
N9A
NLBLG
NMDNZ
NOMLY
NU-
O9-
OAWHX
ODMLO
OJQWA
OK1
OVD
OVEED
P2P
PAFKI
PEELM
PQQKQ
R44
RNS
RPM
RUSNO
RW1
SV3
TEORI
TJP
W8F
WOQ
X7H
YAYTL
YKOAZ
YXANX
ZKX
~KM
CGR
CUY
CVF
ECM
EIF
M49
NPM
7X8
5PM
ID FETCH-LOGICAL-c456t-ca79e28a6ff848ba3eb8b49071a33b4c8014286e8a415b4cb5d785cf5222e0723
IEDL.DBID TOX
ISSN 1367-4803
1367-4811
IngestDate Thu Aug 21 18:15:52 EDT 2025
Fri Jul 11 10:32:41 EDT 2025
Wed Feb 19 02:25:39 EST 2025
Thu Apr 24 22:55:54 EDT 2025
Tue Jul 01 02:33:56 EDT 2025
Wed Aug 28 03:17:25 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Language English
License This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
https://creativecommons.org/licenses/by-nc/4.0
The Author(s) 2021. Published by Oxford University Press.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c456t-ca79e28a6ff848ba3eb8b49071a33b4c8014286e8a415b4cb5d785cf5222e0723
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
The authors wish it to be known that, in their opinion, Xiaotao Shen and Si Wu should be regarded as Joint First Authors.
ORCID 0000-0002-9608-9964
OpenAccessLink https://dx.doi.org/10.1093/bioinformatics/btab583
PMID 34432001
PQID 2564489887
PQPubID 23479
PageCount 2
ParticipantIDs pubmedcentral_primary_oai_pubmedcentral_nih_gov_8722759
proquest_miscellaneous_2564489887
pubmed_primary_34432001
crossref_primary_10_1093_bioinformatics_btab583
crossref_citationtrail_10_1093_bioinformatics_btab583
oup_primary_10_1093_bioinformatics_btab583
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2022-01-03
PublicationDateYYYYMMDD 2022-01-03
PublicationDate_xml – month: 01
  year: 2022
  text: 2022-01-03
  day: 03
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
PublicationTitle Bioinformatics
PublicationTitleAlternate Bioinformatics
PublicationYear 2022
Publisher Oxford University Press
Publisher_xml – name: Oxford University Press
References Sumner (2023020108413540100_btab583-B5) 2007; 3
Fraga-Corral (2023020108413540100_btab583-B3) 2020; 10
Wishart (2023020108413540100_btab583-B7) 2016; 15
Tsugawa (2023020108413540100_btab583-B6) 2015; 12
Chaleckis (2023020108413540100_btab583-B1) 2019; 55
Contrepois (2023020108413540100_btab583-B2) 2020; 181
Shen (2023020108413540100_btab583-B4) 2019; 10
References_xml – volume: 181
  start-page: 1112
  year: 2020
  ident: 2023020108413540100_btab583-B2
  article-title: Molecular choreography of acute exercise
  publication-title: Cell
  doi: 10.1016/j.cell.2020.04.043
– volume: 10
  start-page: 109
  year: 2020
  ident: 2023020108413540100_btab583-B3
  article-title: Analytical metabolomics and applications in health, environmental and food science
  publication-title: Crit. Rev. Anal. Chem
– volume: 3
  start-page: 211
  year: 2007
  ident: 2023020108413540100_btab583-B5
  article-title: Proposed minimum reporting standards for chemical analysis
  publication-title: Metabolomics
  doi: 10.1007/s11306-007-0082-2
– volume: 12
  start-page: 523
  year: 2015
  ident: 2023020108413540100_btab583-B6
  article-title: MS-DIAL: data-independent ms/ms deconvolution for comprehensive metabolome analysis
  publication-title: Nat. Methods
  doi: 10.1038/nmeth.3393
– volume: 15
  start-page: 473
  year: 2016
  ident: 2023020108413540100_btab583-B7
  article-title: Emerging applications of metabolomics in drug discovery and precision medicine
  publication-title: Nat. Rev. Drug Discov
  doi: 10.1038/nrd.2016.32
– volume: 10
  start-page: 1516
  year: 2019
  ident: 2023020108413540100_btab583-B4
  article-title: Metabolic reaction network-based recursive metabolite annotation for untargeted metabolomics
  publication-title: Nat. Commun
  doi: 10.1038/s41467-019-09550-x
– volume: 55
  start-page: 44
  year: 2019
  ident: 2023020108413540100_btab583-B1
  article-title: Challenges, progress and promises of metabolite annotation for LC–MS-based metabolomics
  publication-title: Curr. Opin. Biotechnol
  doi: 10.1016/j.copbio.2018.07.010
SSID ssj0051444
ssj0005056
Score 2.5047758
Snippet Abstract Summary Accurate and efficient compound annotation is a long-standing challenge for LC–MS-based data (e.g. untargeted metabolomics and exposomics)....
Accurate and efficient compound annotation is a long-standing challenge for LC-MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts...
SourceID pubmedcentral
proquest
pubmed
crossref
oup
SourceType Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 568
SubjectTerms Applications Notes
Chromatography, Liquid
Databases, Factual
Metabolomics
Software
Tandem Mass Spectrometry
Title metID: an R package for automatable compound annotation for LC−MS-based data
URI https://www.ncbi.nlm.nih.gov/pubmed/34432001
https://www.proquest.com/docview/2564489887
https://pubmed.ncbi.nlm.nih.gov/PMC8722759
Volume 38
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3NSsNAEF5KQfAi_lv_WMGTEJruZpONN6mWKraCttBb2E02WNREND34Bp59RJ_EmSStTUHUW5bMhmUn2fkmM_MNIcdhLKQdC24pnyGpdghXhvsWd41pgcodHeF_yF7f7Q6dq5EY1UhrWguzGML3eVOP05JEFImLmzpTWkjk9wRLjGz5g5vRd1KHjdQwxQCggFP0tEVqb2nzaYHwj8-s2KZKvdsc7FzMnpwzR51VslLiSHpWKH6N1EyyTpaKzpJvG6T_ZLLL81OqEnpLwS1-gGODwkqommQpLAcLpiimk2NXJZBK0iIkn8tctz_fP3p3Flq4iGIO6SYZdi4G7a5Vtk6wQkBEmRUqzzdMKjeOpSO14kZL7YAj3FKcaydEzhgmXSMVGHAYaxF5UoDiAC4Y22N8i9STNDE7hMbgoXh5xBYm2kbKMIyEZJGrfW1izhpETDctCEtecWxv8RgU8W0eVDc7KDe7QZqzec8Fs8avM05AJ38WPpqqLoAvBsMgKjHp5DUAkAc-qQ-na4NsF6qcPZM7DscsswbxKkqeCSAbd_VOMr7PWbmlx5gn_N3_LHKPLDMsp8jT2vZJPXuZmAMAOZk-zN_rL8Y8AAo
linkProvider Oxford University Press
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=metID%3A+an+R+package+for+automatable+compound+annotation+for+LC%E2%88%92MS-based+data&rft.jtitle=Bioinformatics&rft.au=Shen%2C+Xiaotao&rft.au=Wu%2C+Si&rft.au=Liang%2C+Liang&rft.au=Chen%2C+Songjie&rft.date=2022-01-03&rft.pub=Oxford+University+Press&rft.issn=1367-4803&rft.eissn=1460-2059&rft.volume=38&rft.issue=2&rft.spage=568&rft.epage=569&rft_id=info:doi/10.1093%2Fbioinformatics%2Fbtab583&rft.externalDocID=10.1093%2Fbioinformatics%2Fbtab583
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1367-4803&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1367-4803&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1367-4803&client=summon