metID: an R package for automatable compound annotation for LC−MS-based data
Abstract Summary Accurate and efficient compound annotation is a long-standing challenge for LC–MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information use...
Saved in:
Published in | Bioinformatics Vol. 38; no. 2; pp. 568 - 569 |
---|---|
Main Authors | , , , , , , |
Format | Journal Article |
Language | English |
Published |
England
Oxford University Press
03.01.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Abstract
Summary
Accurate and efficient compound annotation is a long-standing challenge for LC–MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials.
Availability and implementation
https://jaspershen.github.io/metID.
Supplementary information
Supplementary data are available at Bioinformatics online. |
---|---|
AbstractList | Accurate and efficient compound annotation is a long-standing challenge for LC-MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials.SUMMARYAccurate and efficient compound annotation is a long-standing challenge for LC-MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials.https://jaspershen.github.io/metID.AVAILABILITY AND IMPLEMENTATIONhttps://jaspershen.github.io/metID.Supplementary data are available at Bioinformatics online.SUPPLEMENTARY INFORMATIONSupplementary data are available at Bioinformatics online. Abstract Summary Accurate and efficient compound annotation is a long-standing challenge for LC–MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials. Availability and implementation https://jaspershen.github.io/metID. Supplementary information Supplementary data are available at Bioinformatics online. Accurate and efficient compound annotation is a long-standing challenge for LC-MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streamlined. Therefore, we developed metID, an R package that combines information from all major databases for comprehensive and streamlined compound annotation. metID is a flexible, simple and powerful tool that can be installed on all platforms, allowing the compound annotation process to be fully automatic and reproducible. A detailed tutorial and a case study are provided in Supplementary Materials. https://jaspershen.github.io/metID. Supplementary data are available at Bioinformatics online. |
Author | Liang, Liang Wu, Si Shen, Xiaotao Contrepois, Kévin Chen, Songjie Snyder, Michael Zhu, Zheng-Jiang |
AuthorAffiliation | 2 Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences , Shanghai 200032, China 1 Department of Genetics, Stanford University School of Medicine , Stanford, CA 94304, USA |
AuthorAffiliation_xml | – name: 2 Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences , Shanghai 200032, China – name: 1 Department of Genetics, Stanford University School of Medicine , Stanford, CA 94304, USA |
Author_xml | – sequence: 1 givenname: Xiaotao orcidid: 0000-0002-9608-9964 surname: Shen fullname: Shen, Xiaotao organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA – sequence: 2 givenname: Si surname: Wu fullname: Wu, Si organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA – sequence: 3 givenname: Liang surname: Liang fullname: Liang, Liang organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA – sequence: 4 givenname: Songjie surname: Chen fullname: Chen, Songjie organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA – sequence: 5 givenname: Kévin surname: Contrepois fullname: Contrepois, Kévin organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA – sequence: 6 givenname: Zheng-Jiang surname: Zhu fullname: Zhu, Zheng-Jiang email: jiangzhu@sioc.ac.cn organization: Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai 200032, China – sequence: 7 givenname: Michael surname: Snyder fullname: Snyder, Michael email: mpsnyder@stanford.edu organization: Department of Genetics, Stanford University School of Medicine, Stanford, CA 94304, USA |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/34432001$$D View this record in MEDLINE/PubMed |
BookMark | eNqNkctO3TAURS0E4lV-AWXYSYqfiYMqJHTLS7qA1MfYOnYcMCR2GieV-APGfCJfguFeEHTSjmzLa-99dPYWWvXBW4R2Cf5CcMX2tAvON2HoYHQm7ukRtJBsBW0SXuCcYlGtpjsrypxLzDbQVow3GAvCOV9HG4xzRjEmm-iis-PZt_0MfPY968HcwpXNkm8G0xiSOejWZiZ0fZh8nSgfxpQY_Asznz3eP5z_yDVEW2d1oj-htQbaaHeW5zb6dXz0c3aazy9PzmaH89xwUYy5gbKyVELRNJJLDcxqqXmFSwKMaW4kJpzKwkrgRKS3FnUphWkEpdTikrJtdLDw7Sfd2dpYPw7Qqn5wHQx3KoBTH3-8u1ZX4Y-SJaWlqJLB56XBEH5PNo6qc9HYtgVvwxQVFQXnspKyTOju-6y3kNclJqBYAGYIMQ62eUMIVs9tqY9tqWVbSfj1L6Fxi_WmmV37bzlZyMPU_2_kE1wxtns |
CitedBy_id | crossref_primary_10_1093_bioinformatics_btac546 crossref_primary_10_1016_j_envpol_2024_124460 crossref_primary_10_1038_s41467_023_42659_8 crossref_primary_10_1038_s41467_024_51433_3 crossref_primary_10_3168_jds_2024_25167 crossref_primary_10_1016_j_isci_2023_106881 crossref_primary_10_1038_s41467_022_32155_w crossref_primary_10_1016_j_talanta_2025_127812 crossref_primary_10_1038_s41551_022_00999_8 crossref_primary_10_1038_s43587_024_00692_2 crossref_primary_10_1021_acs_chemrev_4c00567 crossref_primary_10_1186_s12859_024_06022_y crossref_primary_10_1007_s00204_023_03620_2 crossref_primary_10_1021_acs_est_4c01156 crossref_primary_10_1021_acsestwater_4c00977 crossref_primary_10_1016_j_tifs_2023_104298 crossref_primary_10_1016_j_tifs_2023_104213 crossref_primary_10_1101_gr_276521_121 crossref_primary_10_1111_acel_13813 crossref_primary_10_1111_jipb_13774 crossref_primary_10_1016_j_scitotenv_2024_171876 crossref_primary_10_1016_j_scitotenv_2024_174715 crossref_primary_10_3390_metabo12020173 |
Cites_doi | 10.1016/j.cell.2020.04.043 10.1007/s11306-007-0082-2 10.1038/nmeth.3393 10.1038/nrd.2016.32 10.1038/s41467-019-09550-x 10.1016/j.copbio.2018.07.010 |
ContentType | Journal Article |
Copyright | The Author(s) 2021. Published by Oxford University Press. 2021 The Author(s) 2021. Published by Oxford University Press. |
Copyright_xml | – notice: The Author(s) 2021. Published by Oxford University Press. 2021 – notice: The Author(s) 2021. Published by Oxford University Press. |
DBID | TOX AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8 5PM |
DOI | 10.1093/bioinformatics/btab583 |
DatabaseName | Oxford Journals Open Access Collection CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic PubMed Central (Full Participant titles) |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
DatabaseTitleList | MEDLINE - Academic MEDLINE |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 3 dbid: TOX name: Oxford Journals Open Access Collection url: https://academic.oup.com/journals/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EISSN | 1460-2059 1367-4811 |
EndPage | 569 |
ExternalDocumentID | PMC8722759 34432001 10_1093_bioinformatics_btab583 10.1093/bioinformatics/btab583 |
Genre | Journal Article |
GroupedDBID | -~X .2P .I3 482 48X 53G 5GY 6.Y AAIMJ AAJKP AAKPC AAMVS AAPQZ AAPXW AARHZ AAVAP ABEFU ABNKS ABPTD ABSAR ABSMQ ABWST ABXVV ABZBJ ACGFS ACMRT ACPQN ACUFI ACYTK ADEYI ADFTL ADGZP ADHKW ADOCK ADRIX ADRTK ADYVW ADZTZ ADZXQ AECKG AEGPL AEJOX AEKKA AEKPW AEKSI AELWJ AEPUE AETBJ AFFNX AFFZL AFOFC AFSHK AFXEN AGINJ AGKRT AGQXC AI. ALMA_UNASSIGNED_HOLDINGS ALTZX AQDSO ARIXL ASAOO ATDFG ATTQO AXUDD AYOIW AZFZN AZVOD BCRHZ BHONS CXTWN CZ4 DFGAJ EE~ ELUNK F5P F9B FEDTE H5~ HAR HVGLF HW0 IOX KOP KSI KSN MBTAY MVM NGC PB- Q1. Q5Y QBD RD5 RIG ROL ROX ROZ RXO TCN TLC TN5 TOX TR2 VH1 WH7 XJT ZGI ~91 --- -E4 .DC 0R~ 23N 2WC 4.4 5WA 70D AAIJN AAMDB AAOGV AAVLN AAYXX ABEJV ABEUO ABGNP ABIXL ABPQP ABQLI ACIWK ACPRK ACUXJ ADBBV ADEZT ADGKP ADHZD ADMLS ADPDF ADRDM ADVEK AEMDU AENEX AENZO AEWNT AFGWE AFIYH AFRAH AGKEF AGSYK AHMBA AHXPO AIJHB AJEEA AJEUX AKHUL AKWXX ALUQC AMNDL APIBT APWMN ASPBG AVWKF BAWUL BAYMD BQDIO BQUQU BSWAC BTQHN C45 CDBKE CITATION CS3 DAKXR DIK DILTD DU5 D~K EBD EBS EMOBN FHSFR FLIZI FLUFQ FOEOM FQBLK GAUVT GJXCC GROUPED_DOAJ GX1 H13 HZ~ J21 JXSIZ KAQDR KQ8 M-Z MK~ ML0 N9A NLBLG NMDNZ NOMLY NU- O9- OAWHX ODMLO OJQWA OK1 OVD OVEED P2P PAFKI PEELM PQQKQ R44 RNS RPM RUSNO RW1 SV3 TEORI TJP W8F WOQ X7H YAYTL YKOAZ YXANX ZKX ~KM CGR CUY CVF ECM EIF M49 NPM 7X8 5PM |
ID | FETCH-LOGICAL-c456t-ca79e28a6ff848ba3eb8b49071a33b4c8014286e8a415b4cb5d785cf5222e0723 |
IEDL.DBID | TOX |
ISSN | 1367-4803 1367-4811 |
IngestDate | Thu Aug 21 18:15:52 EDT 2025 Fri Jul 11 10:32:41 EDT 2025 Wed Feb 19 02:25:39 EST 2025 Thu Apr 24 22:55:54 EDT 2025 Tue Jul 01 02:33:56 EDT 2025 Wed Aug 28 03:17:25 EDT 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 2 |
Language | English |
License | This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com https://creativecommons.org/licenses/by-nc/4.0 The Author(s) 2021. Published by Oxford University Press. |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c456t-ca79e28a6ff848ba3eb8b49071a33b4c8014286e8a415b4cb5d785cf5222e0723 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 The authors wish it to be known that, in their opinion, Xiaotao Shen and Si Wu should be regarded as Joint First Authors. |
ORCID | 0000-0002-9608-9964 |
OpenAccessLink | https://dx.doi.org/10.1093/bioinformatics/btab583 |
PMID | 34432001 |
PQID | 2564489887 |
PQPubID | 23479 |
PageCount | 2 |
ParticipantIDs | pubmedcentral_primary_oai_pubmedcentral_nih_gov_8722759 proquest_miscellaneous_2564489887 pubmed_primary_34432001 crossref_primary_10_1093_bioinformatics_btab583 crossref_citationtrail_10_1093_bioinformatics_btab583 oup_primary_10_1093_bioinformatics_btab583 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2022-01-03 |
PublicationDateYYYYMMDD | 2022-01-03 |
PublicationDate_xml | – month: 01 year: 2022 text: 2022-01-03 day: 03 |
PublicationDecade | 2020 |
PublicationPlace | England |
PublicationPlace_xml | – name: England |
PublicationTitle | Bioinformatics |
PublicationTitleAlternate | Bioinformatics |
PublicationYear | 2022 |
Publisher | Oxford University Press |
Publisher_xml | – name: Oxford University Press |
References | Sumner (2023020108413540100_btab583-B5) 2007; 3 Fraga-Corral (2023020108413540100_btab583-B3) 2020; 10 Wishart (2023020108413540100_btab583-B7) 2016; 15 Tsugawa (2023020108413540100_btab583-B6) 2015; 12 Chaleckis (2023020108413540100_btab583-B1) 2019; 55 Contrepois (2023020108413540100_btab583-B2) 2020; 181 Shen (2023020108413540100_btab583-B4) 2019; 10 |
References_xml | – volume: 181 start-page: 1112 year: 2020 ident: 2023020108413540100_btab583-B2 article-title: Molecular choreography of acute exercise publication-title: Cell doi: 10.1016/j.cell.2020.04.043 – volume: 10 start-page: 109 year: 2020 ident: 2023020108413540100_btab583-B3 article-title: Analytical metabolomics and applications in health, environmental and food science publication-title: Crit. Rev. Anal. Chem – volume: 3 start-page: 211 year: 2007 ident: 2023020108413540100_btab583-B5 article-title: Proposed minimum reporting standards for chemical analysis publication-title: Metabolomics doi: 10.1007/s11306-007-0082-2 – volume: 12 start-page: 523 year: 2015 ident: 2023020108413540100_btab583-B6 article-title: MS-DIAL: data-independent ms/ms deconvolution for comprehensive metabolome analysis publication-title: Nat. Methods doi: 10.1038/nmeth.3393 – volume: 15 start-page: 473 year: 2016 ident: 2023020108413540100_btab583-B7 article-title: Emerging applications of metabolomics in drug discovery and precision medicine publication-title: Nat. Rev. Drug Discov doi: 10.1038/nrd.2016.32 – volume: 10 start-page: 1516 year: 2019 ident: 2023020108413540100_btab583-B4 article-title: Metabolic reaction network-based recursive metabolite annotation for untargeted metabolomics publication-title: Nat. Commun doi: 10.1038/s41467-019-09550-x – volume: 55 start-page: 44 year: 2019 ident: 2023020108413540100_btab583-B1 article-title: Challenges, progress and promises of metabolite annotation for LC–MS-based metabolomics publication-title: Curr. Opin. Biotechnol doi: 10.1016/j.copbio.2018.07.010 |
SSID | ssj0051444 ssj0005056 |
Score | 2.5047758 |
Snippet | Abstract
Summary
Accurate and efficient compound annotation is a long-standing challenge for LC–MS-based data (e.g. untargeted metabolomics and exposomics).... Accurate and efficient compound annotation is a long-standing challenge for LC-MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts... |
SourceID | pubmedcentral proquest pubmed crossref oup |
SourceType | Open Access Repository Aggregation Database Index Database Enrichment Source Publisher |
StartPage | 568 |
SubjectTerms | Applications Notes Chromatography, Liquid Databases, Factual Metabolomics Software Tandem Mass Spectrometry |
Title | metID: an R package for automatable compound annotation for LC−MS-based data |
URI | https://www.ncbi.nlm.nih.gov/pubmed/34432001 https://www.proquest.com/docview/2564489887 https://pubmed.ncbi.nlm.nih.gov/PMC8722759 |
Volume | 38 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3NSsNAEF5KQfAi_lv_WMGTEJruZpONN6mWKraCttBb2E02WNREND34Bp59RJ_EmSStTUHUW5bMhmUn2fkmM_MNIcdhLKQdC24pnyGpdghXhvsWd41pgcodHeF_yF7f7Q6dq5EY1UhrWguzGML3eVOP05JEFImLmzpTWkjk9wRLjGz5g5vRd1KHjdQwxQCggFP0tEVqb2nzaYHwj8-s2KZKvdsc7FzMnpwzR51VslLiSHpWKH6N1EyyTpaKzpJvG6T_ZLLL81OqEnpLwS1-gGODwkqommQpLAcLpiimk2NXJZBK0iIkn8tctz_fP3p3Flq4iGIO6SYZdi4G7a5Vtk6wQkBEmRUqzzdMKjeOpSO14kZL7YAj3FKcaydEzhgmXSMVGHAYaxF5UoDiAC4Y22N8i9STNDE7hMbgoXh5xBYm2kbKMIyEZJGrfW1izhpETDctCEtecWxv8RgU8W0eVDc7KDe7QZqzec8Fs8avM05AJ38WPpqqLoAvBsMgKjHp5DUAkAc-qQ-na4NsF6qcPZM7DscsswbxKkqeCSAbd_VOMr7PWbmlx5gn_N3_LHKPLDMsp8jT2vZJPXuZmAMAOZk-zN_rL8Y8AAo |
linkProvider | Oxford University Press |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=metID%3A+an+R+package+for+automatable+compound+annotation+for+LC%E2%88%92MS-based+data&rft.jtitle=Bioinformatics&rft.au=Shen%2C+Xiaotao&rft.au=Wu%2C+Si&rft.au=Liang%2C+Liang&rft.au=Chen%2C+Songjie&rft.date=2022-01-03&rft.pub=Oxford+University+Press&rft.issn=1367-4803&rft.eissn=1460-2059&rft.volume=38&rft.issue=2&rft.spage=568&rft.epage=569&rft_id=info:doi/10.1093%2Fbioinformatics%2Fbtab583&rft.externalDocID=10.1093%2Fbioinformatics%2Fbtab583 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1367-4803&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1367-4803&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1367-4803&client=summon |