An integrated approach for mixture analysis using MS and NMR techniques

We suggest an improved software pipeline for mixture analysis. The improvements include combining tandem MS and 2D NMR data for a reliable identification of the constituents in an algorithm based on network analysis aiming for a robust and reliable identification routine. An important part of this p...

Full description

Saved in:
Bibliographic Details
Published inFaraday discussions Vol. 218; pp. 339 - 353
Main Authors Kuhn, Stefan, Colreavy-Donnelly, Simon, Santana de Souza, Juliana, Borges, Ricardo Moreira
Format Journal Article
LanguageEnglish
Published England Royal Society of Chemistry 15.08.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We suggest an improved software pipeline for mixture analysis. The improvements include combining tandem MS and 2D NMR data for a reliable identification of the constituents in an algorithm based on network analysis aiming for a robust and reliable identification routine. An important part of this pipeline is the use of open-data repositories, although it is not totally reliant on them. The NMR identification step emphasizes robustness and is less sensitive towards changes in data acquisition and processing than existing methods. The process starts with LC-ESI-MSMS based molecular network dereplication using data from the GNPS collaborative collection. We identify closely related structures by propagating structure elucidation through edges in the network. Those identified compounds are added on top of a candidate list for the following NMR filtering method that predicts HSQC and HMBC NMR data. The similarity of the predicted spectra of the set of closely related structures to the measured spectra of the mixture sample is taken as one indication of the most likely candidates for its compounds. The other indication is the match of the spectra to clusters built by a network analysis from the spectra of the mixture. The sensitivity gap between NMR and MS is anticipated and it will be reflected naturally by the eventual identification of fewer compounds, but with a higher confidence level, after the NMR analysis step. The contributions of the paper are an algorithm combining MS and NMR spectroscopy and a robust n J CH network analysis to explore the complementary aspects of both techniques. This delivers good results, even if a perfect computational separation of the compounds in the mixture is not possible. All of the scripts are freely available to aid studies such as with plants, marine organisms, and microorganism natural product chemistry and metabolomics, as those are the driving forces for this project. We suggest an improved software pipeline for mixture analysis.
Bibliography:The spectral data (HSQC and HMBC) of
10.1039/c8fd00227d
Electronic supplementary information (ESI) available: The data in realspectrum.csv and compounds.smi are for
P. boldus
https://github.com/stefhk3/nmrfilter
contains all code described in Section 2.2. See DOI
are provided as well. Additionally
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1359-6640
1364-5498
DOI:10.1039/c8fd00227d