Optimised spectral pre-processing for discrimination of biofluids via ATR-FTIR spectroscopy

Pre-processing is an essential step in the analysis of spectral data. Mid-IR spectroscopy of biological samples is often subject to instrumental and sample specific variances which may often conceal valuable biological information. Whilst pre-processing can effectively reduce this unwanted variance,...

Full description

Saved in:
Bibliographic Details
Published inAnalyst (London) Vol. 143; no. 24; pp. 6121 - 6134
Main Authors Butler, Holly J., Smith, Benjamin R., Fritzsch, Robby, Radhakrishnan, Pretheepan, Palmer, David S., Baker, Matthew J.
Format Journal Article
LanguageEnglish
Published England Royal Society of Chemistry 03.12.2018
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Pre-processing is an essential step in the analysis of spectral data. Mid-IR spectroscopy of biological samples is often subject to instrumental and sample specific variances which may often conceal valuable biological information. Whilst pre-processing can effectively reduce this unwanted variance, the plethora of possible processing steps has resulted in a lack of consensus in the field, often meaning that analysis outputs are not comparable. As pre-processing is specific to the sample under investigation, here we present a systematic approach for defining the optimum pre-processing protocol for biofluid ATR-FTIR spectroscopy. Using a trial-and-error based approach and a clinically relevant dataset describing control and brain cancer patients, the effects of pre-processing permutations on subsequent classification algorithms were observed, by assessing key diagnostic performance parameters, including sensitivity and specificity. It was found that optimum diagnostic performance correlated with the use of minimal binning and baseline correction, with derivative functions improving diagnostic performance most significantly. If smoothing is required, a Sovitzky–Golay approach was the preferred option in this investigation. Heavy binning appeared to reduce classification most significantly, alongside wavelet noise reduction (filter length ≥6), resulting in the lowest diagnostic performances of all pre-processing permutations tested.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:0003-2654
1364-5528
1364-5528
DOI:10.1039/C8AN01384E