Optimised spectral pre-processing for discrimination of biofluids via ATR-FTIR spectroscopy

Pre-processing is an essential step in the analysis of spectral data. Mid-IR spectroscopy of biological samples is often subject to instrumental and sample specific variances which may often conceal valuable biological information. Whilst pre-processing can effectively reduce this unwanted variance,...

Full description

Saved in:

Bibliographic Details
Published in	Analyst (London) Vol. 143; no. 24; pp. 6121 - 6134
Main Authors	Butler, Holly J., Smith, Benjamin R., Fritzsch, Robby, Radhakrishnan, Pretheepan, Palmer, David S., Baker, Matthew J.
Format	Journal Article
Language	English
Published	England Royal Society of Chemistry 03.12.2018
Subjects	Adolescent Adult Aged Aged, 80 and over Algorithms Biological properties Blood Chemical Analysis - statistics & numerical data Brain Brain Neoplasms - blood Brain Neoplasms - diagnosis Classification Datasets as Topic Diagnostic systems Female Fourier transforms Humans Infrared spectroscopy Machine Learning Male Middle Aged Noise reduction Parameter sensitivity Permutations Sensitivity analysis Sensitivity and Specificity Spectroscopy, Fourier Transform Infrared - methods Spectrum analysis Wavelet Young Adult
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Pre-processing is an essential step in the analysis of spectral data. Mid-IR spectroscopy of biological samples is often subject to instrumental and sample specific variances which may often conceal valuable biological information. Whilst pre-processing can effectively reduce this unwanted variance, the plethora of possible processing steps has resulted in a lack of consensus in the field, often meaning that analysis outputs are not comparable. As pre-processing is specific to the sample under investigation, here we present a systematic approach for defining the optimum pre-processing protocol for biofluid ATR-FTIR spectroscopy. Using a trial-and-error based approach and a clinically relevant dataset describing control and brain cancer patients, the effects of pre-processing permutations on subsequent classification algorithms were observed, by assessing key diagnostic performance parameters, including sensitivity and specificity. It was found that optimum diagnostic performance correlated with the use of minimal binning and baseline correction, with derivative functions improving diagnostic performance most significantly. If smoothing is required, a Sovitzky–Golay approach was the preferred option in this investigation. Heavy binning appeared to reduce classification most significantly, alongside wavelet noise reduction (filter length ≥6), resulting in the lowest diagnostic performances of all pre-processing permutations tested.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	0003-2654 1364-5528 1364-5528
DOI:	10.1039/C8AN01384E