Wavenumber selection method to determine the concentration of cocaine and adulterants in cocaine samples

•We propose a new framework for selecting predictive wavenumbers in FTIR.•Our propositions are applied to cocaine samples.•The method is tested in a dataset comprised of five different response variables (cocaine and four adulterants).•Retained wavenumbers are discussed. Street cocaine is typically...

Full description

Saved in:
Bibliographic Details
Published inJournal of pharmaceutical and biomedical analysis Vol. 152; pp. 120 - 127
Main Authors Kahmann, A., Anzanello, M.J., Fogliatto, F.S., Marcelo, M.C.A., Ferrão, M.F., Ortiz, R.S., Mariotti, K.C.
Format Journal Article
LanguageEnglish
Published England Elsevier B.V 15.04.2018
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•We propose a new framework for selecting predictive wavenumbers in FTIR.•Our propositions are applied to cocaine samples.•The method is tested in a dataset comprised of five different response variables (cocaine and four adulterants).•Retained wavenumbers are discussed. Street cocaine is typically altered with several compounds that increase its harmful health-related side effects, most notably depression, convulsions, and severe damages to the cardiovascular system, lungs, and brain. Thus, determining the concentration of cocaine and adulterants in seized drug samples is important from both health and forensic perspectives. Although FTIR has been widely used to identify the fingerprint and concentration of chemical compounds, spectroscopy datasets are usually comprised of thousands of highly correlated wavenumbers which, when used as predictors in regression models, tend to undermine the predictive performance of multivariate techniques. In this paper, we propose an FTIR wavenumber selection method aimed at identifying FTIR spectra intervals that best predict the concentration of cocaine and adulterants (e.g. caffeine, phenacetin, levamisole, and lidocaine) in cocaine samples. For that matter, the Mutual Information measure is integrated into a Quadratic Programming problem with the objective of minimizing the probability of retaining redundant wavenumbers, while maximizing the relationship between retained wavenumbers and compounds’ concentrations. Optimization outputs guide the order of inclusion of wavenumbers in a predictive model, using a forward-based wavenumber selection method. After the inclusion of each wavenumber, parameters of three alternative regression models are estimated, and each model’s prediction error is assessed through the Mean Average Error (MAE) measure; the recommended subset of retained wavenumbers is the one that minimizes the prediction error with maximum parsimony. Using our propositions in a dataset of 115 cocaine samples we obtained a best prediction model with average MAE of 0.0502 while retaining only 2.29% of the original wavenumbers, increasing the predictive precision by 0.0359 when compared to a model using the complete set of wavenumbers as predictors.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0731-7085
1873-264X
DOI:10.1016/j.jpba.2018.01.050