Decision tree classification of proteins identified by mass spectrometry of blood serum samples from people with and without lung cancer

A classification and regression tree (CART) model was trained to classify 41 clinical specimens as disease/nondisease based on 26 variables computed from the mass‐to‐charge ratio (m/z) and peak heights of proteins identified by mass spectroscopy. The CART model built on all of the specimens (no cros...

Full description

Saved in:
Bibliographic Details
Published inProteomics (Weinheim) Vol. 3; no. 9; pp. 1678 - 1679
Main Authors Markey, Mia K., Tourassi, Georgia D., Floyd Jr, Carey E.
Format Journal Article
LanguageEnglish
Published Weinheim WILEY-VCH Verlag 01.09.2003
WILEY‐VCH Verlag
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A classification and regression tree (CART) model was trained to classify 41 clinical specimens as disease/nondisease based on 26 variables computed from the mass‐to‐charge ratio (m/z) and peak heights of proteins identified by mass spectroscopy. The CART model built on all of the specimens (no cross‐validation) had an error rate of 4/41 = 10%. The CART model suggests that mass spectra peaks in the 8000–10 000, 20 000–30 000, 45 000–60 000, and >125 000 m/z ranges may be valuable in distinguishing between the disease/nondisease specimens. The area under the receiver operating characteristics curve was 0.80 ± 0.07 for leave‐one‐out cross‐validation.
Bibliography:istex:D3E669BD7E1DF97B815DDDBEC8737578C6454DCD
ark:/67375/WNG-4J7CL126-H
ArticleID:PMIC200300521
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1615-9853
1615-9861
DOI:10.1002/pmic.200300521