Decision tree classification of proteins identified by mass spectrometry of blood serum samples from people with and without lung cancer
A classification and regression tree (CART) model was trained to classify 41 clinical specimens as disease/nondisease based on 26 variables computed from the mass‐to‐charge ratio (m/z) and peak heights of proteins identified by mass spectroscopy. The CART model built on all of the specimens (no cros...
Saved in:
Published in | Proteomics (Weinheim) Vol. 3; no. 9; pp. 1678 - 1679 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
Weinheim
WILEY-VCH Verlag
01.09.2003
WILEY‐VCH Verlag |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | A classification and regression tree (CART) model was trained to classify 41 clinical specimens as disease/nondisease based on 26 variables computed from the mass‐to‐charge ratio (m/z) and peak heights of proteins identified by mass spectroscopy. The CART model built on all of the specimens (no cross‐validation) had an error rate of 4/41 = 10%. The CART model suggests that mass spectra peaks in the 8000–10 000, 20 000–30 000, 45 000–60 000, and >125 000 m/z ranges may be valuable in distinguishing between the disease/nondisease specimens. The area under the receiver operating characteristics curve was 0.80 ± 0.07 for leave‐one‐out cross‐validation. |
---|---|
Bibliography: | istex:D3E669BD7E1DF97B815DDDBEC8737578C6454DCD ark:/67375/WNG-4J7CL126-H ArticleID:PMIC200300521 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1615-9853 1615-9861 |
DOI: | 10.1002/pmic.200300521 |