Application of a Neural Network to Improve Nodal Staging Accuracy with ^sup 18^F-FDG PET in Non-Small Cell Lung Cancer
We proposed to train a back-propagation artificial neural network (aNN) on a cohort of surgically proven non-small cell lung cancers (NSCLCs) and compare its accuracy with that of a trained (18)F-FDG PET reader. We plan to show that an aNN trained on (18)F-FDG PET- and CT-derived data is more accura...
Saved in:
Published in | The Journal of nuclear medicine (1978) Vol. 44; no. 12; p. 1918 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
New York
Society of Nuclear Medicine
01.12.2003
|
Online Access | Get full text |
Cover
Loading…
Summary: | We proposed to train a back-propagation artificial neural network (aNN) on a cohort of surgically proven non-small cell lung cancers (NSCLCs) and compare its accuracy with that of a trained (18)F-FDG PET reader. We plan to show that an aNN trained on (18)F-FDG PET- and CT-derived data is more accurate in predicting the true surgicopathologic nodal stage than a human reader. One hundred thirty-three NSCLC patients with surgically proven N status treated at the University of Washington Medical Center or the Veterans Affairs Puget Sound Health Care System between February 1998 and September 2002 were used as inputs for the creation of an aNN. From CT of the thorax and (18)F-FDG PET (neck to pelvis) performed before surgery, we extracted the primary tumor size and uptake (maximum pixel SUV [maxSUV]), normal lung and mediastinal uptake, and nodal uptake (maxSUV). Using the same 133 cases, the same output (surgical N status, N(0) to N(3)), and the same software configuration settings, scenarios were created to assess which input parameters were most influential in creating an optimal aNN. To compute this optimal aNN, cases were split randomly 100 times into a training subset of 103 cases and a testing subset of 30 cases having the same proportion of N(0), N(1), N(2), and N(3) cases. N status predicted by the aNN was compared with the proven surgical N status to calculate the aNN accuracy. The N status readings from (18)F-FDG PET were also compared with the surgical N status for the same cases to determine (18)F-FDG PET accuracy. Statistical tests demonstrate that the best aNN accuracy is achieved by using N(1)-N(2)- N(3) nodal maxSUV divided by background uptake, the primary tumor size, and primary tumor maxSUV as inputs. The aNN correctly predicted the N stage in 87.3% of the testing cases compared with 73.5% for the (18)F-FDG PET expert reader. Accuracy of the aNN increased to 94.8% (PET, 89.4%) when comparing N(0) + N(1) with N(2) or N(3) status and to 94.9% (PET, 91.9%) when comparing N(0) + N(1) with N(2) + N(3) status. A back-propagation aNN can be trained to predict hilar and mediastinal nodal involvement with greater accuracy than an expert (18)F-FDG PET reader. Such a tool could be used to improve clinical interpretations and for clinical training. |
---|---|
ISSN: | 0161-5505 1535-5667 |