Automated discovery of drug treatment patterns for endocrine therapy of breast cancer within an electronic medical record

To develop an algorithm for the discovery of drug treatment patterns for endocrine breast cancer therapy within an electronic medical record and to test the hypothesis that information extracted using it is comparable to the information found by traditional methods. The electronic medical charts of...

Full description

Saved in:
Bibliographic Details
Published inJournal of the American Medical Informatics Association : JAMIA Vol. 19; no. e1; pp. e83 - e89
Main Authors Savova, G. K., Olson, J. E., Murphy, S. P., Cafourek, V. L., Couch, F. J., Goetz, M. P., Ingle, J. N., Suman, V. J., Chute, C. G., Weinshilboum, R. M.
Format Journal Article
LanguageEnglish
Published England BMJ Group 01.06.2012
SeriesFOCUS on clinical research informatics
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:To develop an algorithm for the discovery of drug treatment patterns for endocrine breast cancer therapy within an electronic medical record and to test the hypothesis that information extracted using it is comparable to the information found by traditional methods. The electronic medical charts of 1507 patients diagnosed with histologically confirmed primary invasive breast cancer. The automatic drug treatment classification tool consisted of components for: (1) extraction of drug treatment-relevant information from clinical narratives using natural language processing (clinical Text Analysis and Knowledge Extraction System); (2) extraction of drug treatment data from an electronic prescribing system; (3) merging information to create a patient treatment timeline; and (4) final classification logic. Agreement between results from the algorithm and from a nurse abstractor is measured for categories: (0) no tamoxifen or aromatase inhibitor (AI) treatment; (1) tamoxifen only; (2) AI only; (3) tamoxifen before AI; (4) AI before tamoxifen; (5) multiple AIs and tamoxifen cycles in no specific order; and (6) no specific treatment dates. Specificity (all categories): 96.14%-100%; sensitivity (categories (0)-(4)): 90.27%-99.83%; sensitivity (categories (5)-(6)): 0-23.53%; positive predictive values: 80%-97.38%; negative predictive values: 96.91%-99.93%. Our approach illustrates a secondary use of the electronic medical record. The main challenge is event temporality. We present an algorithm for automated treatment classification within an electronic medical record to combine information extracted through natural language processing with that extracted from structured databases. The algorithm has high specificity for all categories, high sensitivity for five categories, and low sensitivity for two categories.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ISSN:1067-5027
1527-974X
1527-974X
DOI:10.1136/amiajnl-2011-000295