Automated identification of extreme-risk events in clinical incident reports

To explore the feasibility of using statistical text classification to automatically detect extreme-risk events in clinical incident reports. Statistical text classifiers based on Naïve Bayes and Support Vector Machine (SVM) algorithms were trained and tested on clinical incident reports to automati...

Full description

Saved in:

Bibliographic Details
Published in	Journal of the American Medical Informatics Association : JAMIA Vol. 19; no. e1; pp. e110 - e118
Main Authors	Ong, Mei-Sing, Magrabi, Farah, Coiera, Enrico
Format	Journal Article
Language	English
Published	England BMJ Group 01.06.2012
Series	FOCUS on clinical research informatics
Subjects	Artificial Intelligence Bayes Theorem Humans Medical Errors - classification Research and Applications Risk Management - methods Support Vector Machine
Online Access	Get full text

Cover

Loading…

More Information
Summary:	To explore the feasibility of using statistical text classification to automatically detect extreme-risk events in clinical incident reports. Statistical text classifiers based on Naïve Bayes and Support Vector Machine (SVM) algorithms were trained and tested on clinical incident reports to automatically detect extreme-risk events, defined by incidents that satisfy the criteria of Severity Assessment Code (SAC) level 1. For this purpose, incident reports submitted to the Advanced Incident Management System by public hospitals from one Australian region were used. The classifiers were evaluated on two datasets: (1) a set of reports with diverse incident types (n=120); (2) a set of reports associated with patient misidentification (n=166). Results were assessed using accuracy, precision, recall, F-measure, and area under the curve (AUC) of receiver operating characteristic curves. The classifiers performed well on both datasets. In the multi-type dataset, SVM with a linear kernel performed best, identifying 85.8% of SAC level 1 incidents (precision=0.88, recall=0.83, F-measure=0.86, AUC=0.92). In the patient misidentification dataset, 96.4% of SAC level 1 incidents were detected when SVM with linear, polynomial or radial-basis function kernel was used (precision=0.99, recall=0.94, F-measure=0.96, AUC=0.98). Naïve Bayes showed reasonable performance, detecting 80.8% of SAC level 1 incidents in the multi-type dataset and 89.8% of SAC level 1 patient misidentification incidents. Overall, higher prediction accuracy was attained on the specialized dataset, compared with the multi-type dataset. Text classification techniques can be applied effectively to automate the detection of extreme-risk events in clinical incident reports.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 ObjectType-Article-2 ObjectType-Feature-1
ISSN:	1067-5027 1527-974X 1527-974X
DOI:	10.1136/amiajnl-2011-000562