Can Natural Language Processing and Artificial Intelligence Automate The Generation of Billing Codes From Operative Note Dictations?

Study Design Retrospective Cohort Study. Objectives Using natural language processing (NLP) in combination with machine learning on standard operative notes may allow for efficient billing, maximization of collections, and minimization of coder error. This study was conducted as a pilot study to det...

Full description

Saved in:
Bibliographic Details
Published inGlobal spine journal Vol. 13; no. 7; pp. 1946 - 1955
Main Authors Kim, Jun S., Vivas, Andrew, Arvind, Varun, Lombardi, Joseph, Reidler, Jay, Zuckerman, Scott L, Lee, Nathan J., Vulapalli, Meghana, Geng, Eric A, Cho, Brian H., Morizane, Kazuaki, Cho, Samuel K., Lehman, Ronald A., Lenke, Lawrence G., Riew, Kiehyun Daniel
Format Journal Article
LanguageEnglish
Published Los Angeles, CA SAGE Publications 01.09.2023
Sage Publications Ltd
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Study Design Retrospective Cohort Study. Objectives Using natural language processing (NLP) in combination with machine learning on standard operative notes may allow for efficient billing, maximization of collections, and minimization of coder error. This study was conducted as a pilot study to determine if a machine learning algorithm can accurately identify billing Current Procedural Terminology (CPT) codes on patient operative notes. Methods This was a retrospective analysis of operative notes from patients who underwent elective spine surgery by a single senior surgeon from 9/2015 to 1/2020. Algorithm performance was measured by performing receiver operating characteristic (ROC) analysis, calculating the area under the ROC curve (AUC) and the area under the precision-recall curve (AUPRC). A deep learning NLP algorithm and a Random Forest algorithm were both trained and tested on operative notes to predict CPT codes. CPT codes generated by the billing department were compared to those generated by our model. Results The random forest machine learning model had an AUC of .94 and an AUPRC of .85. The deep learning model had a final AUC of .72 and an AUPRC of .44. The random forest model had a weighted average, class-by-class accuracy of 87%. The LSTM deep learning model had a weighted average, class-by-class accuracy 0f 59%. Conclusions Combining natural language processing with machine learning is a valid approach for automatic generation of CPT billing codes. The random forest machine learning model outperformed the LSTM deep learning model in this case. These models can be used by orthopedic or neurosurgery departments to allow for efficient billing.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2192-5682
2192-5690
DOI:10.1177/21925682211062831