Applications of Digital Microscopy and Densely Connected Convolutional Neural Networks for Automated Quantification of Babesia-Infected Erythrocytes

Abstract Background Clinical babesiosis is diagnosed, and parasite burden is determined, by microscopic inspection of a thick or thin Giemsa-stained peripheral blood smear. However, quantitative analysis by manual microscopy is subject to error. As such, methods for the automated measurement of perc...

Full description

Saved in:

Bibliographic Details
Published in	Clinical chemistry (Baltimore, Md.) Vol. 68; no. 1; pp. 218 - 229
Main Authors	Durant, Thomas J S, Dudgeon, Sarah N, McPadden, Jacob, Simpson, Anisia, Price, Nathan, Schulz, Wade L, Torres, Richard, Olson, Eben M
Format	Journal Article
Language	English
Published	England Oxford University Press 01.01.2022
Subjects	Artificial neural networks Automation Babesia Babesiosis Classification Digital imaging Erythrocytes Health aspects Humans Image classification Inspection Learning algorithms Machine learning Microscopy Microscopy - methods Model testing Neural networks Neural Networks, Computer Parasitemia Parasitemia - diagnosis Parasites Peripheral blood Physiological aspects Smear United States peripheral blood smear convolutional neural networks babesia machine learning image analysis erythrocyte
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Abstract Background Clinical babesiosis is diagnosed, and parasite burden is determined, by microscopic inspection of a thick or thin Giemsa-stained peripheral blood smear. However, quantitative analysis by manual microscopy is subject to error. As such, methods for the automated measurement of percent parasitemia in digital microscopic images of peripheral blood smears could improve clinical accuracy, relative to the predicate method. Methods Individual erythrocyte images were manually labeled as “parasite” or “normal” and were used to train a model for binary image classification. The best model was then used to calculate percent parasitemia from a clinical validation dataset, and values were compared to a clinical reference value. Lastly, model interpretability was examined using an integrated gradient to identify pixels most likely to influence classification decisions. Results The precision and recall of the model during development testing were 0.92 and 1.00, respectively. In clinical validation, the model returned increasing positive signal with increasing mean reference value. However, there were 2 highly erroneous false positive values returned by the model. Further, the model incorrectly assessed 3 cases well above the clinical threshold of 10%. The integrated gradient suggested potential sources of false positives including rouleaux formations, cell boundaries, and precipitate as deterministic factors in negative erythrocyte images. Conclusions While the model demonstrated highly accurate single cell classification and correctly assessed most slides, several false positives were highly incorrect. This project highlights the need for integrated testing of machine learning-based models, even when models in the development phase perform well.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	0009-9147 1530-8561 1530-8561
DOI:	10.1093/clinchem/hvab237