Evaluation of cell segmentation methods without reference segmentations

We developed metrics and an open source tool to evaluate the cell segmentation quality of multichannel images without a human-labeled reference. Deep learning–based methods performed the best, but postprocessing to ensure proper matching of cell and nuclear masks dramatically improved performance. C...

Full description

Saved in:

Bibliographic Details
Published in	Molecular biology of the cell Vol. 34; no. 6; p. ar50
Main Authors	Chen, Haoran, Murphy, Robert F.
Format	Journal Article
Language	English
Published	United States American Society for Cell Biology 15.05.2023 The American Society for Cell Biology
Subjects	Analysis Cells Methods Public software Segmentation (Morphology) United States
Online Access	Get full text

Cover

Loading…

More Information
Summary:	We developed metrics and an open source tool to evaluate the cell segmentation quality of multichannel images without a human-labeled reference. Deep learning–based methods performed the best, but postprocessing to ensure proper matching of cell and nuclear masks dramatically improved performance. Cell segmentation is a cornerstone of many bioimage informatics studies, and inaccurate segmentation introduces error in downstream analysis. Evaluating segmentation results is thus a necessary step for developing segmentation methods as well as for choosing the most appropriate method for a particular type of sample. The evaluation process has typically involved comparison of segmentations with those generated by humans, which can be expensive and subject to unknown bias. We present here an approach to evaluating cell segmentation methods without relying upon comparison to results from humans. For this, we defined a number of segmentation quality metrics that can be applied to multichannel fluorescence images. We calculated these metrics for 14 previously described segmentation methods applied to datasets from four multiplexed microscope modalities covering five tissues. Using principal component analysis to combine the metrics, we defined an overall cell segmentation quality score and ranked the segmentation methods. We found that two deep learning–based methods performed the best overall, but that results for all methods could be significantly improved by postprocessing to ensure proper matching of cell and nuclear masks. Our evaluation tool is available as open source and all code and data are available in a Reproducible Research Archive.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1059-1524 1939-4586 1939-4586
DOI:	10.1091/mbc.E22-08-0364