Evaluation of cell segmentation methods without reference segmentations
We developed metrics and an open source tool to evaluate the cell segmentation quality of multichannel images without a human-labeled reference. Deep learning–based methods performed the best, but postprocessing to ensure proper matching of cell and nuclear masks dramatically improved performance. C...
Saved in:
Published in | Molecular biology of the cell Vol. 34; no. 6; p. ar50 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
United States
American Society for Cell Biology
15.05.2023
The American Society for Cell Biology |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | We developed metrics and an open source tool to evaluate the cell segmentation quality of multichannel images without a human-labeled reference. Deep learning–based methods performed the best, but postprocessing to ensure proper matching of cell and nuclear masks dramatically improved performance.
Cell segmentation is a cornerstone of many bioimage informatics studies, and inaccurate segmentation introduces error in downstream analysis. Evaluating segmentation results is thus a necessary step for developing segmentation methods as well as for choosing the most appropriate method for a particular type of sample. The evaluation process has typically involved comparison of segmentations with those generated by humans, which can be expensive and subject to unknown bias. We present here an approach to evaluating cell segmentation methods without relying upon comparison to results from humans. For this, we defined a number of segmentation quality metrics that can be applied to multichannel fluorescence images. We calculated these metrics for 14 previously described segmentation methods applied to datasets from four multiplexed microscope modalities covering five tissues. Using principal component analysis to combine the metrics, we defined an overall cell segmentation quality score and ranked the segmentation methods. We found that two deep learning–based methods performed the best overall, but that results for all methods could be significantly improved by postprocessing to ensure proper matching of cell and nuclear masks. Our evaluation tool is available as open source and all code and data are available in a Reproducible Research Archive. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1059-1524 1939-4586 1939-4586 |
DOI: | 10.1091/mbc.E22-08-0364 |