LDMP-RENet: Reducing intra-class differences for metal surface defect few-shot semantic segmentation

Given their fast generalization capability for unseen classes and segmentation ability at pixel scale, models based on few-shot segmentation perform well in solving data insuﬃciency problems during metal defect detection and in delineating refined objects under industrial scenarios. Extant researche...

Full description

Saved in:

Bibliographic Details
Published in	PloS one Vol. 20; no. 3; p. e0318553
Main Authors	Zhang, Jiyan, Ding, Hanze, Wu, Zhangkai, Peng, Ming, Liu, Yanfang
Format	Journal Article
Language	English
Published	United States Public Library of Science 17.03.2025 Public Library of Science (PLoS)
Subjects	Algorithms Benchmarks Biology and Life Sciences Data integration Datasets Defects Distortion Engineering and Technology Evaluation Excitation Image Processing, Computer-Assisted - methods Image segmentation Metal products Metal surfaces Metals Metals - chemistry Methods Modules Physical Sciences Pixels Product quality Prototypes Reasoning Research and Analysis Methods Semantic segmentation Semantics Social Sciences Surface defects Surface Properties Teaching methods Testing China
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Given their fast generalization capability for unseen classes and segmentation ability at pixel scale, models based on few-shot segmentation perform well in solving data insuﬃciency problems during metal defect detection and in delineating refined objects under industrial scenarios. Extant researches fail to consider the inherent intra-class differences in data about metal surface defects, so that the models can hardly learn enough information from the support set for guiding the segmentation of query set. Specifically, it can be categorized into two types: the semantic intra-class difference induced by internal factors in metal samples and the distortion intra-class difference caused by external factors of surroundings. To address these differences, we introduce a L ocal D escriptor-based M ulti- P rototype R easoning and E xcitation Net work ( LDMP-RENet ) to learn the two-view guidance, i.e., the local information from the graph space and the global information from the feature space, and fuse them to segment precisely. Given the contribution of relational structure of graph space-embedded local features to the Semantic Difference obviation, a multi-prototype reasoning module is utilized to extract local descriptors-based prototypes and to assess relevance between local-view features in support-query set pairs. Meanwhile, since global information helps obviate Distortion Difference in observations, a multi-prototype excitation module is employed for capturing global-view relevance in the above pairs. Lastly, an information fusion module is employed to integrate the learned prototypes in both global and local views, thereby creating pixel-level masks. Thorough experiments are conducted on defect datasets, revealing the superiority of proposed network to extant benchmarks, which sets a new state-of-the-art.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Competing Interests: The authors have declared that no competing interests exist. Peer Review History: PLOS recognizes the benefits of transparency in the peer review process; therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. The editorial history of this article is available here: 10.1371/journal.pone.0318553
ISSN:	1932-6203 1932-6203
DOI:	10.1371/journal.pone.0318553