Contrastive deep convolutional transform k-means clustering

Deep clustering has gained the immense attention of researchers in recent years. Most of the deep clustering approaches are based on auto-encoders which consist of an encoder-decoder framework. In these approaches, the clustering module is embedded in the latent space of auto-encoders. The auto-enco...

Full description

Saved in:
Bibliographic Details
Published inInformation sciences Vol. 661; p. 120191
Main Authors Goel, Anurag, Majumdar, Angshul
Format Journal Article
LanguageEnglish
Published Elsevier Inc 01.03.2024
Subjects
Online AccessGet full text
ISSN0020-0255
1872-6291
DOI10.1016/j.ins.2024.120191

Cover

Loading…
More Information
Summary:Deep clustering has gained the immense attention of researchers in recent years. Most of the deep clustering approaches are based on auto-encoders which consist of an encoder-decoder framework. In these approaches, the clustering module is embedded in the latent space of auto-encoders. The auto-encoder based deep clustering approaches require learning of encoder weights as well as decoder weights. Moreover, due to the unsupervised learning strategy, these approaches lack in learning the discriminative features that can help in generating better clusters. This work introduces a novel clustering approach based on Contrastive Deep Convolutional Transform Learning (DCTL) framework. The proposed approach mitigates the problem of lack of supervision in DCTL based K-means clustering approach by embedding the contrastive learning into it. To embed the contrastive learning, the positive pairs and negative pairs of data samples are generated by reconstructing the data samples from the DCTL learnt representation itself and thus eliminates the requirement of data augmentation for embedding contrastive learning. The experimental results on several benchmark facial images datasets demonstrate that the proposed framework gives better clustering performance as compared to the current state-of-the-art deep clustering approaches especially in data constrained scenarios. •We propose a novel framework that can learn compact and discriminative representations for clustering.•The contrastive learning is embedded using a reconstruction module with skip connection.•Comparison with SOTA on several facial images datasets demonstrate the effectiveness of the proposed model.
ISSN:0020-0255
1872-6291
DOI:10.1016/j.ins.2024.120191