From Logits to Hierarchies: Hierarchical Clustering made Simple

The structure of many real-world datasets is intrinsically hierarchical, making the modeling of such hierarchies a critical objective in both unsupervised and supervised machine learning. Recently, novel approaches for hierarchical clustering with deep architectures have been proposed. In this work,...

Full description

Saved in:
Bibliographic Details
Main Authors Palumbo, Emanuele, Vandenhirtz, Moritz, Ryser, Alain, Daunhawer, Imant, Vogt, Julia E
Format Journal Article
LanguageEnglish
Published 10.10.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The structure of many real-world datasets is intrinsically hierarchical, making the modeling of such hierarchies a critical objective in both unsupervised and supervised machine learning. Recently, novel approaches for hierarchical clustering with deep architectures have been proposed. In this work, we take a critical perspective on this line of research and demonstrate that many approaches exhibit major limitations when applied to realistic datasets, partly due to their high computational complexity. In particular, we show that a lightweight procedure implemented on top of pre-trained non-hierarchical clustering models outperforms models designed specifically for hierarchical clustering. Our proposed approach is computationally efficient and applicable to any pre-trained clustering model that outputs logits, without requiring any fine-tuning. To highlight the generality of our findings, we illustrate how our method can also be applied in a supervised setup, recovering meaningful hierarchies from a pre-trained ImageNet classifier.
DOI:10.48550/arxiv.2410.07858