Diffusion-based data augmentation and hierarchical CLIP for real estate image annotation

The critical role of real estate search engines in the economic domain underscores the necessity for a robust methodology in annotating the luxury level of room images. Contemporary challenges include inadequacies in room quality assessment, underutilization of deep network capacities, and insuffici...

Full description

Saved in:
Bibliographic Details
Published inPattern analysis and applications : PAA Vol. 28; no. 2
Main Authors Deng, Haojin, Zhang, Wandong, Yang, Yimin, Nejad, Eman
Format Journal Article
LanguageEnglish
Published London Springer London 01.06.2025
Springer Nature B.V
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The critical role of real estate search engines in the economic domain underscores the necessity for a robust methodology in annotating the luxury level of room images. Contemporary challenges include inadequacies in room quality assessment, underutilization of deep network capacities, and insufficient annotated house images. This paper presents an innovative real estate image annotation model employing a multi-stage approach that integrates diffusion models with contrastive language-image pretraining (CLIP) networks. Initially, the diffusion network serves as a data augmentation technique, generating supplementary real estate images for training. Next, a hierarchical CLIP model categorizes images into different room types. Subsequently, multiple CLIP models assess the condition of each room, annotating them as either contemporary or standard. The final and most significant stage involves transferring knowledge from the larger CLIP model to a smaller, more efficient model using soft labeling. This approach enhances inferencing speed while maintaining high performance. Experimental results on a newly gathered real estate image dataset demonstrate the superior performance of the proposed approach compared to existing house image classification algorithms. Our demonstration code and checkpoints are available at https://huggingface.co/strollingorange/roomLuxuryAnnotater .
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1433-7541
1433-755X
DOI:10.1007/s10044-025-01465-2