Learning from mixed datasets: A monotonic image quality assessment model

Deep learning based image quality assessment models usually learn to predict image quality from a single dataset, leading the model to overfit specific scenes. To account for this, mixed datasets training can be an effective way to enhance the generalization capability of the model. However, it is n...

Full description

Saved in:
Bibliographic Details
Published inElectronics letters Vol. 59; no. 3
Main Authors Feng, Zhaopeng, Zhang, Keyang, Jia, Shuyue, Chen, Baoliang, Wang, Shiqi
Format Journal Article
LanguageEnglish
Published Stevenage John Wiley & Sons, Inc 01.02.2023
Wiley
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Deep learning based image quality assessment models usually learn to predict image quality from a single dataset, leading the model to overfit specific scenes. To account for this, mixed datasets training can be an effective way to enhance the generalization capability of the model. However, it is nontrivial to combine different image quality assessment datasets, as their quality evaluation criteria, score ranges, view conditions, as well as subjects are usually not shared during the image quality annotation. Instead of aligning the annotations, this paper proposes a monotonic neural network for image quality assessment model learning with different datasets combined.  In particular, this model consists of a dataset‐shared quality regressor and several dataset‐specific quality transformers. The quality regressor aims to obtain the perceptual quality of each image of each dataset and the quality transformer maps the perceptual quality to the corresponding annotation monotonically. The experimental results verify the effectiveness of the proposed learning strategy and the code is available at https://github.com/fzp0424/MonotonicIQA. We propose a monotonic neural network for IQA model learning with different datasets combined, getting rid of the laborious quality annotation alignment. Our model consists of a dataset‐shared quality regressor and several dataset‐specific quality transformers. The quality regressor aims to obtain the perceptual qualities of each dataset while each quality transformer maps the perceptual qualities to the corresponding dataset annotations with their monotonicity maintained.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0013-5194
1350-911X
DOI:10.1049/ell2.12698