Learning from mixed datasets: A monotonic image quality assessment model
Deep learning based image quality assessment models usually learn to predict image quality from a single dataset, leading the model to overfit specific scenes. To account for this, mixed datasets training can be an effective way to enhance the generalization capability of the model. However, it is n...
Saved in:
Published in | Electronics letters Vol. 59; no. 3 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
Stevenage
John Wiley & Sons, Inc
01.02.2023
Wiley |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Deep learning based image quality assessment models usually learn to predict image quality from a single dataset, leading the model to overfit specific scenes. To account for this, mixed datasets training can be an effective way to enhance the generalization capability of the model. However, it is nontrivial to combine different image quality assessment datasets, as their quality evaluation criteria, score ranges, view conditions, as well as subjects are usually not shared during the image quality annotation. Instead of aligning the annotations, this paper proposes a monotonic neural network for image quality assessment model learning with different datasets combined. In particular, this model consists of a dataset‐shared quality regressor and several dataset‐specific quality transformers. The quality regressor aims to obtain the perceptual quality of each image of each dataset and the quality transformer maps the perceptual quality to the corresponding annotation monotonically. The experimental results verify the effectiveness of the proposed learning strategy and the code is available at https://github.com/fzp0424/MonotonicIQA.
We propose a monotonic neural network for IQA model learning with different datasets combined, getting rid of the laborious quality annotation alignment. Our model consists of a dataset‐shared quality regressor and several dataset‐specific quality transformers. The quality regressor aims to obtain the perceptual qualities of each dataset while each quality transformer maps the perceptual qualities to the corresponding dataset annotations with their monotonicity maintained. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ISSN: | 0013-5194 1350-911X |
DOI: | 10.1049/ell2.12698 |