Interval-Based Least Squares for Uncertainty-Aware Learning in Human-Centric Multimedia Systems

Bibliographic Details
Published in	IEEE Transactions on Neural Networks and Learning Systems Vol. 32; no. 11; pp. 5241-5246
Main Authors	Narwaria, Manish; Tatu, Aditya
Format	Journal Article
Language	English
Published	United States: IEEE (The Institute of Electrical and Electronics Engineers, Inc.), 01.11.2021
Subjects	Context; Crowdsourcing; Estimation; Humans; Image Processing, Computer-Assisted - methods; Image Processing, Computer-Assisted - trends; Image quality; Learning algorithms; Learning systems; Least squares; Least-Squares Analysis; Machine learning; machine learning (ML); Machine Learning - trends; Multimedia; Multimedia - trends; multimedia signal processing; Multimedia systems; Neural Networks, Computer; Optimization; Predictions; Sensory integration; Signal processing; Signal quality; Uncertainty

More Information
Summary:	Machine learning (ML) methods are popular in several application areas of multimedia signal processing. However, most existing solutions in this area, including the popular least squares, rely on penalizing predictions that deviate from the target ground-truth values. In other words, uncertainty in the ground-truth data is simply ignored. As a result, optimization and validation overemphasize a single target value even though the human subjects themselves did not unanimously agree on it. This leads to an unreasonable scenario in which the trained model is not given the benefit of the doubt in terms of prediction accuracy. The problem becomes even more significant in recent human-centric and immersive multimedia systems, where user feedback and interaction are influenced by higher degrees of freedom (leading to higher levels of uncertainty in the ground truth). To ameliorate this drawback, we propose an uncertainty-aware loss function (referred to as $\text{MSE}^{*}$) that explicitly accounts for data uncertainty and is useful for both optimization (training) and validation. As examples, we demonstrate the utility of the proposed method for blind estimation of the perceptual quality of audiovisual signals, panoramic images, and images affected by camera-induced distortions. The experimental results support the theoretical ideas in terms of reducing prediction errors. The proposed method is also relevant to more recent paradigms, such as crowdsourcing, where larger uncertainty in the ground truth is expected.
ISSN:	2162-237X; 2162-2388
DOI:	10.1109/TNNLS.2020.3025834
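
Illustration:	The summary describes a loss that accounts for uncertainty in the ground truth, but this record does not give the definition of MSE*. The sketch below is therefore not the authors' formulation. It shows one simple interval-based squared error, under the assumption that each target comes with a symmetric uncertainty interval (for example, a confidence interval around a mean opinion score). The function names `interval_mse` and `plain_mse` and the `half_width` parameter are hypothetical and chosen only for this example. Predictions that fall inside the interval incur no penalty; those outside are penalized by their squared distance to the nearest interval edge.

```python
import numpy as np


def plain_mse(pred, target):
    """Ordinary mean squared error against single target values."""
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    return float(np.mean((pred - target) ** 2))


def interval_mse(pred, target, half_width):
    """Hypothetical interval-based squared error (NOT the paper's MSE* definition).

    Assumes each target i has an uncertainty interval
    [target[i] - half_width[i], target[i] + half_width[i]].
    """
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    half_width = np.asarray(half_width, dtype=float)
    # Distance from each prediction to its interval; zero when inside it.
    excess = np.maximum(np.abs(pred - target) - half_width, 0.0)
    return float(np.mean(excess ** 2))


if __name__ == "__main__":
    pred = [3.0, 5.0]
    target = [3.5, 4.0]
    half_width = [1.0, 0.5]  # assumed per-sample uncertainty
    print(plain_mse(pred, target))
    print(interval_mse(pred, target, half_width))
```

With all half-widths set to zero, this sketch reduces to the ordinary mean squared error, which makes the contrast with single-target least squares easy to check.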