AMENet is a monocular depth estimation network designed for automatic stereoscopic display


Bibliographic Details
Published in: Scientific Reports, Vol. 14, No. 1, p. 5868
Main Authors: Wu, Tianzhao; Xia, Zhongyi; Zhou, Man; Kong, Ling Bing; Chen, Zengyuan
Format: Journal Article
Language: English
Published: England: Nature Publishing Group, 11.03.2024 (Nature Publishing Group UK; Nature Portfolio)

Summary: Monocular depth estimation has a wide range of applications in the field of autostereoscopic displays, but accuracy and robustness in complex scenes remain a challenge. In this paper, we propose a depth estimation network for autostereoscopic displays, AMENet, which aims to improve the accuracy of monocular depth estimation by fusing a Vision Transformer (ViT) and a convolutional neural network (CNN). Our approach feeds the input image into the ViT module as a sequence of visual features and uses its global perception capability to extract high-level semantic features of the image. The relationship between the losses is quantified by a weight correction module, which improves the robustness of the model. Experimental results on several public datasets show that AMENet achieves higher accuracy and robustness than existing methods across different scenes and complex conditions. In addition, a detailed experimental analysis verifies the effectiveness and stability of our method; on the KITTI dataset, accuracy improves by 4.4% over the baseline method. In summary, AMENet is a promising depth estimation method with sufficiently high robustness and accuracy for monocular depth estimation tasks.
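
The summary names two technical ingredients: a ViT branch fused with a CNN branch for dense depth prediction, and a weight correction module that balances the training losses. The record contains no code, so the sketch below is not the authors' AMENet implementation; it is a minimal PyTorch-style illustration under the assumption of a standard patch-embedding ViT encoder, a small convolutional branch, and an uncertainty-style learnable loss weighting. All class names (ToyViTCNNDepth, WeightedLoss), layer sizes, and the choice of loss terms are hypothetical.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class ToyViTCNNDepth(nn.Module):
        """Hypothetical ViT+CNN fusion for dense depth (illustration only)."""
        def __init__(self, dim=256, patch=16, heads=8, layers=4):
            super().__init__()
            # CNN branch: local detail features at 1/4 resolution.
            self.cnn = nn.Sequential(
                nn.Conv2d(3, dim, 3, stride=2, padding=1), nn.ReLU(inplace=True),
                nn.Conv2d(dim, dim, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            )
            # ViT branch: non-overlapping patch embedding plus a transformer
            # encoder for global context (positional embeddings omitted for brevity).
            self.patch_embed = nn.Conv2d(3, dim, kernel_size=patch, stride=patch)
            enc = nn.TransformerEncoderLayer(d_model=dim, nhead=heads, batch_first=True)
            self.vit = nn.TransformerEncoder(enc, num_layers=layers)
            # Fuse the two branches and predict a 1-channel depth map.
            self.fuse = nn.Conv2d(2 * dim, dim, 1)
            self.head = nn.Conv2d(dim, 1, 1)

        def forward(self, x):
            c = self.cnn(x)                              # (B, dim, H/4, W/4)
            t = self.patch_embed(x)                      # (B, dim, H/16, W/16)
            b, d, hp, wp = t.shape
            t = self.vit(t.flatten(2).transpose(1, 2))   # (B, hp*wp, dim)
            t = t.transpose(1, 2).reshape(b, d, hp, wp)
            t = F.interpolate(t, size=c.shape[-2:], mode="bilinear", align_corners=False)
            depth = self.head(self.fuse(torch.cat([c, t], dim=1)))
            return F.interpolate(depth, size=x.shape[-2:], mode="bilinear", align_corners=False)

    class WeightedLoss(nn.Module):
        """Learnable log-variance weighting of several loss terms: one plausible
        reading of the abstract's 'weight correction module', not its definition."""
        def __init__(self, n_terms=2):
            super().__init__()
            self.log_vars = nn.Parameter(torch.zeros(n_terms))

        def forward(self, losses):
            total = 0.0
            for i, loss_i in enumerate(losses):
                total = total + torch.exp(-self.log_vars[i]) * loss_i + self.log_vars[i]
            return total

    # Toy usage: combine two depth losses with learned weights.
    model = ToyViTCNNDepth()
    criterion = WeightedLoss(n_terms=2)
    pred = model(torch.randn(1, 3, 224, 224))
    target = torch.randn(1, 1, 224, 224)
    loss = criterion([F.l1_loss(pred, target), F.mse_loss(pred, target)])

In this sketch the ViT features are upsampled to the CNN feature resolution before a 1x1 convolution fuses them, which is one simple way to combine global and local cues; the learnable log-variance terms let the optimizer rebalance the loss mixture during training.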
ISSN: 2045-2322
DOI: 10.1038/s41598-024-56095-1