Visual SLAM: Why filter?

While the most accurate solution to off-line structure from motion (SFM) problems is undoubtedly to extract as much correspondence information as possible and perform batch optimisation, sequential methods suitable for live video streams must approximate this to fit within fixed computational bounds...

Full description

Saved in:

Bibliographic Details
Published in	Image and vision computing Vol. 30; no. 2; pp. 65 - 77
Main Authors	Strasdat, Hauke, Montiel, J.M.M., Davison, Andrew J.
Format	Journal Article
Language	English
Published	Elsevier B.V 01.02.2012
Subjects	Accuracy Bundle adjustment Computation EKF Entropy Filtering Filtration Information filter Monocular vision Monte Carlo methods SLAM Stereo vision Structure from motion Visual Bundle adjustment EKF Structure from motion SLAM Monocular vision Stereo vision Information filter
Online Access	Get full text

Cover

Loading…

More Information
Summary:	While the most accurate solution to off-line structure from motion (SFM) problems is undoubtedly to extract as much correspondence information as possible and perform batch optimisation, sequential methods suitable for live video streams must approximate this to fit within fixed computational bounds. Two quite different approaches to real-time SFM – also called visual SLAM (simultaneous localisation and mapping) – have proven successful, but they sparsify the problem in different ways. Filtering methods marginalise out past poses and summarise the information gained over time with a probability distribution. Keyframe methods retain the optimisation approach of global bundle adjustment, but computationally must select only a small number of past frames to process. In this paper we perform a rigorous analysis of the relative advantages of filtering and sparse bundle adjustment for sequential visual SLAM. In a series of Monte Carlo experiments we investigate the accuracy and cost of visual SLAM. We measure accuracy in terms of entropy reduction as well as root mean square error (RMSE), and analyse the efficiency of bundle adjustment versus filtering using combined cost/accuracy measures. In our analysis, we consider both SLAM using a stereo rig and monocular SLAM as well as various different scenes and motion patterns. For all these scenarios, we conclude that keyframe bundle adjustment outperforms filtering, since it gives the most accuracy per unit of computing time. [Display omitted] ►We analyse filtering and bundle adjustment (BA) for sequential visual SLAM. ►In Monte Carlo experiments we investigate the accuracy and computational cost. ►Increasing the number of points increases the accuracy significantly. ►Increasing the number of intermediate keyframe only has a minor effect. ►BA outperforms filtering, since it gives the most accuracy per time step.
Bibliography:	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23
ISSN:	0262-8856 1872-8138
DOI:	10.1016/j.imavis.2012.02.009