Visual SLAM: Why filter?

While the most accurate solution to off-line structure from motion (SFM) problems is undoubtedly to extract as much correspondence information as possible and perform batch optimisation, sequential methods suitable for live video streams must approximate this to fit within fixed computational bounds...

Full description

Saved in:
Bibliographic Details
Published inImage and vision computing Vol. 30; no. 2; pp. 65 - 77
Main Authors Strasdat, Hauke, Montiel, J.M.M., Davison, Andrew J.
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.02.2012
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:While the most accurate solution to off-line structure from motion (SFM) problems is undoubtedly to extract as much correspondence information as possible and perform batch optimisation, sequential methods suitable for live video streams must approximate this to fit within fixed computational bounds. Two quite different approaches to real-time SFM – also called visual SLAM (simultaneous localisation and mapping) – have proven successful, but they sparsify the problem in different ways. Filtering methods marginalise out past poses and summarise the information gained over time with a probability distribution. Keyframe methods retain the optimisation approach of global bundle adjustment, but computationally must select only a small number of past frames to process. In this paper we perform a rigorous analysis of the relative advantages of filtering and sparse bundle adjustment for sequential visual SLAM. In a series of Monte Carlo experiments we investigate the accuracy and cost of visual SLAM. We measure accuracy in terms of entropy reduction as well as root mean square error (RMSE), and analyse the efficiency of bundle adjustment versus filtering using combined cost/accuracy measures. In our analysis, we consider both SLAM using a stereo rig and monocular SLAM as well as various different scenes and motion patterns. For all these scenarios, we conclude that keyframe bundle adjustment outperforms filtering, since it gives the most accuracy per unit of computing time. [Display omitted] ►We analyse filtering and bundle adjustment (BA) for sequential visual SLAM. ►In Monte Carlo experiments we investigate the accuracy and computational cost. ►Increasing the number of points increases the accuracy significantly. ►Increasing the number of intermediate keyframe only has a minor effect. ►BA outperforms filtering, since it gives the most accuracy per time step.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:0262-8856
1872-8138
DOI:10.1016/j.imavis.2012.02.009