MIDV-500: a dataset for identity document analysis and recognition on mobile devices in video stream

A lot of research has been devoted to identity documents analysis and recognition on mobile devices. However, no publicly available datasets designed for this particular problem currently exist. There are a few datasets which are useful for associated subtasks but in order to facilitate a more compr...

Full description

Saved in:
Bibliographic Details
Published inKompʹûternaâ optika Vol. 43; no. 5; pp. 818 - 824
Main Authors Arlazarov, V.V., Bulatov, K., Chernov, T., Arlazarov, V.L.
Format Journal Article
LanguageEnglish
Published Samara National Research University 01.10.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A lot of research has been devoted to identity documents analysis and recognition on mobile devices. However, no publicly available datasets designed for this particular problem currently exist. There are a few datasets which are useful for associated subtasks but in order to facilitate a more comprehensive scientific and technical approach to identity document recognition more specialized datasets are required. In this paper we present a Mobile Identity Document Video dataset (MIDV-500) consisting of 500 video clips for 50 different identity document types with ground truth which allows to perform research in a wide scope of document analysis problems. The paper presents characteristics of the dataset and evaluation results for existing methods of face detection, text line recognition, and document fields data extraction. Since an important feature of identity documents is their sensitiveness as they contain personal data, all source document images used in MIDV-500 are either in public domain or distributed under public copyright licenses. The main goal of this paper is to present a dataset. However, in addition and as a baseline, we present evaluation results for existing methods for face detection, text line recognition, and document data extraction, using the presented dataset.
ISSN:0134-2452
2412-6179
DOI:10.18287/2412-6179-2019-43-5-818-824