CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel end-to-end trainable deep network (CDeC-Net) for detecting tables present in the documents. The proposed network consists of a multistage...

Bibliographic Details
Published in 2020 25th International Conference on Pattern Recognition (ICPR), pp. 9491-9498
Main Authors Agarwal, Madhav, Mondal, Ajoy, Jawahar, C. V.
Format Conference Proceeding
Language English
Published IEEE 10.01.2021
Abstract Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel end-to-end trainable deep network (CDeC-Net) for detecting tables present in the documents. The proposed network consists of a multistage extension of Mask R-CNN with a dual backbone having deformable convolution for detecting tables varying in scale with high detection accuracy at higher IoU thresholds. We empirically evaluate CDeC-Net on the publicly available benchmark datasets with extensive experiments. Our solution has three important properties: (i) a single trained model, CDeC-Net‡, that performs well across all the popular benchmark datasets; (ii) we report excellent performance across multiple, including higher, thresholds of IoU; (iii) by following the same protocol of the recent papers for each of the benchmarks, we consistently demonstrate superior quantitative performance. Our code and models are publicly available at https://github.com/mdv3101/CDeCNet for enabling reproducibility of the results.
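To illustrate what the abstract means by "thresholds of IoU", the following minimal Python sketch computes intersection over union (IoU) for two axis-aligned boxes and checks whether a predicted table box counts as a match at a chosen threshold. The function names, the (x_min, y_min, x_max, y_max) box convention, and the example thresholds are assumptions made for illustration. They are not taken from the CDeC-Net code base or from any benchmark's evaluation protocol.

```python
from typing import Tuple

# Assumed box convention: (x_min, y_min, x_max, y_max) in pixel coordinates.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Return the intersection over union of two axis-aligned boxes."""
    # Corners of the overlapping rectangle (may be empty).
    ix_min, iy_min = max(a[0], b[0]), max(a[1], b[1])
    ix_max, iy_max = min(a[2], b[2]), min(a[3], b[3])

    # Clamp to zero so non-overlapping boxes give zero intersection.
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_match(pred: Box, gt: Box, threshold: float) -> bool:
    """Hypothetical helper: a prediction matches if IoU >= threshold."""
    return iou(pred, gt) >= threshold


if __name__ == "__main__":
    predicted = (0.0, 0.0, 10.0, 10.0)
    ground_truth = (1.0, 0.0, 10.0, 10.0)
    # The same prediction can pass a loose threshold and fail a strict one.
    for t in (0.5, 0.95):
        print(t, is_match(predicted, ground_truth, t))
```

A stricter threshold demands tighter localization, which is why results reported at higher IoU values are a more demanding test of a detector.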
Author Mondal, Ajoy
Agarwal, Madhav
Jawahar, C. V.
Author_xml – sequence: 1
  givenname: Madhav
  surname: Agarwal
  fullname: Agarwal, Madhav
  email: madhav14130@gmail.com
  organization: CVIT, IIIT,Hyderabad,India
– sequence: 2
  givenname: Ajoy
  surname: Mondal
  fullname: Mondal, Ajoy
  email: ajoy.mondal@iiit.ac.in
  organization: CVIT, IIIT,Hyderabad,India
– sequence: 3
  givenname: C. V.
  surname: Jawahar
  fullname: Jawahar, C. V.
  email: jawahar@iiit.ac.in
  organization: CVIT, IIIT,Hyderabad,India
CitedBy_id crossref_primary_10_1109_TMM_2022_3165717
crossref_primary_10_1145_3657285
crossref_primary_10_3390_jimaging9030062
crossref_primary_10_1007_s11042_021_11582_9
crossref_primary_10_1007_s42979_022_01041_z
crossref_primary_10_1007_s10032_023_00431_0
crossref_primary_10_1038_s41597_023_01985_8
crossref_primary_10_3390_app122010578
crossref_primary_10_1007_s10489_023_04782_3
crossref_primary_10_1007_s40747_023_01235_9
crossref_primary_10_1109_ACCESS_2022_3211069
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICPR48806.2021.9411922
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1728188083
9781728188089
EndPage 9498
ExternalDocumentID 9411922
Genre orig-research
GrantInformation_xml – fundername: MEITY
  funderid: 10.13039/501100008628
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
IEDL.DBID RIE
IngestDate Thu Jun 29 18:39:16 EDT 2023
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
PageCount 8
ParticipantIDs ieee_primary_9411922
PublicationCentury 2000
PublicationDate 2021-Jan.-10
PublicationDateYYYYMMDD 2021-01-10
PublicationDate_xml – month: 01
  year: 2021
  text: 2021-Jan.-10
  day: 10
PublicationDecade 2020
PublicationTitle 2020 25th International Conference on Pattern Recognition (ICPR)
PublicationTitleAbbrev ICPR
PublicationYear 2021
Publisher IEEE
Publisher_xml – name: IEEE
Score 2.3732288
SourceID ieee
SourceType Publisher
StartPage 9491
SubjectTerms Benchmark testing
Cascade Mask R-CNN
Convolution
Convolutional codes
Data mining
deformable convolution
Mathematical model
Page object
Protocols
Reproducibility of results
single model
table detection
Title CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
URI https://ieeexplore.ieee.org/document/9411922
hasFullText 1
inHoldings 1
isFullTextHit
isPrint