CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel end-to-end trainable deep network, (cnec-xet) for detecting tables present in the documents. The proposed network consists of a multistage...
Saved in:
Published in | 2020 25th International Conference on Pattern Recognition (ICPR) pp. 9491 - 9498 |
---|---|
Main Authors | , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
10.01.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel end-to-end trainable deep network, (cnec-xet) for detecting tables present in the documents. The proposed network consists of a multistage extension of Mask R-CNN with a dual backbone having deformable convolution for detecting tables varying in scale with high detection accuracy at higher IoU threshold. We empirically evaluate CDeC-Net on the publicly available benchmark datasets with extensive experiments. Our solution has three important properties: (i) a single trained model CDeC-Net ‡ that performs well across all the popular benchmark datasets; (ii) we report excellent performances across multiple, including higher, thresholds of IoU; (iii) by following the same protocol of the recent papers for each of the benchmarks, we consistently demonstrate the superior quantitative performance. Our code and models are publicly available at https://github.com/mdv3101/CDeCNet for enabling reproducibility of the results. |
---|---|
AbstractList | Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel end-to-end trainable deep network, (cnec-xet) for detecting tables present in the documents. The proposed network consists of a multistage extension of Mask R-CNN with a dual backbone having deformable convolution for detecting tables varying in scale with high detection accuracy at higher IoU threshold. We empirically evaluate CDeC-Net on the publicly available benchmark datasets with extensive experiments. Our solution has three important properties: (i) a single trained model CDeC-Net ‡ that performs well across all the popular benchmark datasets; (ii) we report excellent performances across multiple, including higher, thresholds of IoU; (iii) by following the same protocol of the recent papers for each of the benchmarks, we consistently demonstrate the superior quantitative performance. Our code and models are publicly available at https://github.com/mdv3101/CDeCNet for enabling reproducibility of the results. |
Author | Mondal, Ajoy Agarwal, Madhav Jawahar, C. V. |
Author_xml | – sequence: 1 givenname: Madhav surname: Agarwal fullname: Agarwal, Madhav email: madhav14130@gmail.com organization: CVIT, IIIT,Hyderabad,India – sequence: 2 givenname: Ajoy surname: Mondal fullname: Mondal, Ajoy email: ajoy.mondal@iiit.ac.in organization: CVIT, IIIT,Hyderabad,India – sequence: 3 givenname: C. V. surname: Jawahar fullname: Jawahar, C. V. email: jawahar@iiit.ac.in organization: CVIT, IIIT,Hyderabad,India |
BookMark | eNotj81KxDAUhSPowhl9AkHyAq25t50mcSepP4VBRWbWQ5reSHGaDG1EfHurzuosvsPHOQt2GmIgxq5B5ABC3zTm9a1USlQ5CoRclwAa8YQtQKKCGajinG1NTSZ7pnTLTRwOceoT8Zp8HAfb7okbOznbEZ8bX3H84DPgmz9SUyKX-hh4H3gd3edAIfFmsO80XbAzb_cTXR5zybYP9xvzlK1fHhtzt84crqqUgUdoO1LWu6LVHZUCkLSrXEvCtr4E7wjQCS0lSLFCKoS0GnXprCp-9y_Z1b-3J6LdYewHO37vjk-LH8q9TcI |
CitedBy_id | crossref_primary_10_1109_TMM_2022_3165717 crossref_primary_10_1145_3657285 crossref_primary_10_3390_jimaging9030062 crossref_primary_10_1007_s11042_021_11582_9 crossref_primary_10_1007_s42979_022_01041_z crossref_primary_10_1007_s10032_023_00431_0 crossref_primary_10_1038_s41597_023_01985_8 crossref_primary_10_3390_app122010578 crossref_primary_10_1007_s10489_023_04782_3 crossref_primary_10_1007_s40747_023_01235_9 crossref_primary_10_1109_ACCESS_2022_3211069 |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/ICPR48806.2021.9411922 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 1728188083 9781728188089 |
EndPage | 9498 |
ExternalDocumentID | 9411922 |
Genre | orig-research |
GrantInformation_xml | – fundername: MEITY funderid: 10.13039/501100008628 |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-c256t-1f21bde8afc3b9de4012e9c6cbe0abf41fce12c097717052e307a9294ca838083 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:39:16 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c256t-1f21bde8afc3b9de4012e9c6cbe0abf41fce12c097717052e307a9294ca838083 |
PageCount | 8 |
ParticipantIDs | ieee_primary_9411922 |
PublicationCentury | 2000 |
PublicationDate | 2021-Jan.-10 |
PublicationDateYYYYMMDD | 2021-01-10 |
PublicationDate_xml | – month: 01 year: 2021 text: 2021-Jan.-10 day: 10 |
PublicationDecade | 2020 |
PublicationTitle | 2020 25th International Conference on Pattern Recognition (ICPR) |
PublicationTitleAbbrev | ICPR |
PublicationYear | 2021 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 2.3732288 |
Snippet | Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 9491 |
SubjectTerms | Benchmark testing Cascade Mask R-CNN Convolution Convolutional codes Data mining deformable convolution Mathematical model Page object Protocols Reproducibility of results single model table detection |
Title | CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images |
URI | https://ieeexplore.ieee.org/document/9411922 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwGA1zJ08qm_ibHDyaLkm7_vDaOjZhY8gGu40m-QKiVnHdxb_efGmdKB68lNAEUpLm-17T914IuYbEDNEqi0UxXrSxTCnNGbc2sSZ2QTNFvfN0Fo-X0f1quOqQm50WBgA8-QwCLPp_-eZVb3GrbJBFwgESF3D3Ui4brVYr-hU8G0zy-QO-jkg8kCJoG_84NcUnjdEBmX5113BFnoJtrQL98cuJ8b_Pc0j63_I8Ot8lniPSgapHlnkBOZtBfUtxjSMXC2gBHpOqZ6B5uUEqPJ01vG_qKujC1xRQez5WRR8rWrS90smLizSbPlmO7hb5mLVnJjDtwEvNhJVCGUhLq0OVGXCfTxIyHWsFvFQ2ElaDkJo72IdGOhLcGi8dRIp0mYapw2PHpFu9VnBCaKZCm4amDDXqT1WYaSMTdCyLdIIuOKekh0OyfmtsMdbtaJz9ffuc7OO04O6F4BekW79v4dLl81pd-Yn8BLTfoOo |
link.rule.ids | 310,311,786,790,795,796,802,27956,55107 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwGA1jHvSksom_zcGj6ZK26w-vrWPTrQzZYLfRJF9A1E5cd_GvN19bJ4oHLyU0gZSk-b7X9L0XQq4h1H20ymJ-gBelDZNSccaNCY0ObNCMUO88yYLh3L9f9BctcrPVwgBART4DB4vVv3y9UhvcKuvFvrCAxAbcHZvneVirtRrZr-Bxb5RMH_GFROqBK5ym-Y9zU6q0Mdgnk68Oa7bIs7MppaM-fnkx_veJDkj3W6BHp9vUc0haUHTIPEkhYRmUtxRXObKxgKZQoVL5AjTJ10iGp1nN_Ka2gs6qmhTKipFV0KeCpk2vdPRqY826S-aDu1kyZM2pCUxZ-FIyYVwhNUS5UZ6MNdgPKBdiFSgJPJfGF0aBcBW3wA-tdFywqzy3IMlXeeRFFpEdkXaxKuCY0Fh6JvJ07ilUoEovVtoN0bPMVyH64JyQDg7J8q02xlg2o3H69-0rsjucTcbL8Sh7OCN7OEW4lyH4OWmX7xu4sNm9lJfVpH4CYHWkPg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2020+25th+International+Conference+on+Pattern+Recognition+%28ICPR%29&rft.atitle=CDeC-Net%3A+Composite+Deformable+Cascade+Network+for+Table+Detection+in+Document+Images&rft.au=Agarwal%2C+Madhav&rft.au=Mondal%2C+Ajoy&rft.au=Jawahar%2C+C.+V.&rft.date=2021-01-10&rft.pub=IEEE&rft.spage=9491&rft.epage=9498&rft_id=info:doi/10.1109%2FICPR48806.2021.9411922&rft.externalDocID=9411922 |