Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types

This research paper presents a unique Bengali OCR system with some capabilities. The system excels in reconstructing document layouts while preserving structure, alignment, and images. It incorporates advanced image and signature detection for accurate extraction. Specialized models for word segment...

Full description

Saved in:
Bibliographic Details
Published in2024 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW) pp. 1102 - 1109
Main Authors Azad Rabby, AKM Shahariar, Ali, Hasmot, Islam, Md. Majedul, Abujar, Sheikh, Rahman, Fuad
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.01.2024
Subjects
Online AccessGet full text

Cover

Loading…
Abstract This research paper presents a unique Bengali OCR system with some capabilities. The system excels in reconstructing document layouts while preserving structure, alignment, and images. It incorporates advanced image and signature detection for accurate extraction. Specialized models for word segmentation cater to diverse document types, including computer-composed, letterpress, typewriter, and hand-written documents. The system handles static and dynamic handwritten inputs, recognizing various writing styles. Furthermore, it has the ability to recognize compound characters in Bengali. Extensive data collection efforts provide a diverse corpus, while advanced technical components optimize character and word recognition. Additional contributions include image, logo, signature and table recognition, perspective correction, layout reconstruction, and a queuing module for efficient and scalable processing. The system demonstrates outstanding performance in efficient and accurate text extraction and analysis.
AbstractList This research paper presents a unique Bengali OCR system with some capabilities. The system excels in reconstructing document layouts while preserving structure, alignment, and images. It incorporates advanced image and signature detection for accurate extraction. Specialized models for word segmentation cater to diverse document types, including computer-composed, letterpress, typewriter, and hand-written documents. The system handles static and dynamic handwritten inputs, recognizing various writing styles. Furthermore, it has the ability to recognize compound characters in Bengali. Extensive data collection efforts provide a diverse corpus, while advanced technical components optimize character and word recognition. Additional contributions include image, logo, signature and table recognition, perspective correction, layout reconstruction, and a queuing module for efficient and scalable processing. The system demonstrates outstanding performance in efficient and accurate text extraction and analysis.
Author Ali, Hasmot
Islam, Md. Majedul
Rahman, Fuad
Azad Rabby, AKM Shahariar
Abujar, Sheikh
Author_xml – sequence: 1
  givenname: AKM Shahariar
  surname: Azad Rabby
  fullname: Azad Rabby, AKM Shahariar
  email: rabby@apurbatech.com
  organization: Apurba Technologies,Dhaka,Bangladesh
– sequence: 2
  givenname: Hasmot
  surname: Ali
  fullname: Ali, Hasmot
  email: hasmot_ali@apurba.com.bd
  organization: Apurba Technologies,Dhaka,Bangladesh
– sequence: 3
  givenname: Md. Majedul
  surname: Islam
  fullname: Islam, Md. Majedul
  email: majed@apurbatech.com
  organization: Apurba Technologies,Dhaka,Bangladesh
– sequence: 4
  givenname: Sheikh
  surname: Abujar
  fullname: Abujar, Sheikh
  email: sabujar@uab.edu
  organization: The University of Alabama at Birmingham,AL,USA
– sequence: 5
  givenname: Fuad
  surname: Rahman
  fullname: Rahman, Fuad
  email: fuad@apurbatech.com
  organization: Apurba Technologies,CA,USA
BookMark eNotjttOAjEURavRRET-QJP-wIynpzPtzCMOeEkwJIriGyntGamBDk6BBL9evDztrIe1ss_ZSWgCMXYlIBUCyutpv3qdKiikShEwSwEEwhHrlbosZA5SAxb6mHVQlZAoFG9nrBfjBwBIhAyE6jA7DAsTLK0obHhT8xsK72bp-bh64vM9f16T9Qf-IscfG0fLyE1wvO92P5LjE7KL4D-3FHndtHzgd9RG4oPGbn-Lk_2a4gU7rc0yUu9_u-zldjip7pPR-O6h6o8Sf3izSWgu5oXTtaqdzaXQKJ2wCgtZZFKiFrnJDAmUhkyGJI21UhfOkkZEU5OQXXb51_VENFu3fmXa_UxAVuZK5PIbW45ZVQ
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/WACVW60836.2024.00120
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library Online
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350370287
EISSN 2690-621X
EndPage 1109
ExternalDocumentID 10495615
Genre orig-research
GroupedDBID 6IE
6IF
6IL
6IN
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
OCL
RIE
RIL
ID FETCH-LOGICAL-i204t-eb1b8d7f6fdc531723d1c628384332715a4ae123aea42e3acc378dce7222afe13
IEDL.DBID RIE
IngestDate Wed Jun 26 19:40:49 EDT 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i204t-eb1b8d7f6fdc531723d1c628384332715a4ae123aea42e3acc378dce7222afe13
PageCount 8
ParticipantIDs ieee_primary_10495615
PublicationCentury 2000
PublicationDate 2024-Jan.-1
PublicationDateYYYYMMDD 2024-01-01
PublicationDate_xml – month: 01
  year: 2024
  text: 2024-Jan.-1
  day: 01
PublicationDecade 2020
PublicationTitle 2024 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW)
PublicationTitleAbbrev WACVW
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0003204016
Score 1.9056995
Snippet This research paper presents a unique Bengali OCR system with some capabilities. The system excels in reconstructing document layouts while preserving...
SourceID ieee
SourceType Publisher
StartPage 1102
SubjectTerms Handwriting recognition
Image segmentation
Layout
Noise
Optical character recognition
Text analysis
Writing
Title Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types
URI https://ieeexplore.ieee.org/document/10495615
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEF60J08qVnyzB6-p3W6azR5rH4iHKtLa3so-ZrEoqdj0YH-9M0laRRC8hUDIMrPJ983sfDOMXasQI0wEG4HwEMXeqMimsY6QTGjt2pLOYqjaYpjcjeP7aXtaidULLQwAFMVn0KDL4izfL9yKUmX4hROdJ0n5rtK6FGttEyqyhftRJJVKRzT1zaTTfZ4k1H4Z48AWdckWNNb7xxSVAkQG-2y4eX1ZO_LaWOW24da_OjP-e30HrP6t1-OPWyQ6ZDuQHTHXz17IpfQMXwR-CxmiwZw_dJ-4_eTV4Pn5GjyngWhvS24yzztVTQAfbZq7LjnyWt4r6jeA96pVcApgl3U2HvRH3buoGqkQzdE8eYR_Zpt6FZLg0RFIXqQXLkGKkVIfMyXaJjaAYGbAxC2QxjmpUu9AIY0wAYQ8ZrVskcEJ4x4DnwBOausoSANrhGoGaQRgRKOdOGV1stDsveyaMdsY5-yP--dsj7xUpjcuWC3_WMElAn5urwpHfwGG2qqz
link.rule.ids 310,311,783,787,792,793,799,27937,55086
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEF6kHvSkYsW3e_Ca2u2meRxrH1StVaS1vZV9zGJRUrHpwf56Z5K0iiB4C4HAMJNkvpmd7xvGLkPnY5pw2gNhwfOtCj0d-bGHYCKOTV3SWQxNW_SD7tC_HdfHBVk948IAQDZ8BhW6zM7y7cwsqFWGXzjBeaKUbyKwjoKcrrVuqcgavpEiKHg6ohpfjRrN51FAAsxYCdZIJ1vQYu8fe1SyNNLZYf2VAfn0yGtlkeqKWf7SZvy3hbus_M3Y44_rXLTHNiDZZ6advFBQ6Rk-c_waEswHU_7QfOL6kxer56dLsJxWor3NuUosbxRTAXywknedc0S2vJVNcABvFVZwKmHnZTbstAfNrlcsVfCm6J7Uw3-zjmzoAmcxFAhfpBUmQJARkZJZKOrKV4DpTIHyayCVMTKMrIEQgYRyIOQBKyWzBA4Zt1j6ODAy1obKNNBKhFUnlQCsaWIjjliZPDR5z3UzJivnHP9x_4JtdQf3vUnvpn93wrYpYnmz45SV0o8FnGH6T_V5FvQvon-t_g
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+IEEE%2FCVF+Winter+Conference+on+Applications+of+Computer+Vision+Workshops+%28WACVW%29&rft.atitle=Enhancement+of+Bengali+OCR+by+Specialized+Models+and+Advanced+Techniques+for+Diverse+Document+Types&rft.au=Azad+Rabby%2C+AKM+Shahariar&rft.au=Ali%2C+Hasmot&rft.au=Islam%2C+Md.+Majedul&rft.au=Abujar%2C+Sheikh&rft.date=2024-01-01&rft.pub=IEEE&rft.eissn=2690-621X&rft.spage=1102&rft.epage=1109&rft_id=info:doi/10.1109%2FWACVW60836.2024.00120&rft.externalDocID=10495615