Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types
Published in | 2024 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW), pp. 1102-1109 |
---|---|
Main Authors | Azad Rabby, AKM Shahariar; Ali, Hasmot; Islam, Md. Majedul; Abujar, Sheikh; Rahman, Fuad |
Format | Conference Proceeding |
Language | English |
Published | IEEE, 01.01.2024 |
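Subjects | Handwriting recognition; Image segmentation; Layout; Noise; Optical character recognition; Text analysis; Writing |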
Abstract | This research paper presents a Bengali OCR system with several distinctive capabilities. The system reconstructs document layouts while preserving structure, alignment, and images. It incorporates image and signature detection for accurate extraction. Specialized word segmentation models cater to diverse document types, including computer-composed, letterpress, typewriter, and handwritten documents. The system handles static and dynamic handwritten inputs and recognizes various writing styles. It can also recognize compound characters in Bengali. Extensive data collection efforts provide a diverse corpus, while advanced technical components optimize character and word recognition. Additional contributions include image, logo, signature, and table recognition, perspective correction, layout reconstruction, and a queuing module for efficient and scalable processing. The system demonstrates outstanding performance in efficient and accurate text extraction and analysis. |
---|---|
Author | Ali, Hasmot; Islam, Md. Majedul; Rahman, Fuad; Azad Rabby, AKM Shahariar; Abujar, Sheikh |
Author_xml | 1. AKM Shahariar Azad Rabby (rabby@apurbatech.com), Apurba Technologies, Dhaka, Bangladesh; 2. Hasmot Ali (hasmot_ali@apurba.com.bd), Apurba Technologies, Dhaka, Bangladesh; 3. Md. Majedul Islam (majed@apurbatech.com), Apurba Technologies, Dhaka, Bangladesh; 4. Sheikh Abujar (sabujar@uab.edu), The University of Alabama at Birmingham, AL, USA; 5. Fuad Rahman (fuad@apurbatech.com), Apurba Technologies, CA, USA |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DOI | 10.1109/WACVW60836.2024.00120 |
EISBN | 9798350370287 |
EISSN | 2690-621X |
EndPage | 1109 |
ExternalDocumentID | 10495615 |
Genre | orig-research |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
PageCount | 8 |
PublicationCentury | 2000 |
PublicationDate | 2024-Jan.-1 |
PublicationDateYYYYMMDD | 2024-01-01 |
PublicationDate_xml | – month: 01 year: 2024 text: 2024-Jan.-1 day: 01 |
PublicationDecade | 2020 |
PublicationTitle | 2024 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW) |
PublicationTitleAbbrev | WACVW |
PublicationYear | 2024 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
StartPage | 1102 |
SubjectTerms | Handwriting recognition; Image segmentation; Layout; Noise; Optical character recognition; Text analysis; Writing |
Title | Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types |
URI | https://ieeexplore.ieee.org/document/10495615 |