Neural net based complete character recognition scheme for Bangla printed text books

In this paper we propose a neural net based characters recognition scheme for Bangla printed text books. There are a lot of scientific literature, novels, magazines and books etc that are written in Bangla language. More than 400 million people use Bangla language. Most of the library and educationa...

Full description

Saved in:
Bibliographic Details
Published in16th Int'l Conf. Computer and Information Technology pp. 71 - 75
Main Authors Alamgir Hossain, S. K., Tabassum, Tamanna
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.03.2014
Subjects
Online AccessGet full text
DOI10.1109/ICCITechn.2014.6997336

Cover

Abstract In this paper we propose a neural net based characters recognition scheme for Bangla printed text books. There are a lot of scientific literature, novels, magazines and books etc that are written in Bangla language. More than 400 million people use Bangla language. Most of the library and educational institutions want to keep copy of the books in a digital format. For storing those books in digital text format we need a good character recognition schemes by which we can convert the scanned text book images to editable texts. The key contribution of our research highlights this issue. We propose four main stages namely preprocessing, segmentation, training-recognition and post-processing. In the beginning the input book images preprocessed by rotation, scaling, binarization and noise elimination. The binarized image is then segmented and extracted into individual characters that are trained and recognized by an artificial neural network. Finally, the process ends by reconstructing the text in the post processing stage.
AbstractList In this paper we propose a neural net based characters recognition scheme for Bangla printed text books. There are a lot of scientific literature, novels, magazines and books etc that are written in Bangla language. More than 400 million people use Bangla language. Most of the library and educational institutions want to keep copy of the books in a digital format. For storing those books in digital text format we need a good character recognition schemes by which we can convert the scanned text book images to editable texts. The key contribution of our research highlights this issue. We propose four main stages namely preprocessing, segmentation, training-recognition and post-processing. In the beginning the input book images preprocessed by rotation, scaling, binarization and noise elimination. The binarized image is then segmented and extracted into individual characters that are trained and recognized by an artificial neural network. Finally, the process ends by reconstructing the text in the post processing stage.
Author Tabassum, Tamanna
Alamgir Hossain, S. K.
Author_xml – sequence: 1
  givenname: S. K.
  surname: Alamgir Hossain
  fullname: Alamgir Hossain, S. K.
  email: alamgir@cseku.ac.bd
  organization: Comput. Sci. & Eng. Discipline, Khulna Univ., Khulna, Bangladesh
– sequence: 2
  givenname: Tamanna
  surname: Tabassum
  fullname: Tabassum, Tamanna
  email: tamanna@thecodeandfix.com
  organization: CFS Ltd. Khulna, Khulna, Bangladesh
BookMark eNotj8tKxDAYhSPoQsd5AkHyAq3JJOay1OKlMOim--Fv-mdabJMhjaBvb8BZHfjgfJxzQy5DDEjIPWc158w-tE3TdujGUO8Yl7WyVguhLsjWasOltlbIQq5J94HfCWYaMNMeVhyoi8tpxozUjZDAZUw0oYvHMOUpBrq6ERekPib6DOE4Az2lKeRSzPhTHDF-rbfkysO84vacG9K9vnTNe7X_fGubp301WZYr_yit8EYYozjyHpQE5MCZczAMWspdmSo09MI5ZnQBxguNFlQPlvNBiQ25-9dOiHgoMxZIv4fzVfEHpiVP6Q
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICCITechn.2014.6997336
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781479934973
1479934976
EndPage 75
ExternalDocumentID 6997336
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i90t-f5493f838861e1ba64ae1a10ccadd744297837ab3cc0877448f37e9a6ba911d63
IEDL.DBID RIE
IngestDate Thu Jun 29 18:37:37 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i90t-f5493f838861e1ba64ae1a10ccadd744297837ab3cc0877448f37e9a6ba911d63
PageCount 5
ParticipantIDs ieee_primary_6997336
PublicationCentury 2000
PublicationDate 2014-March
PublicationDateYYYYMMDD 2014-03-01
PublicationDate_xml – month: 03
  year: 2014
  text: 2014-March
PublicationDecade 2010
PublicationTitle 16th Int'l Conf. Computer and Information Technology
PublicationTitleAbbrev ICCITechn
PublicationYear 2014
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.5694094
Snippet In this paper we propose a neural net based characters recognition scheme for Bangla printed text books. There are a lot of scientific literature, novels,...
SourceID ieee
SourceType Publisher
StartPage 71
SubjectTerms Binary Image
Boundary Fill
Character recognition
Computers
Image segmentation
Information technology
Neural Network
Noise
OCR
Optical character recognition software
Training
Title Neural net based complete character recognition scheme for Bangla printed text books
URI https://ieeexplore.ieee.org/document/6997336
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEB3anjyptOI3e_DoppvuNh9Xi6UVKh4q9FZ2JxMQNRVNLv56Z5O0onjwFkJIwk7Imzf75g3A1QjjLCeFEhkdpTFopB2hkwoZC41WSY5-R3dxH80ezd1qvOrA9a4Xhohq8RkF_rDey882WPlS2TBKU-_e14Uuf2ZNr1bb9BuqdDifTOZ1PdoLtkzQXvxjakoNGtN9WGwf12hFnoOqdAF-_nJi_O_7HMDguz1PPOyA5xA6VPRh6W027IsoqBQemjJRq8U5Jxa4NWUWO7nQphBMa-mVBCet4sb6UR7C1_g4ARVeDCJ89v0xgOX0djmZyXZkgnxKVSlzZns6T3SSRCGFzkbGUmhDxWHKstgw9sRMSK3TiN4IkKlZrmNKbeQs__SySB9Br9gUdAzCESdKTIasY84yRsc3RdIjlYfIWcfYnEDfL8j6rTHFWLdrcfr36TPY80FpxFvn0CvfK7pgNC_dZR3GL1W6orQ
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEB1qPehJpRW_3YNHkybdzdfVYmm1LR4i9FZ2JxMQNRVNL_56Z5O0onjwFkJIwg7sezP75g3AVR-jLCcPHWR0dJRC5eg-GsdDxkIlvThHe6I7nYWjR3U3D-YtuN70whBRJT4j115WZ_nZEle2VNYLk8S6923BNuO-Cupurabt1_eS3ngwGFcVaSvZUm7z-I-5KRVsDPdguv5grRZ5dlelcfHzlxfjf_9oH7rfDXriYQM9B9CiogOpNdrQL6KgUlhwykSlF2dWLHBtyyw2gqFlITixpVcSTFvFjbbDPISt8jEFFVYOIiz__uhCOrxNByOnGZrgPCVe6eSc78k8lnEc-uQbHSpNvvY9DlSWRYrRJ-KUVBuJaK0AOTnLZUSJDo3mbS8L5SG0i2VBRyAMMVXidEgbzloCNPxSJNn3ch-ZdwTqGDp2QRZvtS3GolmLk79vX8LOKJ1OFpPx7P4Udm2AainXGbTL9xWdM7aX5qIK6RcI06YB
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=16th+Int%27l+Conf.+Computer+and+Information+Technology&rft.atitle=Neural+net+based+complete+character+recognition+scheme+for+Bangla+printed+text+books&rft.au=Alamgir+Hossain%2C+S.+K.&rft.au=Tabassum%2C+Tamanna&rft.date=2014-03-01&rft.pub=IEEE&rft.spage=71&rft.epage=75&rft_id=info:doi/10.1109%2FICCITechn.2014.6997336&rft.externalDocID=6997336