Automatic processing of Historical Arabic Documents: A comprehensive Survey
•Challenges of automatic processing of historical Arabic documents (APHAD).•Classification of APHAD applications into four tasks: Data analysis, Writer classification, Data classification and Data retrieval.•For each application, a survey of existing approaches is presented.•For each application, th...
Saved in:
Published in | Pattern recognition Vol. 100; pp. 107144 - 1:107144-17 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
Elsevier Ltd
01.04.2020
Elsevier |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | •Challenges of automatic processing of historical Arabic documents (APHAD).•Classification of APHAD applications into four tasks: Data analysis, Writer classification, Data classification and Data retrieval.•For each application, a survey of existing approaches is presented.•For each application, the existing solutions are discussed and recommendations are suggested.•Existing datasets and softwares on APHAD applications are surveyed.
Nowadays, there is a huge amount of Historical Arabic Documents (HAD) in the national libraries and archives around the world. Analyzing this type of data manually is a difficult and costly task. Thus, an automatic process is required to exploit these documents more rapidly. Processing historical documents is a recent research subject that has seen a remarkable growth in the last years. Processing Historical Arabic Documents is a particularly challenging problem. First, due to complicated nature of Arabic script compared to other scripts and second because the documents are ancient. This paper focuses on this difficult problem and provides a comprehensive survey of existing research work. First, we describe in detail the challenges making the automatic processing of Historical Arabic Documents a difficult task. Second, we classify this task into four applications of automatic processing of HAD: i) Analyze the document to extract the main text ii) Identify the writer of the document iii) Recognize some words or parts of the document in a reference dataset andiv) Retrieve and extract specific data from the document. For each application, existing approaches are surveyed and qualitatively described. Finally, we focus on available datasets and describe how they can be used in each application. |
---|---|
AbstractList | Nowadays, there is a huge amount of Historical Arabic Documents (HAD) in the national libraries and archives around the world. Analyzing this type of data manually is a difficult and costly task. Thus, an automatic process is required to exploit these documents more rapidly. Processing historical documents is a recent research subject that has seen a remarkable growth in the last years. Processing Historical Arabic Documents is a particularly challenging problem. First, due to complicated nature of Arabic script compared to other scripts and second because the documents are ancient. This paper focuses on this difficult problem and provides a comprehensive survey of existing research work. First, we describe in detail the challenges making the automatic processing of Historical Arabic Documents a difficult task. Second, we classify this task into four applications of automatic processing of HAD: i) Analyze the document to extract the main text ii) Identify the writer of the document iii) Recognize some words or parts of the document in a reference dataset andiv) Retrieve and extract specific data from the document. For each application, existing approaches are surveyed and qualitatively described. Finally, we focus on available datasets and describe how they can be used in each application. •Challenges of automatic processing of historical Arabic documents (APHAD).•Classification of APHAD applications into four tasks: Data analysis, Writer classification, Data classification and Data retrieval.•For each application, a survey of existing approaches is presented.•For each application, the existing solutions are discussed and recommendations are suggested.•Existing datasets and softwares on APHAD applications are surveyed. Nowadays, there is a huge amount of Historical Arabic Documents (HAD) in the national libraries and archives around the world. Analyzing this type of data manually is a difficult and costly task. Thus, an automatic process is required to exploit these documents more rapidly. Processing historical documents is a recent research subject that has seen a remarkable growth in the last years. Processing Historical Arabic Documents is a particularly challenging problem. First, due to complicated nature of Arabic script compared to other scripts and second because the documents are ancient. This paper focuses on this difficult problem and provides a comprehensive survey of existing research work. First, we describe in detail the challenges making the automatic processing of Historical Arabic Documents a difficult task. Second, we classify this task into four applications of automatic processing of HAD: i) Analyze the document to extract the main text ii) Identify the writer of the document iii) Recognize some words or parts of the document in a reference dataset andiv) Retrieve and extract specific data from the document. For each application, existing approaches are surveyed and qualitatively described. Finally, we focus on available datasets and describe how they can be used in each application. |
ArticleNumber | 107144 |
Author | Jmila, Houda Ibn Khedher, Mohamed El-Yacoubi, Mounim A. |
Author_xml | – sequence: 1 givenname: Mohamed surname: Ibn Khedher fullname: Ibn Khedher, Mohamed email: mohamed.ibn-khedher@irt-systemx.fr, ibnkhedhermohamed@hotmail.com organization: Samovar, CNRS, Télécom SudParis, Institut Polytechnique de Paris 9 rue Charles Fourier, Evry Cedex 91011, France – sequence: 2 givenname: Houda surname: Jmila fullname: Jmila, Houda organization: IRT SystemX, 8 avenue de la vauve, Palaiseau 91120, France – sequence: 3 givenname: Mounim A. orcidid: 0000-0002-7383-0588 surname: El-Yacoubi fullname: El-Yacoubi, Mounim A. organization: IRT SystemX, 8 avenue de la vauve, Palaiseau 91120, France |
BackLink | https://hal.science/hal-02481354$$DView record in HAL |
BookMark | eNqFkM1PAjEQxRuDiYj-Bx726mGx0-4XHEw2-IGRxIN6bsowhRLYkraQ8N-7ZPXiQU-TvHnvTeZ3yXqNa4ixG-BD4FDcrYc7HdEth4LDqJVKyLIz1oeqlGkOmeixPucSUim4vGCXIaw5h9Yk-uy13ke31dFisvMOKQTbLBNnkqkN0XmLepPUXs_b_YPD_ZaaGMZJnaDb7jytqAn2QMn73h_oeMXOjd4Euv6eA_b59Pgxmaazt-eXST1LUVZFTAmN5oCCV6UZmYXIiwKzis9lWWhZQiFQVFVhMpRzkCbXGjQnpJFcVGCKMpcDdtv1rvRG7bzdan9UTls1rWfqpHGRVSDz7ACtd9x50bsQPBmFNrbvuiZ6bTcKuDohVGvVIVQnhKpD2IazX-Gfa__E7rsYtRAOlrwKaKlBWlhPGNXC2b8LvgDv047E |
CitedBy_id | crossref_primary_10_1109_ACCESS_2024_3520327 crossref_primary_10_1126_sciadv_abg4266 crossref_primary_10_1007_s11042_022_11981_6 crossref_primary_10_1109_ACCESS_2024_3450507 crossref_primary_10_1016_j_procs_2024_10_220 crossref_primary_10_1016_j_patrec_2022_04_040 crossref_primary_10_1007_s10032_021_00382_4 crossref_primary_10_1016_j_amc_2021_126861 crossref_primary_10_7240_jeps_888164 crossref_primary_10_3390_s23198133 crossref_primary_10_1007_s00500_023_08384_6 |
Cites_doi | 10.1016/0031-3203(81)90009-1 10.1109/TSMC.1979.4310076 10.1007/s100440070020 10.1109/TASSP.1978.1163055 10.1016/j.ultrasmedbio.2008.09.007 10.1145/2431211.2431222 10.1016/j.patcog.2017.02.023 10.1007/s10032-012-0186-8 10.1016/S0019-9958(70)80006-7 10.1117/1.JEI.22.1.013016 10.1109/5.18626 10.1016/S0031-3203(99)00055-2 10.1109/TPAMI.2006.102 10.1016/j.patrec.2011.02.006 10.1007/s10032-017-0289-3 10.1016/j.patrec.2013.07.007 10.1007/s00034-009-9130-7 10.1007/s10032-010-0122-8 10.1016/j.neucom.2016.07.020 10.1145/1276377.1276390 10.1109/TPAMI.2007.59 10.1109/TPAMI.2011.219 10.1016/j.culher.2017.10.001 10.1007/BF01212455 10.1049/iet-cvi.2017.0468 10.1007/s10032-006-0023-z 10.1147/rd.266.0647 10.1007/BF00204594 10.1016/j.patrec.2010.04.003 10.9781/ijimai.2016.411 10.1186/s13640-015-0102-5 10.5339/qfarf.2013.ICTP-057 10.1007/s10898-007-9149-x |
ContentType | Journal Article |
Copyright | 2019 Attribution - NonCommercial |
Copyright_xml | – notice: 2019 – notice: Attribution - NonCommercial |
DBID | AAYXX CITATION 1XC VOOES |
DOI | 10.1016/j.patcog.2019.107144 |
DatabaseName | CrossRef Hyper Article en Ligne (HAL) Hyper Article en Ligne (HAL) (Open Access) |
DatabaseTitle | CrossRef |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISSN | 1873-5142 |
EndPage | 1:107144-17 |
ExternalDocumentID | oai_HAL_hal_02481354v1 10_1016_j_patcog_2019_107144 S0031320319304455 |
GroupedDBID | --K --M -D8 -DT -~X .DC .~1 0R~ 123 1B1 1RT 1~. 1~5 29O 4.4 457 4G. 53G 5VS 7-5 71M 8P~ 9JN AABNK AACTN AAEDT AAEDW AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AAXUO AAYFN ABBOA ABEFU ABFNM ABFRF ABHFT ABJNI ABMAC ABTAH ABXDB ABYKQ ACBEA ACDAQ ACGFO ACGFS ACNNM ACRLP ACZNC ADBBV ADEZE ADJOM ADMUD ADMXK ADTZH AEBSH AECPX AEFWE AEKER AENEX AFKWA AFTJW AGHFR AGUBO AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIKHN AITUG AJBFU AJOXV ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD ASPBG AVWKF AXJTR AZFZN BJAXD BKOJK BLXMC CS3 DU5 EBS EFJIC EFLBG EJD EO8 EO9 EP2 EP3 F0J F5P FD6 FDB FEDTE FGOYB FIRID FNPLU FYGXN G-Q G8K GBLVA GBOLZ HLZ HVGLF HZ~ H~9 IHE J1W JJJVA KOM KZ1 LG9 LMP LY1 M41 MO0 N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. Q38 R2- RIG RNS ROL RPZ SBC SDF SDG SDP SDS SES SEW SPC SPCBC SST SSV SSZ T5K TN5 UNMZH VOH WUQ XJE XPP ZMT ZY4 ~G- AATTM AAXKI AAYWO AAYXX ABDPE ABWVN ACRPL ACVFH ADCNI ADNMO AEIPS AEUPX AFJKZ AFPUW AFXIZ AGCQF AGQPQ AGRNS AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP BNPGV CITATION SSH 1XC VOOES |
ID | FETCH-LOGICAL-c386t-ecfa01c2087f9fd2566c480b376a37162c2886f4c3b13f5aa1a0ece93d81f6753 |
IEDL.DBID | .~1 |
ISSN | 0031-3203 |
IngestDate | Fri May 09 12:23:50 EDT 2025 Tue Jul 01 02:36:30 EDT 2025 Thu Apr 24 22:57:28 EDT 2025 Fri Feb 23 02:46:54 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Keywords | Historical Arabic Documents Data retrieval Text analysis Text recognition Survey on Historical Arabic Documents Writer identification |
Language | English |
License | Attribution - NonCommercial: http://creativecommons.org/licenses/by-nc |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c386t-ecfa01c2087f9fd2566c480b376a37162c2886f4c3b13f5aa1a0ece93d81f6753 |
ORCID | 0000-0002-7383-0588 0000-0002-4864-5380 0009-0008-2570-0843 0000-0001-5179-6035 |
OpenAccessLink | https://hal.science/hal-02481354 |
ParticipantIDs | hal_primary_oai_HAL_hal_02481354v1 crossref_citationtrail_10_1016_j_patcog_2019_107144 crossref_primary_10_1016_j_patcog_2019_107144 elsevier_sciencedirect_doi_10_1016_j_patcog_2019_107144 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | April 2020 2020-04-00 2020-04 |
PublicationDateYYYYMMDD | 2020-04-01 |
PublicationDate_xml | – month: 04 year: 2020 text: April 2020 |
PublicationDecade | 2020 |
PublicationTitle | Pattern recognition |
PublicationYear | 2020 |
Publisher | Elsevier Ltd Elsevier |
Publisher_xml | – name: Elsevier Ltd – name: Elsevier |
References | Boussellaa, Zahour, Taconet, Alimi, Benabdelhafid (bib0025) 2007; 2 Breuel, Shafait (bib0027) 2010 Knuth (bib0060) 1993 Makhfi, Bannay, Benslimane, Rais (bib0070) 2011 Kassis, El-Sana (bib0054) 2016 Poznanski, Wolf (bib0078) 2016 Rabaev, Dinstein, El-Sana, Kedem (bib0081) 2014 Saabni, El-Sana (bib0086) 2011 Rabaev, Cohen, El-Sana, Kedem (bib0080) 2015 Asi, Saabni, El-Sana (bib0017) 2011 Faisal, AlMaadeed (bib0039) 2017 Wshah, Govindaraju, Cheng, Li (bib0100) 2010 Althobaiti, Lu (bib0007) 2017 Rabaev, Biller, El-Sana, Kedem, Dinstein (bib0079) 2013 Elfattah, Hassanien, Mostafa, Ali, Amin, Mohamed (bib0037) 2015 Levi, Montanari (bib0063) 1970; 17 Bai, Latecki, Liu (bib0020) 2007; 29 Asi, El-Sana, Mrgner (bib0015) 2012 . Indian, Bhatia (bib0049) 2017 Hassane (bib0046) 2013 Saabni, El-sana (bib0084) 2008 Wong, Casey, Wahl (bib0099) 1982; 26 Giotis, Sfikas, Gatos, Nikou (bib0044) 2017; 68 Ballard (bib0021) 1981; 13 Shahkolaei, Beghdadi, Al-maadeed, Cheriet (bib0091) 2018 Maalej, Kherallah (bib0069) 2018 Stamatopoulos, Gatos, Louloudis, Pal, Alaei (bib0096) 2013 Gatos, Stamatopoulos, Louloudis (bib0043) 2011; 14 Sakoe, Chiba (bib0088) 1978; 26 Yoo, Han (bib0101) 2009; 28 Bromley, Guyon, LeCun, Säckinger, Shah (bib0028) 1993 Zirari, Ennaji, Nicolas, Mammass (bib0105) 2013 Asi, Abdalhaleem, Fecker, Märgner, El-Sana (bib0012) 2017; 20 Alaasam, Barakat, El-Sana (bib0004) 2018 Parvez, Mahmoud (bib0075) 2013; 45 Asi, Cohen, Kedem, El-Sana, Dinstein (bib0014) 2014 Avidan, Shamir (bib0018) 2007; 26 Fecker, Asit, Mrgner, El-Sana, Fingscheidt (bib0041) 2014 Barakat, Alasam, El-Sana (bib0022) 2018 Fogel, Sagi (bib0042) 1989; 61 Kolcz, Alspector, Augusteijn, Carlson, Viorel Popescu (bib0061) 2000; 3 Kassis, El-Sana (bib0055) 2016 Pechwitz, Maddouri, Mrgner, Ellouze, Amiri (bib0076) 2002 Hassanien, Elfattah, Aboulenin, Schaefer, Zhu, Korovin (bib0047) 2016 Dalal, Triggs (bib0034) 2005; 1 Kassis, Nassour, El-Sana (bib0056) 2017; 01 Tagougui, Kherallah, Alimi (bib0097) 2013; 16 AlKhateeb, Ren, Jiang, Al-Muhtaseb (bib0006) 2011; 32 Elleuch, Tagougui, Kherallah (bib0038) 2015 Zahour, Taconet, Mercy, Ramdane (bib0102) 2001 Saabni, El-Sana (bib0085) 2009 Rabiner (bib0082) 1989; 77 Boussellaa, Zahour, Taconet, Benabdelhafid, Alimi (bib0026) 2006 Zayene, Touj, Hennebert, Ingold, Amara (bib0104) 2018; 12 El-etriby, Amin (bib0035) 2010 Vapnik (bib0098) 1995 Sauvola, Pietikinen (bib0089) 2000; 33 Asi, Cohen, Kedem, El-Sana (bib0013) 2015 Povey, Ghoshal, Boulianne, Burget, Glembek, Goel, Hannemann, Motlicek, Qian, Schwarz, Silovsky, Stemmer, Vesely (bib0077) 2011 Cover, Hart (bib0033) 1967; 13 Naegel, Wendling (bib0072) 2010; 31 Alaasam, Kurar, Kassis, El-Sana (bib0005) 2017 Cohen, Dinstein, El-Sana, Kedem (bib0031) 2014 Kassis, El-Sana (bib0053) 2014 Schantz (bib0090) 1982 Sivic, Zisserman (bib0093) 2003; 2 Kiessling, Ezra, Miller (bib0059) 2019; abs/1907.04041 Khaissidi, Elfakir, Mrabti, El-Yacoubi, Chenouni, Lakhliai (bib0058) 2016; 4 Otsu (bib0073) 1979; 9 Srihari, Govindaraju (bib0094) 1989; 2 Barakat, El-Sana (bib0023) 2018 Karaboga, Basturk (bib0051) 2007; 39 Kassis, Abdalhaleem, Droby, Alaasam, El-Sana (bib0052) 2017 Lowe (bib0068) 2001; 1 Guo, Cheng, Tian, Zhang (bib0045) 2009; 35 Lorigo, Govindaraju (bib0067) 2006; 28 Bulacu, Schomaker, Brink (bib0030) 2007; 2 Pantke, Dennhardt, Fecker, Mrgner, Fingscheidt (bib0074) 2014 Hussain, Raza, Siddiqi, Khurshid, Djeddi (bib0048) 2015; 2015 Aouadi, Echi (bib0010) 2014 Abdelhaleem, Droby, Asi, Kassis, Asam, El-Sana (bib0003) 2017 Khader, Al-Marridi, Alpona, Kunhoth, Hassaine, Al-maadeed (bib0057) 2014 Amin, Elfattah, Hassanien, Schaefer (bib0008) 2014 Kulis, Grauman (bib0062) 2012; 34 Amrouch, Rabi (bib0009) 2018 Elfakir, Khaissidi, Mrabti, Chenouni (bib0036) 2015; 126 Shahkolaei, Nafchi, Al-Maadeed, Cheriet (bib0092) 2018; 30 Fecker, Asi, Pantke, Mrgner, El-Sana, Fingscheidt (bib0040) 2014 Moghaddam, Cheriet, Adankon, Filonenko, Wisnovsky (bib0071) 2010 Biller, Asi, Kedem, El-Sana, Dinstein (bib0024) 2013 Saabni, Asi, El-Sana (bib0083) 2014; 35 Cohen, Rabaev, El-Sana, Kedem, Dinstein (bib0032) 2015 Zahour, Taconet, Ramdane (bib0103) 2004 N. Aouadi, A. Kacem, Word Spotting for Arabic Handwritten Historical Document Retrieval using Generalized Hough Transform(2011). Asi, Rabaev, Kedem, El-Sana (bib0016) 2011 Likforman-Sulem, Zahour, Taconet (bib0064) 2007; 9 Bukhari, Breuel, Asi, El-Sana (bib0029) 2012 Lillholm, Griffin (bib0065) 2008 Juma al-majid center for culture and heritage, Accessed: 2018-11-02. Abdalhaleem, Barakat, El-Sana (bib0002) 2018 Jayech, Mahjoub, Amara (bib0050) 2016; 214 Saabni, El-Sana (bib0087) 2013; 22 Awaida (bib0019) 2015 Stahlberg, Vogel (bib0095) 2016 Lins (bib0066) 2009 Barakat (10.1016/j.patcog.2019.107144_bib0022) 2018 Lorigo (10.1016/j.patcog.2019.107144_bib0067) 2006; 28 Sivic (10.1016/j.patcog.2019.107144_bib0093) 2003; 2 Breuel (10.1016/j.patcog.2019.107144_sbref0025) 2010 Aouadi (10.1016/j.patcog.2019.107144_bib0010) 2014 Cover (10.1016/j.patcog.2019.107144_bib0033) 1967; 13 Asi (10.1016/j.patcog.2019.107144_bib0014) 2014 Kulis (10.1016/j.patcog.2019.107144_bib0062) 2012; 34 Fogel (10.1016/j.patcog.2019.107144_bib0042) 1989; 61 Saabni (10.1016/j.patcog.2019.107144_bib0085) 2009 10.1016/j.patcog.2019.107144_bib0011 Yoo (10.1016/j.patcog.2019.107144_bib0101) 2009; 28 Levi (10.1016/j.patcog.2019.107144_bib0063) 1970; 17 Bukhari (10.1016/j.patcog.2019.107144_bib0029) 2012 Stamatopoulos (10.1016/j.patcog.2019.107144_bib0096) 2013 Abdelhaleem (10.1016/j.patcog.2019.107144_bib0003) 2017 Maalej (10.1016/j.patcog.2019.107144_bib0069) 2018 Indian (10.1016/j.patcog.2019.107144_bib0049) 2017 Jayech (10.1016/j.patcog.2019.107144_bib0050) 2016; 214 Zirari (10.1016/j.patcog.2019.107144_bib0105) 2013 Hassanien (10.1016/j.patcog.2019.107144_bib0047) 2016 Pechwitz (10.1016/j.patcog.2019.107144_bib0076) 2002 Elfattah (10.1016/j.patcog.2019.107144_bib0037) 2015 Shahkolaei (10.1016/j.patcog.2019.107144_bib0091) 2018 Parvez (10.1016/j.patcog.2019.107144_bib0075) 2013; 45 Cohen (10.1016/j.patcog.2019.107144_bib0032) 2015 Kolcz (10.1016/j.patcog.2019.107144_bib0061) 2000; 3 Srihari (10.1016/j.patcog.2019.107144_bib0094) 1989; 2 Asi (10.1016/j.patcog.2019.107144_bib0017) 2011 Fecker (10.1016/j.patcog.2019.107144_bib0040) 2014 Guo (10.1016/j.patcog.2019.107144_bib0045) 2009; 35 AlKhateeb (10.1016/j.patcog.2019.107144_bib0006) 2011; 32 Saabni (10.1016/j.patcog.2019.107144_sbref0081) 2014; 35 Giotis (10.1016/j.patcog.2019.107144_bib0044) 2017; 68 Kassis (10.1016/j.patcog.2019.107144_bib0054) 2016 Poznanski (10.1016/j.patcog.2019.107144_bib0078) 2016 Alaasam (10.1016/j.patcog.2019.107144_bib0004) 2018 Karaboga (10.1016/j.patcog.2019.107144_bib0051) 2007; 39 Fecker (10.1016/j.patcog.2019.107144_bib0041) 2014 Barakat (10.1016/j.patcog.2019.107144_bib0023) 2018 Tagougui (10.1016/j.patcog.2019.107144_bib0097) 2013; 16 Kassis (10.1016/j.patcog.2019.107144_bib0055) 2016 Lowe (10.1016/j.patcog.2019.107144_bib0068) 2001; 1 Faisal (10.1016/j.patcog.2019.107144_bib0039) 2017 Kassis (10.1016/j.patcog.2019.107144_bib0056) 2017; 01 Alaasam (10.1016/j.patcog.2019.107144_bib0005) 2017 Ballard (10.1016/j.patcog.2019.107144_bib0021) 1981; 13 Pantke (10.1016/j.patcog.2019.107144_bib0074) 2014 Schantz (10.1016/j.patcog.2019.107144_bib0090) 1982 Saabni (10.1016/j.patcog.2019.107144_bib0084) 2008 Likforman-Sulem (10.1016/j.patcog.2019.107144_bib0064) 2007; 9 Boussellaa (10.1016/j.patcog.2019.107144_bib0025) 2007; 2 Shahkolaei (10.1016/j.patcog.2019.107144_bib0092) 2018; 30 Awaida (10.1016/j.patcog.2019.107144_bib0019) 2015 Khader (10.1016/j.patcog.2019.107144_bib0057) 2014 Lillholm (10.1016/j.patcog.2019.107144_bib0065) 2008 Wong (10.1016/j.patcog.2019.107144_bib0099) 1982; 26 Rabaev (10.1016/j.patcog.2019.107144_bib0081) 2014 Bai (10.1016/j.patcog.2019.107144_bib0020) 2007; 29 Amrouch (10.1016/j.patcog.2019.107144_bib0009) 2018 Althobaiti (10.1016/j.patcog.2019.107144_bib0007) 2017 Zahour (10.1016/j.patcog.2019.107144_bib0102) 2001 Saabni (10.1016/j.patcog.2019.107144_bib0086) 2011 Khaissidi (10.1016/j.patcog.2019.107144_bib0058) 2016; 4 Cohen (10.1016/j.patcog.2019.107144_bib0031) 2014 Sakoe (10.1016/j.patcog.2019.107144_bib0088) 1978; 26 Hassane (10.1016/j.patcog.2019.107144_bib0046) 2013 Lins (10.1016/j.patcog.2019.107144_bib0066) 2009 Gatos (10.1016/j.patcog.2019.107144_bib0043) 2011; 14 Makhfi (10.1016/j.patcog.2019.107144_sbref0068) 2011 Rabaev (10.1016/j.patcog.2019.107144_bib0079) 2013 Abdalhaleem (10.1016/j.patcog.2019.107144_bib0002) 2018 Wshah (10.1016/j.patcog.2019.107144_bib0100) 2010 Kassis (10.1016/j.patcog.2019.107144_bib0053) 2014 Asi (10.1016/j.patcog.2019.107144_bib0013) 2015 Bromley (10.1016/j.patcog.2019.107144_bib0028) 1993 Bulacu (10.1016/j.patcog.2019.107144_bib0030) 2007; 2 Rabiner (10.1016/j.patcog.2019.107144_bib0082) 1989; 77 Amin (10.1016/j.patcog.2019.107144_bib0008) 2014 Vapnik (10.1016/j.patcog.2019.107144_bib0098) 1995 Asi (10.1016/j.patcog.2019.107144_bib0012) 2017; 20 El-etriby (10.1016/j.patcog.2019.107144_bib0035) 2010 Saabni (10.1016/j.patcog.2019.107144_bib0087) 2013; 22 Stahlberg (10.1016/j.patcog.2019.107144_bib0095) 2016 Sauvola (10.1016/j.patcog.2019.107144_bib0089) 2000; 33 Dalal (10.1016/j.patcog.2019.107144_bib0034) 2005; 1 Povey (10.1016/j.patcog.2019.107144_sbref0075) 2011 Zayene (10.1016/j.patcog.2019.107144_bib0104) 2018; 12 Knuth (10.1016/j.patcog.2019.107144_bib0060) 1993 Kiessling (10.1016/j.patcog.2019.107144_bib0059) 2019; abs/1907.04041 Hussain (10.1016/j.patcog.2019.107144_bib0048) 2015; 2015 Naegel (10.1016/j.patcog.2019.107144_bib0072) 2010; 31 Otsu (10.1016/j.patcog.2019.107144_bib0073) 1979; 9 Moghaddam (10.1016/j.patcog.2019.107144_bib0071) 2010 Asi (10.1016/j.patcog.2019.107144_bib0016) 2011 Elfakir (10.1016/j.patcog.2019.107144_sbref0034) 2015; 126 Biller (10.1016/j.patcog.2019.107144_bib0024) 2013 Elleuch (10.1016/j.patcog.2019.107144_bib0038) 2015 Asi (10.1016/j.patcog.2019.107144_bib0015) 2012 Rabaev (10.1016/j.patcog.2019.107144_bib0080) 2015 10.1016/j.patcog.2019.107144_bib0001 Avidan (10.1016/j.patcog.2019.107144_bib0018) 2007; 26 Boussellaa (10.1016/j.patcog.2019.107144_bib0026) 2006 Zahour (10.1016/j.patcog.2019.107144_bib0103) 2004 Kassis (10.1016/j.patcog.2019.107144_bib0052) 2017 |
References_xml | – volume: 26 year: 2007 ident: bib0018 article-title: Seam carving for content-aware image resizing publication-title: ACM Trans. Graph. – start-page: 124 year: 2017 end-page: 128 ident: bib0005 article-title: Experiment study on utilizing convolutional neural networks to recognize historical arabic handwritten text publication-title: 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) – start-page: 15 year: 2014 end-page: 20 ident: bib0074 article-title: An historical handwritten arabic dataset for segmentation-free word spotting - hadara80p publication-title: 2014 14th International Conference on Frontiers in Handwriting Recognition – start-page: 156 year: 2018 end-page: 160 ident: bib0091 article-title: Mhdid: a multi-distortion historical document image database publication-title: 2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR) – reference: ). – start-page: 743 year: 2014 end-page: 748 ident: bib0040 article-title: Document writer analysis with rejection for historical arabic manuscripts publication-title: 2014 14th International Conference on Frontiers in Handwriting Recognition – year: 1993 ident: bib0060 article-title: The Stanford GraphBase - a platform for combinatorial computing. – year: 2011 ident: bib0070 article-title: Search engine of ancient arabic manuscripts based on metadata and xml annotations publication-title: 2011 Colloquium in Information Science and Technology – start-page: 716 year: 2008 end-page: 722 ident: bib0084 article-title: Keyword searching for arabic handwritten documents publication-title: In The 11th International Conference on Frontiers in Handwriting recognition (ICFHR2008), Montreal – volume: 26 start-page: 43 year: 1978 end-page: 49 ident: bib0088 article-title: Dynamic programming algorithm optimization for spoken word recognition publication-title: IEEE Trans. Acoust. Speech Signal Process. – start-page: 1 year: 2017 end-page: 6 ident: bib0007 article-title: A survey on arabic optical character recognition and an isolated handwritten arabic character recognition algorithm using encoded freeman chain code publication-title: 2017 51st Annual Conference on Information Sciences and Systems (CISS) – start-page: 57 year: 2017 end-page: 63 ident: bib0039 article-title: Enabling indexing and retrieval of historical arabic manuscripts through template matching based word spotting publication-title: 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) – volume: 13 start-page: 21 year: 1967 end-page: 27 ident: bib0033 article-title: Nearest Neighbor Pattern Classification – volume: 35 start-page: 628 year: 2009 end-page: 640 ident: bib0045 article-title: A novel approach to speckle reduction in ultrasound imaging publication-title: Ultrasound Med. Biol. – start-page: 305 year: 2013 end-page: 308 ident: bib0024 article-title: Webgt: an interactive web-based system for historical document ground truth generation publication-title: 2013 12th International Conference on Document Analysis and Recognition – volume: 14 start-page: 25 year: 2011 end-page: 33 ident: bib0043 article-title: Icdar2009 handwriting segmentation contest publication-title: Int. J. Doc. Anal. Recognit. – start-page: 1 year: 2008 end-page: 4 ident: bib0065 article-title: Novel image feature alphabets for object recognition publication-title: 2008 19th International Conference on Pattern Recognition – volume: 28 start-page: 819 year: 2009 ident: bib0101 article-title: Fast normalized cross-correlation publication-title: Circuits Syst. Signal Process. – start-page: 139 year: 2006 end-page: 144 ident: bib0026 article-title: Segmentation texte /graphique : application au manuscrits Arabes Anciens – start-page: 867 year: 2009 end-page: 871 ident: bib0085 article-title: Hierarchical on-line arabic handwriting recognition publication-title: 2009 10th International Conference on Document Analysis and Recognition – start-page: 138 year: 2018 end-page: 149 ident: bib0009 article-title: Deep neural networks features for arabic handwriting recognition publication-title: Advanced Information Technology, Services and Systems – volume: 2 start-page: 1470 year: 2003 end-page: 1477 ident: bib0093 article-title: Video google: a text retrieval approach to object matching in videos publication-title: Proceedings of the 9th IEEE International Conference on Computer Vision – volume: 3 start-page: 153 year: 2000 end-page: 168 ident: bib0061 article-title: A line-oriented approach to word spotting in handwritten documents publication-title: Pattern Anal. Appl. – start-page: 22 year: 2011 end-page: 28 ident: bib0016 article-title: User-assisted alignment of arabic historical manuscripts publication-title: Proceedings of the 2011 Workshop on Historical Document Imaging and Processing, HIP@ICDAR 2011, Beijing, China, September 16–17, 2011 – start-page: 266 year: 2014 end-page: 270 ident: bib0008 article-title: A binarization algorithm for historical arabic manuscript images using a neutrosophic approach publication-title: 2014 9th International Conference on Computer Engineering Systems (ICCES) – volume: 9 start-page: 123 year: 2007 end-page: 138 ident: bib0064 article-title: Text line segmentation of historical documents: a survey publication-title: Int. J. Doc. Anal. Recognit. – start-page: 140 year: 2014 end-page: 145 ident: bib0014 article-title: A coarse-to-fine approach for layout analysis of ancient manuscripts publication-title: 14th International Conference on Frontiers in Handwriting Recognition, ICFHR 2014, Crete, Greece, September 1–4, 2014 – volume: 31 start-page: 1251 year: 2010 end-page: 1259 ident: bib0072 article-title: A document binarization method based on connected operators publication-title: Pattern Recognit. Lett. – volume: 68 start-page: 310 year: 2017 end-page: 332 ident: bib0044 article-title: A survey of document image word spotting techniques publication-title: Pattern Recognit. – volume: 2 start-page: 141 year: 1989 end-page: 153 ident: bib0094 article-title: Analysis of textual images using the hough transform publication-title: Mach. Vision Appl. – start-page: 251 year: 2015 end-page: 255 ident: bib0037 article-title: Artificial bee colony optimizer for historical arabic manuscript images binarization publication-title: 2015 11th International Computer Engineering Conference (ICENCO) – start-page: 168 year: 2016 end-page: 173 ident: bib0095 article-title: Qatip–an optical character recognition system for arabic heritage collections in libraries publication-title: 2016 12th IAPR Workshop on Document Analysis Systems (DAS) – start-page: 737 year: 1993 end-page: 744 ident: bib0028 article-title: Signature verification using a “siamese” time delay neural network publication-title: Proceedings of the 6th International Conference on Neural Information Processing Systems – volume: 28 start-page: 712 year: 2006 end-page: 724 ident: bib0067 article-title: Offline arabic handwriting recognition: a survey publication-title: IEEE Trans. Pattern Anal. Mach. Intell. – start-page: 229 year: 2018 end-page: 234 ident: bib0022 article-title: Word spotting using convolutional siamese network publication-title: 2018 13th IAPR International Workshop on Document Analysis Systems (DAS) – start-page: 1 year: 2018 end-page: 6 ident: bib0069 article-title: Convolutional neural network and blstm for offline arabic handwriting recognition publication-title: 2018 International Arab Conference on Information Technology (ACIT) – volume: 2 start-page: 1058 year: 2007 end-page: 1062 ident: bib0025 article-title: Praad: preprocessing and analysis tool for arabic ancient documents publication-title: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) – start-page: 266 year: 2015 end-page: 270 ident: bib0032 article-title: Aligning transcript of historical documents using energy minimization publication-title: 2015 13th International Conference on Document Analysis and Recognition (ICDAR) – start-page: 64 year: 2017 end-page: 68 ident: bib0003 article-title: WAHD: a database for writer identification of arabic historical documents publication-title: 1st International Workshop on Arabic Script Analysis and Recognition, ASAR 2017, Nancy, France, April 3–5, 2017 – start-page: 1 year: 2014 end-page: 4 ident: bib0057 article-title: An interactive annotation tool for indexing historical manuscripts publication-title: 2014 World Symposium on Computer Applications Research (WSCAR) – year: 2004 ident: bib0103 article-title: Contribution à la segmentation de textes manuscrits anciens – volume: 29 start-page: 449 year: 2007 end-page: 462 ident: bib0020 article-title: Skeleton pruning by contour partitioning with discrete curve evolution publication-title: IEEE Trans. Pattern Anal. Mach.Intell. – start-page: 369 year: 2014 end-page: 378 ident: bib0081 article-title: Segmentation-free keyword retrieval in historical document images publication-title: Image Analysis and Recognition – start-page: 31 year: 2014 end-page: 36 ident: bib0010 article-title: Prior segmentation of old arabic manuscripts by separator word spotting publication-title: 2014 6th International Conference of Soft Computing and Pattern Recognition (SoCPaR) – start-page: 129 year: 2002 end-page: 136 ident: bib0076 article-title: Ifn/enit - database of handwritten arabic words publication-title: In Proc. of CIFED 2002 – start-page: 003842 year: 2016 end-page: 003846 ident: bib0047 article-title: Historic handwritten manuscript binarisation using whale optimisation publication-title: 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) – start-page: 94020I year: 2015 ident: bib0080 article-title: Aligning transcript of historical documents using dynamic programming publication-title: Document Recognition and Retrieval XXII, San Francisco, California, USA, February 11–12, 2015. – start-page: 11 year: 2017 end-page: 14 ident: bib0052 article-title: Vml-hd: the historical arabic documents dataset for recognition systems publication-title: 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) – volume: 33 start-page: 225 year: 2000 end-page: 236 ident: bib0089 article-title: Adaptive document image binarization publication-title: Pattern Recognit. – volume: 16 start-page: 209 year: 2013 end-page: 226 ident: bib0097 article-title: Online arabic handwriting recognition: a survey publication-title: Int. J. Document Anal. Recognit. (IJDAR) – start-page: 1266 year: 2012 end-page: 1271 ident: bib0015 article-title: Hierarchical scheme for arabic text recognition publication-title: 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA) – volume: 2 start-page: 769 year: 2007 end-page: 773 ident: bib0030 article-title: Text-independent writer identification and verification on offline arabic handwriting publication-title: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) – start-page: 3050 year: 2014 end-page: 3055 ident: bib0041 article-title: Writer identification for historical arabic documents publication-title: 2014 22nd International Conference on Pattern Recognition – year: 1982 ident: bib0090 article-title: History of OCR, Optical Character Recognition – volume: 13 start-page: 111 year: 1981 end-page: 122 ident: bib0021 article-title: Generalizing the hough transform to detect arbitrary shapes publication-title: Pattern Recognit. – volume: 30 start-page: 199 year: 2018 end-page: 209 ident: bib0092 article-title: Subjective and objective quality assessment of degraded document images publication-title: J. Cultural Heritage – start-page: 151 year: 2018 end-page: 155 ident: bib0023 article-title: Binarization free layout analysis for arabic historical documents using fully convolutional networks publication-title: 2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR) – start-page: 13 year: 2016 end-page: 18 ident: bib0054 article-title: Scribble based interactive page layout segmentation using gabor filter publication-title: 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) – start-page: 812 year: 2013 end-page: 816 ident: bib0079 article-title: Text line detection in corrupted and damaged historical manuscripts publication-title: 2013 12th International Conference on Document Analysis and Recognition – start-page: 844 year: 2009 end-page: 854 ident: bib0066 article-title: A taxonomy for noise in images of paper documents - the physical noises publication-title: Image Analysis and Recognition – volume: 4 start-page: 6 year: 2016 end-page: 10 ident: bib0058 article-title: Segmentation-free word spotting for handwritten arabic documents. publication-title: IJIMAI – start-page: ICTP057 year: 2013 ident: bib0046 article-title: A robust method for line and word segmentation in handwritten text publication-title: Qatar Found. Annu. Res. Forum Proc. – volume: 17 start-page: 62 year: 1970 end-page: 91 ident: bib0063 article-title: A grey-weighted skeleton publication-title: Inf. Control – start-page: 2865 year: 2010 end-page: 2868 ident: bib0100 article-title: A novel lexicon reduction method for arabic handwriting recognition publication-title: 2010 20th International Conference on Pattern Recognition – volume: 32 start-page: 1081 year: 2011 end-page: 1088 ident: bib0006 article-title: Offline handwritten arabic cursive text recognition using hidden markov models and re-ranking publication-title: Pattern Recognit. Lett. – start-page: 387 year: 2014 end-page: 392 ident: bib0053 article-title: Word spotting using radial descriptor publication-title: 2014 14th International Conference on Frontiers in Handwriting Recognition – volume: 20 start-page: 173 year: 2017 end-page: 187 ident: bib0012 article-title: On writer identification for arabic historical manuscripts publication-title: Int. J. Doc. Anal. Recognit. – start-page: 62 year: 2018 end-page: 66 ident: bib0002 article-title: Case study: fine writing style classification using siamese neural network publication-title: 2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR) – start-page: 1 year: 2010 end-page: 6 ident: bib0035 article-title: Detection and correction of deformed historical arabic manuscripts publication-title: Computer and Communication Engineering (ICCCE), 2010 International Conference on – volume: 9 start-page: 62 year: 1979 end-page: 66 ident: bib0073 article-title: A threshold selection method from gray-level histograms publication-title: IEEE Trans. Syst. Man Cybern. – start-page: 120 year: 2011 end-page: 126 ident: bib0017 article-title: Text line segmentation for gray scale historical document images publication-title: Proceedings of the 2011 Workshop on Historical Document Imaging and Processing – start-page: 563 year: 2011 end-page: 568 ident: bib0086 article-title: Language-independent text lines extraction using seam carving publication-title: 2011 International Conference on Document Analysis and Recognition – reference: Juma al-majid center for culture and heritage, Accessed: 2018-11-02. ( – start-page: 11 year: 2010 end-page: 18 ident: bib0071 article-title: IBN SINA: a database for research on processing and understanding of Arabic manuscripts images publication-title: DAS ’10: Proceedings of the 8th IAPR International Workshop on Document Analysis Systems – volume: 77 start-page: 257 year: 1989 end-page: 286 ident: bib0082 article-title: A tutorial on hidden markov models and selected applications in speech recognition publication-title: Proc. IEEE – start-page: 349 year: 2014 end-page: 358 ident: bib0031 article-title: Using scale-space anisotropic smoothing for text line extraction in historical documents publication-title: Image Analysis and Recognition - 11th International Conference, ICIAR 2014, Vilamoura, Portugal, October 22–24, 2014, Proceedings, Part I – volume: 126 start-page: 14 year: 2015 end-page: 18 ident: bib0036 article-title: Article: Handwritten arabic documents indexation using hog feature publication-title: Int. J. Comput. Appl. – start-page: 826 year: 2015 end-page: 830 ident: bib0013 article-title: Simplifying the reading of historical manuscripts publication-title: 2015 13th International Conference on Document Analysis and Recognition (ICDAR) – start-page: 1 year: 2015 end-page: 4 ident: bib0019 article-title: Text independent writer identification of arabic manuscripts and the effects of writers increase publication-title: International Conference on Computer Vision and Image Analysis Applications – volume: 35 start-page: 23 year: 2014 end-page: 33 ident: bib0083 article-title: Text line extraction for historical document images publication-title: Pattern Recognition Letters – start-page: 371 year: 2015 end-page: 382 ident: bib0038 article-title: Deep learning for feature extraction of arabic handwritten script publication-title: Computer Analysis of Images and Patterns – year: 2011 ident: bib0077 article-title: The kaldi speech recognition toolkit publication-title: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding – start-page: 31 year: 2016 end-page: 35 ident: bib0055 article-title: Word spotting using radial descriptor graph publication-title: 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) – volume: 34 start-page: 1092 year: 2012 end-page: 1104 ident: bib0062 article-title: Kernelized locality-sensitive hashing publication-title: IEEE Trans. Pattern Anal. Mach. Intell. – start-page: 2305 year: 2016 end-page: 2314 ident: bib0078 article-title: Cnn-n-gram for handwritingword recognition publication-title: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) – volume: 39 start-page: 459 year: 2007 end-page: 471 ident: bib0051 article-title: A powerful and efficient algorithm for numerical function optimization: artificial bee colony (abc) algorithm publication-title: J. Global Optim. – volume: 1 start-page: 886 year: 2005 end-page: 893 ident: bib0034 article-title: Histograms of oriented gradients for human detection publication-title: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition – volume: 214 start-page: 958 year: 2016 end-page: 971 ident: bib0050 article-title: Synchronous multi-stream hidden markov model for offline arabic handwriting recognition without explicit segmentation publication-title: Neurocomputing – volume: abs/1907.04041 year: 2019 ident: bib0059 article-title: BADAM: a public dataset for baseline detection in arabic-script manuscripts publication-title: CoRR – start-page: 639 year: 2012 end-page: 644 ident: bib0029 article-title: Layout analysis for arabic historical document images using machine learning publication-title: 2012 International Conference on Frontiers in Handwriting Recognition – year: 1995 ident: bib0098 article-title: The Nature of Statistical Learning Theory – volume: 45 start-page: 23:1 year: 2013 end-page: 23:35 ident: bib0075 article-title: Offline arabic handwritten text recognition: a survey publication-title: ACM Comput. Surv. – start-page: 1402 year: 2013 end-page: 1406 ident: bib0096 article-title: Icdar 2013 handwriting segmentation contest publication-title: 2013 12th International Conference on Document Analysis and Recognition – volume: 12 start-page: 710 year: 2018 end-page: 719 ident: bib0104 article-title: Multi-dimensional long short-term memory networks for artificial arabic text recognition in news video publication-title: IET Comput. Vision – start-page: 281 year: 2001 end-page: 285 ident: bib0102 article-title: Arabic hand-written text-line extraction publication-title: Proceedings of Sixth International Conference on Document Analysis and Recognition – start-page: 1 year: 2013 end-page: 4 ident: bib0105 article-title: A methodology to spot words in historical arabic documents publication-title: AICCSA – volume: 26 start-page: 647 year: 1982 end-page: 656 ident: bib0099 article-title: Document analysis system publication-title: IBM J. Res. Dev. – volume: 22 start-page: 013016 year: 2013 ident: bib0087 article-title: Keywords image retrieval in historical handwritten arabic documents publication-title: J. Electron. Imag. – year: 2010 ident: bib0027 article-title: Automlp: Simple, effective, fully automated learning rate and size adjustment publication-title: The Learning Workshop – volume: 2015 start-page: 46 year: 2015 ident: bib0048 article-title: A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation publication-title: EURASIP J. Image Video Process. – start-page: 114 year: 2018 end-page: 118 ident: bib0004 article-title: Synthesizing versus augmentation for arabic word recognition with convolutional neural networks publication-title: 2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR) – volume: 1 start-page: 682 year: 2001 end-page: 688 ident: bib0068 article-title: Local feature view clustering for 3d object recognition publication-title: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition – volume: 01 start-page: 293 year: 2017 end-page: 298 ident: bib0056 article-title: Alignment of historical handwritten manuscripts using siamese neural network publication-title: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) – reference: N. Aouadi, A. Kacem, Word Spotting for Arabic Handwritten Historical Document Retrieval using Generalized Hough Transform(2011). – volume: 61 start-page: 103 year: 1989 end-page: 113 ident: bib0042 article-title: Gabor filters as texture discriminator publication-title: Biol. Cybern. – start-page: 1 year: 2017 end-page: 6 ident: bib0049 article-title: A survey of offline handwritten hindi character recognition publication-title: 2017 3rd International Conference on Advances in Computing,Communication Automation (ICACCA) (Fall) – start-page: 120 year: 2011 ident: 10.1016/j.patcog.2019.107144_bib0017 article-title: Text line segmentation for gray scale historical document images – start-page: 114 year: 2018 ident: 10.1016/j.patcog.2019.107144_bib0004 article-title: Synthesizing versus augmentation for arabic word recognition with convolutional neural networks – start-page: 1 year: 2017 ident: 10.1016/j.patcog.2019.107144_bib0007 article-title: A survey on arabic optical character recognition and an isolated handwritten arabic character recognition algorithm using encoded freeman chain code – start-page: 369 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0081 article-title: Segmentation-free keyword retrieval in historical document images – volume: 13 start-page: 111 issue: 2 year: 1981 ident: 10.1016/j.patcog.2019.107144_bib0021 article-title: Generalizing the hough transform to detect arbitrary shapes publication-title: Pattern Recognit. doi: 10.1016/0031-3203(81)90009-1 – volume: 9 start-page: 62 issue: 1 year: 1979 ident: 10.1016/j.patcog.2019.107144_bib0073 article-title: A threshold selection method from gray-level histograms publication-title: IEEE Trans. Syst. Man Cybern. doi: 10.1109/TSMC.1979.4310076 – start-page: 3050 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0041 article-title: Writer identification for historical arabic documents – volume: 3 start-page: 153 issue: 2 year: 2000 ident: 10.1016/j.patcog.2019.107144_bib0061 article-title: A line-oriented approach to word spotting in handwritten documents publication-title: Pattern Anal. Appl. doi: 10.1007/s100440070020 – start-page: 11 year: 2010 ident: 10.1016/j.patcog.2019.107144_bib0071 article-title: IBN SINA: a database for research on processing and understanding of Arabic manuscripts images – volume: 26 start-page: 43 issue: 1 year: 1978 ident: 10.1016/j.patcog.2019.107144_bib0088 article-title: Dynamic programming algorithm optimization for spoken word recognition publication-title: IEEE Trans. Acoust. Speech Signal Process. doi: 10.1109/TASSP.1978.1163055 – volume: 35 start-page: 628 issue: 4 year: 2009 ident: 10.1016/j.patcog.2019.107144_bib0045 article-title: A novel approach to speckle reduction in ultrasound imaging publication-title: Ultrasound Med. Biol. doi: 10.1016/j.ultrasmedbio.2008.09.007 – volume: 45 start-page: 23:1 issue: 2 year: 2013 ident: 10.1016/j.patcog.2019.107144_bib0075 article-title: Offline arabic handwritten text recognition: a survey publication-title: ACM Comput. Surv. doi: 10.1145/2431211.2431222 – volume: 68 start-page: 310 issue: C year: 2017 ident: 10.1016/j.patcog.2019.107144_bib0044 article-title: A survey of document image word spotting techniques publication-title: Pattern Recognit. doi: 10.1016/j.patcog.2017.02.023 – start-page: 867 year: 2009 ident: 10.1016/j.patcog.2019.107144_bib0085 article-title: Hierarchical on-line arabic handwriting recognition – start-page: 22 year: 2011 ident: 10.1016/j.patcog.2019.107144_bib0016 article-title: User-assisted alignment of arabic historical manuscripts – start-page: 11 year: 2017 ident: 10.1016/j.patcog.2019.107144_bib0052 article-title: Vml-hd: the historical arabic documents dataset for recognition systems – year: 1982 ident: 10.1016/j.patcog.2019.107144_bib0090 – start-page: 140 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0014 article-title: A coarse-to-fine approach for layout analysis of ancient manuscripts – start-page: 31 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0010 article-title: Prior segmentation of old arabic manuscripts by separator word spotting – volume: 16 start-page: 209 issue: 3 year: 2013 ident: 10.1016/j.patcog.2019.107144_bib0097 article-title: Online arabic handwriting recognition: a survey publication-title: Int. J. Document Anal. Recognit. (IJDAR) doi: 10.1007/s10032-012-0186-8 – start-page: 139 year: 2006 ident: 10.1016/j.patcog.2019.107144_bib0026 – start-page: 62 year: 2018 ident: 10.1016/j.patcog.2019.107144_bib0002 article-title: Case study: fine writing style classification using siamese neural network – volume: 17 start-page: 62 issue: 1 year: 1970 ident: 10.1016/j.patcog.2019.107144_bib0063 article-title: A grey-weighted skeleton publication-title: Inf. Control doi: 10.1016/S0019-9958(70)80006-7 – start-page: 1 year: 2015 ident: 10.1016/j.patcog.2019.107144_bib0019 article-title: Text independent writer identification of arabic manuscripts and the effects of writers increase – start-page: 1 year: 2018 ident: 10.1016/j.patcog.2019.107144_bib0069 article-title: Convolutional neural network and blstm for offline arabic handwriting recognition – volume: 2 start-page: 1470 year: 2003 ident: 10.1016/j.patcog.2019.107144_bib0093 article-title: Video google: a text retrieval approach to object matching in videos – start-page: 305 year: 2013 ident: 10.1016/j.patcog.2019.107144_bib0024 article-title: Webgt: an interactive web-based system for historical document ground truth generation – volume: 22 start-page: 013016 issue: 1 year: 2013 ident: 10.1016/j.patcog.2019.107144_bib0087 article-title: Keywords image retrieval in historical handwritten arabic documents publication-title: J. Electron. Imag. doi: 10.1117/1.JEI.22.1.013016 – start-page: 1402 year: 2013 ident: 10.1016/j.patcog.2019.107144_bib0096 article-title: Icdar 2013 handwriting segmentation contest – volume: 77 start-page: 257 issue: 2 year: 1989 ident: 10.1016/j.patcog.2019.107144_bib0082 article-title: A tutorial on hidden markov models and selected applications in speech recognition publication-title: Proc. IEEE doi: 10.1109/5.18626 – year: 2011 ident: 10.1016/j.patcog.2019.107144_sbref0075 article-title: The kaldi speech recognition toolkit – volume: 33 start-page: 225 issue: 2 year: 2000 ident: 10.1016/j.patcog.2019.107144_bib0089 article-title: Adaptive document image binarization publication-title: Pattern Recognit. doi: 10.1016/S0031-3203(99)00055-2 – volume: 28 start-page: 712 issue: 5 year: 2006 ident: 10.1016/j.patcog.2019.107144_bib0067 article-title: Offline arabic handwriting recognition: a survey publication-title: IEEE Trans. Pattern Anal. Mach. Intell. doi: 10.1109/TPAMI.2006.102 – start-page: 1 year: 2017 ident: 10.1016/j.patcog.2019.107144_bib0049 article-title: A survey of offline handwritten hindi character recognition – start-page: 1266 year: 2012 ident: 10.1016/j.patcog.2019.107144_bib0015 article-title: Hierarchical scheme for arabic text recognition – volume: 32 start-page: 1081 issue: 8 year: 2011 ident: 10.1016/j.patcog.2019.107144_bib0006 article-title: Offline handwritten arabic cursive text recognition using hidden markov models and re-ranking publication-title: Pattern Recognit. Lett. doi: 10.1016/j.patrec.2011.02.006 – volume: 2 start-page: 769 year: 2007 ident: 10.1016/j.patcog.2019.107144_bib0030 article-title: Text-independent writer identification and verification on offline arabic handwriting – volume: 1 start-page: 886 year: 2005 ident: 10.1016/j.patcog.2019.107144_bib0034 article-title: Histograms of oriented gradients for human detection – start-page: 737 year: 1993 ident: 10.1016/j.patcog.2019.107144_bib0028 article-title: Signature verification using a “siamese” time delay neural network – volume: 20 start-page: 173 issue: 3 year: 2017 ident: 10.1016/j.patcog.2019.107144_bib0012 article-title: On writer identification for arabic historical manuscripts publication-title: Int. J. Doc. Anal. Recognit. doi: 10.1007/s10032-017-0289-3 – volume: 01 start-page: 293 year: 2017 ident: 10.1016/j.patcog.2019.107144_bib0056 article-title: Alignment of historical handwritten manuscripts using siamese neural network – start-page: 2865 year: 2010 ident: 10.1016/j.patcog.2019.107144_bib0100 article-title: A novel lexicon reduction method for arabic handwriting recognition – volume: 35 start-page: 23 year: 2014 ident: 10.1016/j.patcog.2019.107144_sbref0081 article-title: Text line extraction for historical document images publication-title: Pattern Recognition Letters doi: 10.1016/j.patrec.2013.07.007 – start-page: 138 year: 2018 ident: 10.1016/j.patcog.2019.107144_bib0009 article-title: Deep neural networks features for arabic handwriting recognition – volume: 28 start-page: 819 issue: 6 year: 2009 ident: 10.1016/j.patcog.2019.107144_bib0101 article-title: Fast normalized cross-correlation publication-title: Circuits Syst. Signal Process. doi: 10.1007/s00034-009-9130-7 – start-page: 124 year: 2017 ident: 10.1016/j.patcog.2019.107144_bib0005 article-title: Experiment study on utilizing convolutional neural networks to recognize historical arabic handwritten text – start-page: 15 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0074 article-title: An historical handwritten arabic dataset for segmentation-free word spotting - hadara80p – start-page: 844 year: 2009 ident: 10.1016/j.patcog.2019.107144_bib0066 article-title: A taxonomy for noise in images of paper documents - the physical noises – start-page: 1 year: 2008 ident: 10.1016/j.patcog.2019.107144_bib0065 article-title: Novel image feature alphabets for object recognition – volume: 14 start-page: 25 issue: 1 year: 2011 ident: 10.1016/j.patcog.2019.107144_bib0043 article-title: Icdar2009 handwriting segmentation contest publication-title: Int. J. Doc. Anal. Recognit. doi: 10.1007/s10032-010-0122-8 – year: 2004 ident: 10.1016/j.patcog.2019.107144_bib0103 – ident: 10.1016/j.patcog.2019.107144_bib0011 – volume: 214 start-page: 958 year: 2016 ident: 10.1016/j.patcog.2019.107144_bib0050 article-title: Synchronous multi-stream hidden markov model for offline arabic handwriting recognition without explicit segmentation publication-title: Neurocomputing doi: 10.1016/j.neucom.2016.07.020 – start-page: 156 year: 2018 ident: 10.1016/j.patcog.2019.107144_bib0091 article-title: Mhdid: a multi-distortion historical document image database – volume: 26 issue: 3 year: 2007 ident: 10.1016/j.patcog.2019.107144_bib0018 article-title: Seam carving for content-aware image resizing publication-title: ACM Trans. Graph. doi: 10.1145/1276377.1276390 – year: 2010 ident: 10.1016/j.patcog.2019.107144_sbref0025 article-title: Automlp: Simple, effective, fully automated learning rate and size adjustment – start-page: 1 year: 2013 ident: 10.1016/j.patcog.2019.107144_bib0105 article-title: A methodology to spot words in historical arabic documents – volume: 29 start-page: 449 issue: 3 year: 2007 ident: 10.1016/j.patcog.2019.107144_bib0020 article-title: Skeleton pruning by contour partitioning with discrete curve evolution publication-title: IEEE Trans. Pattern Anal. Mach.Intell. doi: 10.1109/TPAMI.2007.59 – start-page: 266 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0008 article-title: A binarization algorithm for historical arabic manuscript images using a neutrosophic approach – start-page: 826 year: 2015 ident: 10.1016/j.patcog.2019.107144_bib0013 article-title: Simplifying the reading of historical manuscripts – start-page: 31 year: 2016 ident: 10.1016/j.patcog.2019.107144_bib0055 article-title: Word spotting using radial descriptor graph – start-page: 94020I year: 2015 ident: 10.1016/j.patcog.2019.107144_bib0080 article-title: Aligning transcript of historical documents using dynamic programming – volume: 2 start-page: 1058 year: 2007 ident: 10.1016/j.patcog.2019.107144_bib0025 article-title: Praad: preprocessing and analysis tool for arabic ancient documents – volume: 1 start-page: 682 year: 2001 ident: 10.1016/j.patcog.2019.107144_bib0068 article-title: Local feature view clustering for 3d object recognition – start-page: 563 year: 2011 ident: 10.1016/j.patcog.2019.107144_bib0086 article-title: Language-independent text lines extraction using seam carving – start-page: 229 year: 2018 ident: 10.1016/j.patcog.2019.107144_bib0022 article-title: Word spotting using convolutional siamese network – volume: 34 start-page: 1092 issue: 6 year: 2012 ident: 10.1016/j.patcog.2019.107144_bib0062 article-title: Kernelized locality-sensitive hashing publication-title: IEEE Trans. Pattern Anal. Mach. Intell. doi: 10.1109/TPAMI.2011.219 – volume: 30 start-page: 199 year: 2018 ident: 10.1016/j.patcog.2019.107144_bib0092 article-title: Subjective and objective quality assessment of degraded document images publication-title: J. Cultural Heritage doi: 10.1016/j.culher.2017.10.001 – volume: 2 start-page: 141 issue: 3 year: 1989 ident: 10.1016/j.patcog.2019.107144_bib0094 article-title: Analysis of textual images using the hough transform publication-title: Mach. Vision Appl. doi: 10.1007/BF01212455 – start-page: 266 year: 2015 ident: 10.1016/j.patcog.2019.107144_bib0032 article-title: Aligning transcript of historical documents using energy minimization – volume: 12 start-page: 710 issue: 5 year: 2018 ident: 10.1016/j.patcog.2019.107144_bib0104 article-title: Multi-dimensional long short-term memory networks for artificial arabic text recognition in news video publication-title: IET Comput. Vision doi: 10.1049/iet-cvi.2017.0468 – start-page: 639 year: 2012 ident: 10.1016/j.patcog.2019.107144_bib0029 article-title: Layout analysis for arabic historical document images using machine learning – volume: 9 start-page: 123 issue: 2 year: 2007 ident: 10.1016/j.patcog.2019.107144_bib0064 article-title: Text line segmentation of historical documents: a survey publication-title: Int. J. Doc. Anal. Recognit. doi: 10.1007/s10032-006-0023-z – volume: 26 start-page: 647 issue: 6 year: 1982 ident: 10.1016/j.patcog.2019.107144_bib0099 article-title: Document analysis system publication-title: IBM J. Res. Dev. doi: 10.1147/rd.266.0647 – start-page: 1 year: 2010 ident: 10.1016/j.patcog.2019.107144_bib0035 article-title: Detection and correction of deformed historical arabic manuscripts – start-page: 2305 year: 2016 ident: 10.1016/j.patcog.2019.107144_bib0078 article-title: Cnn-n-gram for handwritingword recognition – start-page: 716 year: 2008 ident: 10.1016/j.patcog.2019.107144_bib0084 article-title: Keyword searching for arabic handwritten documents – start-page: 387 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0053 article-title: Word spotting using radial descriptor – start-page: 168 year: 2016 ident: 10.1016/j.patcog.2019.107144_bib0095 article-title: Qatip–an optical character recognition system for arabic heritage collections in libraries – start-page: 349 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0031 article-title: Using scale-space anisotropic smoothing for text line extraction in historical documents – start-page: 371 year: 2015 ident: 10.1016/j.patcog.2019.107144_bib0038 article-title: Deep learning for feature extraction of arabic handwritten script – volume: 126 start-page: 14 issue: 9 year: 2015 ident: 10.1016/j.patcog.2019.107144_sbref0034 article-title: Article: Handwritten arabic documents indexation using hog feature publication-title: Int. J. Comput. Appl. – start-page: 57 year: 2017 ident: 10.1016/j.patcog.2019.107144_bib0039 article-title: Enabling indexing and retrieval of historical arabic manuscripts through template matching based word spotting – start-page: 281 year: 2001 ident: 10.1016/j.patcog.2019.107144_bib0102 article-title: Arabic hand-written text-line extraction – start-page: 129 year: 2002 ident: 10.1016/j.patcog.2019.107144_bib0076 article-title: Ifn/enit - database of handwritten arabic words – volume: 61 start-page: 103 issue: 2 year: 1989 ident: 10.1016/j.patcog.2019.107144_bib0042 article-title: Gabor filters as texture discriminator publication-title: Biol. Cybern. doi: 10.1007/BF00204594 – volume: abs/1907.04041 year: 2019 ident: 10.1016/j.patcog.2019.107144_bib0059 article-title: BADAM: a public dataset for baseline detection in arabic-script manuscripts publication-title: CoRR – start-page: 1 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0057 article-title: An interactive annotation tool for indexing historical manuscripts – start-page: 64 year: 2017 ident: 10.1016/j.patcog.2019.107144_bib0003 article-title: WAHD: a database for writer identification of arabic historical documents – volume: 13 start-page: 21 year: 1967 ident: 10.1016/j.patcog.2019.107144_bib0033 – start-page: 151 year: 2018 ident: 10.1016/j.patcog.2019.107144_bib0023 article-title: Binarization free layout analysis for arabic historical documents using fully convolutional networks – volume: 31 start-page: 1251 issue: 11 year: 2010 ident: 10.1016/j.patcog.2019.107144_bib0072 article-title: A document binarization method based on connected operators publication-title: Pattern Recognit. Lett. doi: 10.1016/j.patrec.2010.04.003 – start-page: 812 year: 2013 ident: 10.1016/j.patcog.2019.107144_bib0079 article-title: Text line detection in corrupted and damaged historical manuscripts – year: 1993 ident: 10.1016/j.patcog.2019.107144_bib0060 – year: 2011 ident: 10.1016/j.patcog.2019.107144_sbref0068 article-title: Search engine of ancient arabic manuscripts based on metadata and xml annotations – volume: 4 start-page: 6 issue: 1 year: 2016 ident: 10.1016/j.patcog.2019.107144_bib0058 article-title: Segmentation-free word spotting for handwritten arabic documents. publication-title: IJIMAI doi: 10.9781/ijimai.2016.411 – ident: 10.1016/j.patcog.2019.107144_bib0001 – start-page: 003842 year: 2016 ident: 10.1016/j.patcog.2019.107144_bib0047 article-title: Historic handwritten manuscript binarisation using whale optimisation – volume: 2015 start-page: 46 issue: 1 year: 2015 ident: 10.1016/j.patcog.2019.107144_bib0048 article-title: A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation publication-title: EURASIP J. Image Video Process. doi: 10.1186/s13640-015-0102-5 – start-page: 13 year: 2016 ident: 10.1016/j.patcog.2019.107144_bib0054 article-title: Scribble based interactive page layout segmentation using gabor filter – start-page: 743 year: 2014 ident: 10.1016/j.patcog.2019.107144_bib0040 article-title: Document writer analysis with rejection for historical arabic manuscripts – year: 1995 ident: 10.1016/j.patcog.2019.107144_bib0098 – start-page: 251 year: 2015 ident: 10.1016/j.patcog.2019.107144_bib0037 article-title: Artificial bee colony optimizer for historical arabic manuscript images binarization – start-page: ICTP057 year: 2013 ident: 10.1016/j.patcog.2019.107144_bib0046 article-title: A robust method for line and word segmentation in handwritten text publication-title: Qatar Found. Annu. Res. Forum Proc. doi: 10.5339/qfarf.2013.ICTP-057 – volume: 39 start-page: 459 issue: 3 year: 2007 ident: 10.1016/j.patcog.2019.107144_bib0051 article-title: A powerful and efficient algorithm for numerical function optimization: artificial bee colony (abc) algorithm publication-title: J. Global Optim. doi: 10.1007/s10898-007-9149-x |
SSID | ssj0017142 |
Score | 2.4251447 |
Snippet | •Challenges of automatic processing of historical Arabic documents (APHAD).•Classification of APHAD applications into four tasks: Data analysis, Writer... Nowadays, there is a huge amount of Historical Arabic Documents (HAD) in the national libraries and archives around the world. Analyzing this type of data... |
SourceID | hal crossref elsevier |
SourceType | Open Access Repository Enrichment Source Index Database Publisher |
StartPage | 107144 |
SubjectTerms | Artificial Intelligence Computer Science Data retrieval Document and Text Processing Historical Arabic Documents Survey on Historical Arabic Documents Text analysis Text recognition Writer identification |
Title | Automatic processing of Historical Arabic Documents: A comprehensive Survey |
URI | https://dx.doi.org/10.1016/j.patcog.2019.107144 https://hal.science/hal-02481354 |
Volume | 100 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LS8NAEF5qvXjxLdZHWcRrbJLdvLyFYolWe9FCb2Gz2bUVaUJJCl787c7kUfEggrcw2d2E2c08wjffEHLtOp6ypXYNkVquwRVcJaaXGpA8p4Fk0uNVIe3TxI2m_GHmzDpk2NbCIKyysf21Ta-sdSMZNNoc5IsF1vgi7SBW4UBKzh0sNOfcw1N-87mBeWB_75oxnFkGjm7L5yqMVw7mLntFgFcAIhjKf3NPW_P2R2vleEb7ZLeJGGlYv9QB6ajlIdlruzHQ5uM8IuOwLLKKgJXmNfofvBLNNP1mAoFFRAL3wbOUVWnbLQ0pgspXal4D2elzuVqrj2MyHd29DCOjaZVgSOa7haGkFqYlbdP3dKBTiGNcyX0zAfMhGJJESdv3Xc0lSyymHSEsYSqpYEN8S0POwE5Id5kt1SmhAQexz1NHcA65CReecBIIclI_AYMgRI-wVkOxbHjEsZ3Fe9wCxt7iWq8x6jWu9dojxmZWXvNo_DHea5Uf_zgPMZj6P2ZewV5tHoL02VH4GKMM-dss5vC1dfbv5c_Jjo0pdwXeuSDdYlWqS4hLiqRfHbw-2Q7vx9HkCxoP4Ho |
linkProvider | Elsevier |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3JTsMwELXacoALO6KsFuJqmsTOxi2qqAJdLrRSb5bjOLQItVGVVOLCtzPOUsQBVeIWTewkGtuzRG_eIHTv2K6yZOIQEZsOYQquIsONCSTPsS-pdFlRSDscOeGEvUztaQN161oYDausbH9p0wtrXUk6lTY76Xyua3w17aCuwoGUnNl2E-0wOL66jcHD1wbnoRt8l5Th1CR6eF0_V4C8UrB3yzeN8PJBBEPZX_6pOav_tBaep3eI9quQEQflVx2hhloco4O6HQOuTucJ6gd5tiwYWHFawv_BLeFlgn-oQOAhIoL74FryorbtEQdYo8pXalYi2fFrvlqrz1M06T2NuyGpeiUQST0nI0omwjClZXhu4icxBDKOZJ4Rgf0QVLNEScvznIRJGpk0sYUwhaGkghXxzASSBnqGWovlQp0j7DMQeyy2BWOQnDDhCjuCKCf2IrAIQrQRrTXEZUUkrvtZfPAaMfbOS71yrVde6rWNyGZWWhJpbBnv1srnvzYEB1u_ZeYdrNXmJZo_OwwGXMs0gZtJbbY2L_79-Fu0G46HAz54HvUv0Z6l8-8CyXOFWtkqV9cQpGTRTbEJvwFVMeII |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Automatic+processing+of+Historical+Arabic+Documents%3A+A+comprehensive+Survey&rft.jtitle=Pattern+recognition&rft.au=Ibn+Khedher%2C+Mohamed&rft.au=Jmila%2C+Houda&rft.au=El-Yacoubi%2C+Mounim+A.&rft.date=2020-04-01&rft.issn=0031-3203&rft.volume=100&rft.spage=107144&rft_id=info:doi/10.1016%2Fj.patcog.2019.107144&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_patcog_2019_107144 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0031-3203&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0031-3203&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0031-3203&client=summon |