Designing syntactic pattern classifiers using vector quantization and parametric string editing

We consider a fundamental inference problem in syntactic pattern recognition (PR). We assume that the system has a dictionary which is a collection of all the ideal representations of the objects in question. To recognize a noisy sample, the system compares it with every element in the dictionary ba...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on systems, man and cybernetics. Part B, Cybernetics Vol. 29; no. 6; pp. 881 - 888
Main Authors Oommen, B.J., Loke, R.K.S.
Format Journal Article
LanguageEnglish
Published United States IEEE 01.12.1999
Subjects
Online AccessGet full text

Cover

Loading…
Abstract We consider a fundamental inference problem in syntactic pattern recognition (PR). We assume that the system has a dictionary which is a collection of all the ideal representations of the objects in question. To recognize a noisy sample, the system compares it with every element in the dictionary based on a nearest-neighbor philosophy, using three standard edit operations: substitution, insertion, and deletion, and the associated primitive elementary edit distances d(.,.). In this paper, we consider the assignment of the inter-symbol distances using the parametric distances. We show how the classifier can be trained to get the optimal parametric distance using vector quantization in the meta-space. In all our experiments, the training was typically achieved in a very few iterations. The subsequent classification accuracy we obtained using this single-parameter scheme was 96.13%. The power of the scheme is evident if we compare it to 96.67%, which is the accuracy of the scheme which uses the complete array of inter-symbol distances derived from a knowledge of all the confusion probabilities.
AbstractList We consider a fundamental inference problem in syntactic pattern recognition (PR). We assume that the system has a dictionary which is a collection of all the ideal representations of the objects in question. To recognize a noisy sample, the system compares it with every element in the dictionary based on a nearest-neighbor philosophy, using three standard edit operations: substitution, insertion, and deletion, and the associated primitive elementary edit distances d(.,.). In this paper, we consider the assignment of the inter-symbol distances using the parametric distances. We show how the classifier can be trained to get the optimal parametric distance using vector quantization in the meta-space. In all our experiments, the training was typically achieved in a very few iterations. The subsequent classification accuracy we obtained using this single-parameter scheme was 96.13%. The power of the scheme is evident if we compare it to 96.67%, which is the accuracy of the scheme which uses the complete array of inter-symbol distances derived from a knowledge of all the confusion probabilities
We consider a fundamental inference problem in syntactic pattern recognition (PR). We assume that the system has a dictionary which is a collection of all the ideal representations of the objects in question. To recognize a noisy sample, the system compares it with every element in the dictionary based on a nearest-neighbor philosophy, using three standard edit operations: substitution, insertion, and deletion, and the associated primitive elementary edit distances d(.,.). In this paper, we consider the assignment of the inter-symbol distances using the parametric distances. We show how the classifier can be trained to get the optimal parametric distance using vector quantization in the meta-space. In all our experiments, the training was typically achieved in a very few iterations. The subsequent classification accuracy we obtained using this single-parameter scheme was 96.13%. The power of the scheme is evident if we compare it to 96.67%, which is the accuracy of the scheme which uses the complete array of inter-symbol distances derived from a knowledge of all the confusion probabilities.
Author Oommen, B.J.
Loke, R.K.S.
Author_xml – sequence: 1
  givenname: B.J.
  surname: Oommen
  fullname: Oommen, B.J.
  organization: Sch. of Comput. Sci., Carleton Univ., Ottawa, Ont., Canada
– sequence: 2
  givenname: R.K.S.
  surname: Loke
  fullname: Loke, R.K.S.
BackLink https://www.ncbi.nlm.nih.gov/pubmed/18252365$$D View this record in MEDLINE/PubMed
BookMark eNqN0UtLAzEQB_Agio_qyZsH2ZMKspr34yi-oeBFz0uanZVIm61JVqif3pQWvamXTCC_GZj899Bm6AMgdEjwBSHYXDKu1IXGBnO8gXaJ4aTG3NDNcsea1ZwTs4P2UnrDuCCjttEO0VRQJsUuam4g-dfgw2uVFiFbl72r5jZniKFyU5uS7zzEVA1paT7A5T5W74MN2X_a7PtQ2dCWjmhnkGNpTuUsElqfS91HW52dJjhY1xF6ubt9vn6ox0_3j9dX49pxRnKtpQBnKccgiRNGCq2lkZ1WIChtVUeY5hPBO9NhC61krnNCMMMmZXk70ZqN0Olq7jz27wOk3Mx8cjCd2gD9kBrFOBWYUFLkya-SaoNV-c5_QCok4-JvKA0WmqoCz36FRCrCmGJySc9X1MU-pQhdM49-ZuOiIbhZxt4sY29WsRd9vB48TGbQ_th1zgUcrYAHgO_ndfcXFUGxZw
CODEN ITSCFI
CitedBy_id crossref_primary_10_1007_s10044_006_0032_z
crossref_primary_10_1007_s10044_006_0036_8
crossref_primary_10_1109_TSMCB_2005_861860
Cites_doi 10.1109/TPAMI.1987.4767962
10.1109/5.58325
10.1007/3-540-60044-2_33
10.1145/321796.321811
10.1016/S0031-3203(97)00124-6
10.1016/0020-0255(81)90052-9
10.1142/S0218001489000152
10.1016/B978-0-444-88071-0.50010-2
10.1109/TSE.1983.237018
10.1016/0020-0255(94)00110-W
10.1145/321879.321880
10.1016/0022-0000(80)90002-1
10.1109/21.362950
10.1016/S0031-3203(96)00101-X
10.1142/9789812797919_0001
10.4159/harvard.9780674419193
10.1016/S0019-9958(85)80046-2
ContentType Journal Article
DBID RIA
RIE
NPM
AAYXX
CITATION
7SC
7SP
7TB
8FD
F28
FR3
JQ2
L7M
L~C
L~D
H8D
7X8
DOI 10.1109/3477.809040
DatabaseName IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE/IET Electronic Library (IEL)
PubMed
CrossRef
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Mechanical & Transportation Engineering Abstracts
Technology Research Database
ANTE: Abstracts in New Technology & Engineering
Engineering Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Aerospace Database
MEDLINE - Academic
DatabaseTitle PubMed
CrossRef
Technology Research Database
Computer and Information Systems Abstracts – Academic
Mechanical & Transportation Engineering Abstracts
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Engineering Research Database
Advanced Technologies Database with Aerospace
ANTE: Abstracts in New Technology & Engineering
Computer and Information Systems Abstracts Professional
Aerospace Database
MEDLINE - Academic
DatabaseTitleList Technology Research Database
PubMed
MEDLINE - Academic
Technology Research Database
Technology Research Database
Computer and Information Systems Abstracts

Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: RIE
  name: IEEE/IET Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Sciences (General)
Computer Science
Philosophy
EISSN 1941-0492
EndPage 888
ExternalDocumentID 10_1109_3477_809040
18252365
809040
Genre Journal Article
GroupedDBID ---
-~X
0R~
29I
4.4
53G
5VS
6IK
85S
97E
AAJGR
AASAJ
ABQJQ
ABVLG
ACGFS
AETIX
AI.
AIBXA
ALLEH
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
EBS
EJD
F5P
HZ~
IFIPE
IFJZH
IPLJI
JAVBF
LAI
M43
O9-
OCL
PZZ
RIA
RIE
RNS
RXW
TAE
TAF
VH1
VJK
XFK
NPM
AAYXX
CITATION
7SC
7SP
7TB
8FD
F28
FR3
JQ2
L7M
L~C
L~D
H8D
7X8
ID FETCH-LOGICAL-c431t-865eca240e61c596588696f87e522d7f1384b54f9f0aed63cfc55393b347ab883
IEDL.DBID RIE
ISSN 1083-4419
IngestDate Fri Aug 16 05:12:31 EDT 2024
Fri Aug 16 06:30:50 EDT 2024
Fri Aug 16 14:20:54 EDT 2024
Fri Aug 16 09:36:42 EDT 2024
Thu Aug 15 22:44:09 EDT 2024
Fri Aug 23 01:39:56 EDT 2024
Sat Sep 28 07:50:48 EDT 2024
Wed Jun 26 19:31:30 EDT 2024
IsPeerReviewed false
IsScholarly false
Issue 6
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c431t-865eca240e61c596588696f87e522d7f1384b54f9f0aed63cfc55393b347ab883
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
PMID 18252365
PQID 1671337367
PQPubID 23500
PageCount 8
ParticipantIDs proquest_miscellaneous_28256345
proquest_miscellaneous_1671337367
proquest_miscellaneous_734250121
proquest_miscellaneous_28907809
pubmed_primary_18252365
ieee_primary_809040
proquest_miscellaneous_26905827
crossref_primary_10_1109_3477_809040
PublicationCentury 1900
PublicationDate 1999-12-01
PublicationDateYYYYMMDD 1999-12-01
PublicationDate_xml – month: 12
  year: 1999
  text: 1999-12-01
  day: 01
PublicationDecade 1990
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle IEEE transactions on systems, man and cybernetics. Part B, Cybernetics
PublicationTitleAbbrev TSMCB
PublicationTitleAlternate IEEE Trans Syst Man Cybern B Cybern
PublicationYear 1999
Publisher IEEE
Publisher_xml – name: IEEE
References ref13
ref15
ref20
lowrance (ref12) 1975; 22
ref22
ref10
ref21
sankoff (ref19) 1983
ref1
ref17
ref16
oommen (ref14) 1995; 77
ref8
ristad (ref18) 1996
bafna (ref2) 1995
ref9
ref4
ref3
ref6
ref5
levenshtein (ref11) 1966; 10
duda (ref7) 1973
References_xml – year: 1996
  ident: ref18
  publication-title: Learning string edit distances
  contributor:
    fullname: ristad
– ident: ref17
  doi: 10.1109/TPAMI.1987.4767962
– start-page: 1
  year: 1995
  ident: ref2
  article-title: computer similarity between rna strings
  publication-title: Proc 6th Symp Combinatorial Pattern Matching CPM-95
  contributor:
    fullname: bafna
– year: 1973
  ident: ref7
  publication-title: Pattern Classification and Scene Analysis
  contributor:
    fullname: duda
– ident: ref10
  doi: 10.1109/5.58325
– ident: ref5
  doi: 10.1007/3-540-60044-2_33
– ident: ref22
  doi: 10.1145/321796.321811
– ident: ref15
  doi: 10.1016/S0031-3203(97)00124-6
– ident: ref8
  doi: 10.1016/0020-0255(81)90052-9
– ident: ref21
  doi: 10.1142/S0218001489000152
– ident: ref1
  doi: 10.1016/B978-0-444-88071-0.50010-2
– ident: ref9
  doi: 10.1109/TSE.1983.237018
– volume: 77
  start-page: 89
  year: 1995
  ident: ref14
  article-title: string alignment with substitution, insertion, deletion, squashing and expansion operations
  publication-title: Inf Sci
  doi: 10.1016/0020-0255(94)00110-W
  contributor:
    fullname: oommen
– volume: 22
  start-page: 177
  year: 1975
  ident: ref12
  article-title: an extension of the string to string correction problem
  publication-title: J Assoc Comput Mach
  doi: 10.1145/321879.321880
  contributor:
    fullname: lowrance
– year: 1983
  ident: ref19
  publication-title: Time Warps String Edits and Macromolecules The Theory and Practice of Sequence Comparison
  contributor:
    fullname: sankoff
– ident: ref13
  doi: 10.1016/0022-0000(80)90002-1
– ident: ref4
  doi: 10.1109/21.362950
– ident: ref16
  doi: 10.1016/S0031-3203(96)00101-X
– ident: ref3
  doi: 10.1142/9789812797919_0001
– ident: ref6
  doi: 10.4159/harvard.9780674419193
– ident: ref20
  doi: 10.1016/S0019-9958(85)80046-2
– volume: 10
  start-page: 707
  year: 1966
  ident: ref11
  article-title: binary codes capable of correcting deletions, insertions and reversals
  publication-title: Sov Phys Dokl
  contributor:
    fullname: levenshtein
SSID ssj0009097
Score 1.2994025
Snippet We consider a fundamental inference problem in syntactic pattern recognition (PR). We assume that the system has a dictionary which is a collection of all the...
SourceID proquest
crossref
pubmed
ieee
SourceType Aggregation Database
Index Database
Publisher
StartPage 881
SubjectTerms Arrays
Classifiers
Computer science
Costs
Councils
Cybernetics
Dictionaries
Inference
Insertion
Pattern recognition
Phase noise
Philosophy
Probability distribution
System testing
Vector quantization
Title Designing syntactic pattern classifiers using vector quantization and parametric string editing
URI https://ieeexplore.ieee.org/document/809040
https://www.ncbi.nlm.nih.gov/pubmed/18252365
https://search.proquest.com/docview/1671337367
https://search.proquest.com/docview/26905827
https://search.proquest.com/docview/28256345
https://search.proquest.com/docview/28907809
https://search.proquest.com/docview/734250121
Volume 29
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT9wwEB4BJziUsrzSFjASh3LI4sSxYx8RDyEkeioStyhx7KpCZKGbVIJf3xknWwpiUW-RNYlij535JjPzDcABT2qlee1j6as8ztDviSvEDTEiZe8tQmjvqBr56pu6uM4ub-TNwLMdamGccyH5zI3pMsTy64nt6FfZkeYG99wiLGqe9qVaz_y6vO-jgogiRgtvhlK8hJsjkeX5uL_1hfEJ3VTmA8tgYM5X-8rtaeAlpLyS23HXVmP79Iq18T_f_SN8GIAmO-53xhosuGYEq7MmDmw40yNY-YeRcARrw_iUfR0IqQ_XoTgNaR4owKaPTRvKqth9IOZsmCX4_dNTR21GSfQ_2O8QCGAPHSptqPJkZVMzYhm_owZellGzEJREw0lZ1xtwfX72_eQiHhozxBbxRhtrJZ0tEQs4lVhJ9DFaGeV17hDN1blPhM4qmXnjeelqJay3UgojKlRFWWktNmGpmTRuG1imy1TUQlrl0FfzQtfSpcLa1OrMKa0jOJhprLjv-TeK4LdwU5Bii35dIxjRkv8VmY3uz5Rb4KmhUEjZuEk3LRJFznkuVB7B3hyZVBkudfqeBLrXSmTyPQmDGIybCNgciVzgV5OI9SLY6rff8zzx8alQ8tObk_sMy4FCIqTWfIGl9lfndhAgtdVuOBp_AA5vDyg
link.rule.ids 315,786,790,802,27957,27958,55109
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT9wwEB4VeoAeSllKG_rASBzoIYs3jh37WBXQltcJJG5R4tgIoWZpN6nU_npmnGx5iEW9RdGXKPHYmW_imW8AtvmoUppXPpa-zOIU4564RN4QI1P23iKF9o6qkU9O1fg8PbyQF73OdqiFcc6F5DM3pMOwl19NbEu_ynY1NzjnFuAlunluumKtO4Vd3nVSQU4Ro483fTEeAndFmmXD7uIH7if0U5lPLYOLOVjparenQZmQMkuuh21TDu3fR7qN__n0b-B1TzXZ125urMILVw9gZdbGgfWregCv7mkSDmC1Pz9lO70k9Zc1yPdCogcC2PRP3YTCKnYTpDlrZomAX3nqqc0ojf6S_Q5bAexni2br6zxZUVeMdMZ_UAsvy6hdCCLRdVLe9Vs4P9g_-zaO-9YMsUXG0cRaSWcLZANOjawkARmtjPI6c8jnqsyPhE5LmXrjeeEqJay3UgojSjRFUWot1mGxntTuPbBUF4mohLTKYbTmha6kS4S1idWpU1pHsD2zWH7TKXDkIXLhJifD5t24RjCgIf8HmZ3dmhk3x3VDmyFF7SbtNB8pCs8zobIINudgEmW41MlzCAywlUjlcwiDLIybCNgcRCbwu0nSehG866bf3Xvi7ROh5MaTL7cJS-Ozk-P8-Pvp0QdYDoISIdHmIyw2v1r3CelSU34Oy-QWdPASfg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Designing+syntactic+pattern+classifiers+using+vector+quantization+and+parametric+string+editing&rft.jtitle=IEEE+transactions+on+systems%2C+man+and+cybernetics.+Part+B%2C+Cybernetics&rft.au=Oommen%2C+B+J&rft.au=Loke%2C+RKS&rft.date=1999-12-01&rft.issn=1083-4419&rft.volume=29&rft.issue=6&rft_id=info:doi/10.1109%2F3477.809040&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1083-4419&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1083-4419&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1083-4419&client=summon