Sign language recognition using a combination of new vision based features

► We present a combination of vision based features to enhance the recognition of ASL. ► Two features are newly introduced: kurtosis position and PCA as an image descriptor. ► Kurtosis position gave smallest error rate when the number of words is small. ► PCA is the most reliable feature when the nu...

Full description

Saved in:
Bibliographic Details
Published inPattern recognition letters Vol. 32; no. 4; pp. 572 - 577
Main Authors Zaki, Mahmoud M., Shaheen, Samir I.
Format Journal Article
LanguageEnglish
Published Amsterdam Elsevier B.V 01.03.2011
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:► We present a combination of vision based features to enhance the recognition of ASL. ► Two features are newly introduced: kurtosis position and PCA as an image descriptor. ► Kurtosis position gave smallest error rate when the number of words is small. ► PCA is the most reliable feature when the number of words increases. ► Using a voting based combiner lead to additional reduction in the error rate. Sign languages are based on four components hand shape, place of articulation, hand orientation, and movement. This paper presents a novel combination of vision based features in order to enhance the recognition of underlying signs. Three features are selected to be mapped to these four components. Two of these features are newly introduced for American sign language recognition: kurtosis position and principal component analysis, PCA. Although PCA has been used before in sign a language as a dimensionality reduction technique, it is used here as a descriptor that represents a global image feature to provide a measure for hand configuration and hand orientation. Kurtosis position is used as a local feature for measuring edges and reflecting the place of articulation recognition. The third feature is motion chain code that represents the hand movement. On the basis of these features a prototype is designed, constructed and its performance is evaluated. It consists of skin color detector, connected component locator and dominant hand tracker, feature extractor and a Hidden Markov Model classifier. The input to the system is a sign from RWTH-BOSTON-50 database and the output is the corresponding word with a recognition error rate of 10.90%.
ISSN:0167-8655
1872-7344
DOI:10.1016/j.patrec.2010.11.013