Text feature extraction based on deep learning: a review

Selection of text feature item is a basic and important matter for text mining and information retrieval. Traditional methods of feature extraction require handcrafted features. To hand-design, an effective feature is a lengthy process, but aiming at new applications, deep learning enables to acquir...

Full description

Saved in:
Bibliographic Details
Published inEURASIP journal on wireless communications and networking Vol. 2017; no. 1; pp. 211 - 12
Main Authors Liang, Hong, Sun, Xiao, Sun, Yunlei, Gao, Yuan
Format Journal Article
LanguageEnglish
Published Cham Springer International Publishing 15.12.2017
Springer Nature B.V
SpringerOpen
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Selection of text feature item is a basic and important matter for text mining and information retrieval. Traditional methods of feature extraction require handcrafted features. To hand-design, an effective feature is a lengthy process, but aiming at new applications, deep learning enables to acquire new effective feature representation from training data. As a new feature extraction method, deep learning has made achievements in text mining. The major difference between deep learning and conventional methods is that deep learning automatically learns features from big data, instead of adopting handcrafted features, which mainly depends on priori knowledge of designers and is highly impossible to take the advantage of big data. Deep learning can automatically learn feature representation from big data, including millions of parameters. This thesis outlines the common methods used in text feature extraction first, and then expands frequently used deep learning methods in text feature extraction and its applications, and forecasts the application of deep learning in feature extraction.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
content type line 14
ObjectType-Literature Review-2
ObjectType-Feature-3
ObjectType-Feature-2
ObjectType-Review-3
content type line 23
ISSN:1687-1499
1687-1472
1687-1499
DOI:10.1186/s13638-017-0993-1