USING AUTOENCODERS FOR TRAINING NATURAL LANGUAGE TEXT CLASSIFIERS

Systems and methods for using autoencoders for training natural language classifiers. An example method comprises: producing, by a computer system, a plurality of feature vectors, wherein each feature vector represents a natural language text of a text corpus, wherein the text corpus comprises a fir...

Full description

Saved in:
Bibliographic Details
Main Authors Anisimovich, Konstantin Vladimirovich, Ivashnev, Ivan Ivanovich, Indenbom, Evgenii Mikhailovich
Format Patent
LanguageEnglish
Published 13.06.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Systems and methods for using autoencoders for training natural language classifiers. An example method comprises: producing, by a computer system, a plurality of feature vectors, wherein each feature vector represents a natural language text of a text corpus, wherein the text corpus comprises a first plurality of annotated natural language texts and a second plurality of un-annotated natural language texts; training, using the plurality of feature vectors, an autoencoder represented by an artificial neural network; producing, by the autoencoder, an output of the hidden layer, by processing a training data set comprising the first plurality of annotated natural language texts; and training, using the training data set, a text classifier that accepts an input vector comprising the output of the hidden layer and yields a degree of association, with a certain text category, of a natural language text utilized to produce the output of the hidden layer.
Bibliography:Application Number: US201715852418