A Novel Neural Network-Based Method for Medical Text Classification

Bibliographic Details
Published in	Future internet Vol. 11; no. 12; p. 255
Main Authors	Qing, Li; Linhong, Weng; Xuehai, Ding
Format	Journal Article
Language	English
Published	Basel: MDPI AG, 01.12.2019
Subjects	Chinese medicine; Classification; Datasets; Deep learning; Domains; Electronic health records; Feature extraction; Information resources; Internet; Machine learning; Medical records; Medical research; Natural language processing; Neural networks; Representations; Researchers; Sparsity; Text categorization
Summary:	Medical text classification is a special case of text classification. Medical text includes medical records and medical literature, both of which are important clinical information resources. However, medical text contains complex medical vocabulary and medical measures, which lead to high dimensionality and data sparsity, so text classification in the medical domain is more challenging than in general domains. To address these problems, this paper proposes a unified neural network method. For the sentence representation, a convolutional layer extracts features from the sentence, and a bidirectional gated recurrent unit (BIGRU) captures both the preceding and succeeding sentence features. An attention mechanism then produces a sentence representation weighted by word importance. For the document representation, the method uses a BIGRU to encode the sentence representations and then applies an attention mechanism to obtain a document representation weighted by sentence importance. Finally, a classifier assigns a category to the medical text. Experiments are conducted on four medical text datasets: two medical record datasets and two medical literature datasets. The results show that the method is effective.
ISSN:	1999-5903
DOI:	10.3390/fi11120255
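
Illustrative note: the summary describes attention applied at two levels, first over words to form a sentence representation and then over sentences to form a document representation. The sketch below shows only that two-level pooling idea in plain Python. It is not the authors' implementation: the convolutional layer and the BIGRU are omitted (the input vectors stand in for their outputs), the dot-product scoring is an assumed simplification, and the names `attention_pool`, `encode_document`, `word_context`, and `sentence_context` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(vectors, context):
    """Return the attention-weighted sum of `vectors` and the weights.

    Assumption: each score is the dot product of a vector with a
    context vector (learned in a real model, fixed here).
    """
    scores = [sum(x * c for x, c in zip(v, context)) for v in vectors]
    weights = softmax(scores)
    dim = len(vectors[0])
    pooled = [sum(w * v[d] for w, v in zip(weights, vectors)) for d in range(dim)]
    return pooled, weights


def encode_document(document, word_context, sentence_context):
    """Pool words into sentence vectors, then sentences into a document vector.

    `document` is a list of sentences; each sentence is a list of word
    vectors standing in for encoder outputs (hypothetical input format).
    """
    sentence_vectors = []
    for sentence in document:
        sentence_vector, _ = attention_pool(sentence, word_context)
        sentence_vectors.append(sentence_vector)
    return attention_pool(sentence_vectors, sentence_context)


# Two sentences of 2-dimensional word vectors (made-up values).
doc = [[[1.0, 0.0], [0.0, 1.0]], [[1.0, 1.0]]]
doc_vector, sentence_weights = encode_document(doc, [0.0, 0.0], [0.0, 0.0])
```

With all-zero context vectors every score is equal, so the weights are uniform; a trained model would instead learn context vectors that give larger weights to informative words and sentences. The resulting document vector would then be passed to a classifier.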