Sports news subject term extraction method based on BiLSTM-CRF

The invention discloses a sports news subject term extraction method based on BiLSTM-CRF, and the method comprises the following steps: obtaining sports news from a website as training data, and extracting a title and a text of the training data; extracting topic sentences from the text, and extract...

Full description

Saved in:
Bibliographic Details
Main Authors JIANG YIQI, ZHAO TONGZHOU
Format Patent
LanguageChinese
English
Published 28.02.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention discloses a sports news subject term extraction method based on BiLSTM-CRF, and the method comprises the following steps: obtaining sports news from a website as training data, and extracting a title and a text of the training data; extracting topic sentences from the text, and extracting topic sentences of the training data; establishing a training set and a test set, dividing partsof the obtained titles and topic sentences into the training set, and dividing the rest parts into the test set; establishing a BiLSTM-CRF model, training the BiLSTM-CRF model by taking titles and topic sentences in the training set as objects, and extracting topic words of training data in the training set to obtain an optimal prediction model; and extracting titles and topic sentences of the sports news of which the topic words need to be extracted, and substituting the titles and the topic sentences into the optimal prediction model to obtain the topic words of the sports news of which thetopic words need to be ex
Bibliography:Application Number: CN201910978573