Long text data recognition method and device, electronic equipment and storage medium

Bibliographic Details
Main Authors: LIN ZEXI, GUAN RUI, LI WEILIANG, LIU GENGCHENG
Format: Patent
Language: Chinese, English
Published: 22.06.2021
Summary: The invention discloses a long text data recognition method and device, electronic equipment, and a storage medium, and relates to the technical field of big data. The method comprises: acquiring long text data, wherein the long text data covers a plurality of topics; clustering the long text data based on a predetermined topic aggregation model to generate multiple pieces of topic module data; inputting the multiple pieces of topic module data into a pre-trained text recognition model to generate a labeling result for each piece of topic module data, wherein the labeling result identifies the topic of that data; and recognizing each piece of topic data in the long text data according to the labeling result of the corresponding topic module data, and entering each recognized piece of topic data into the corresponding text region. The method can improve the labeling accuracy of the text recognition model and thereby the accuracy of long text data recognition.
Bibliography: Application Number: CN202110217134
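
Illustrative sketch: the steps named in the summary (cluster the long text into topic modules, label each module, then place the labeled text in its region) can be pictured with the small Python example below. It is not the patented implementation. The record does not say which topic aggregation model or text recognition model is used, so this sketch substitutes a greedy Jaccard-similarity clustering and a keyword-overlap labeler. All names in it (TOPIC_KEYWORDS, STOPWORDS, cluster_sentences, label_module, recognize_long_text, SAMPLE_TEXT) and the sample topics are assumptions made for illustration.

```python
import re

# Hypothetical topic vocabularies, invented for this sketch only.
TOPIC_KEYWORDS = {
    "billing": {"invoice", "payment", "refund"},
    "delivery": {"courier", "package", "shipping"},
}
STOPWORDS = {"the", "a", "an", "for", "no", "had", "please", "to", "of", "and"}


def split_sentences(text):
    """Split text into sentences at '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def content_words(sentence):
    """Lowercased alphabetic tokens with stopwords removed."""
    return set(re.findall(r"[a-z]+", sentence.lower())) - STOPWORDS


def cluster_sentences(sentences, threshold=0.1):
    """Greedy one-pass clustering by Jaccard similarity.

    Toy stand-in (an assumption) for the 'predetermined topic aggregation model'.
    """
    clusters = []  # each cluster: {"vocab": set of words, "sentences": list}
    for sentence in sentences:
        words = content_words(sentence)
        best, best_sim = None, 0.0
        for cluster in clusters:
            union = words | cluster["vocab"]
            sim = len(words & cluster["vocab"]) / len(union) if union else 0.0
            if sim > best_sim:
                best, best_sim = cluster, sim
        if best is not None and best_sim >= threshold:
            best["sentences"].append(sentence)
            best["vocab"] |= words
        else:
            clusters.append({"vocab": set(words), "sentences": [sentence]})
    return [c["sentences"] for c in clusters]


def label_module(module):
    """Keyword-overlap labeler; toy stand-in for the pre-trained model."""
    words = set().union(*(content_words(s) for s in module))
    best = max(TOPIC_KEYWORDS, key=lambda t: len(words & TOPIC_KEYWORDS[t]))
    return best if words & TOPIC_KEYWORDS[best] else "unknown"


def recognize_long_text(text):
    """Cluster, label, and route each topic module to its text region."""
    regions = {}
    for module in cluster_sentences(split_sentences(text)):
        regions.setdefault(label_module(module), []).extend(module)
    return regions


SAMPLE_TEXT = (
    "The invoice shows a duplicate payment. "
    "Please issue a refund for the payment. "
    "The courier lost the package. "
    "The package had no shipping label."
)

for label, sentences in recognize_long_text(SAMPLE_TEXT).items():
    print(label, "->", sentences)
```

Clustering before labeling means the labeler sees a group of related sentences rather than one mixed document, which is the property the summary credits for the improved labeling accuracy. A real system would replace both stand-ins with trained models.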