Long text data recognition method and device, electronic equipment and storage medium
The invention discloses a long text data recognition method and device, electronic equipment and a storage medium, and relates to the technical field of big data.The method comprises the steps of acquiring long text data, wherein the long text data comprises a plurality of themes; performing cluster...
Saved in:
Main Authors | , , , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
22.06.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The invention discloses a long text data recognition method and device, electronic equipment and a storage medium, and relates to the technical field of big data.The method comprises the steps of acquiring long text data, wherein the long text data comprises a plurality of themes; performing clustering processing on the long text data based on a predetermined topic aggregation model to generate multiple pieces of topic module data; the multiple pieces of theme module data are input into a pre-trained text recognition model, labeling results of the theme module data are generated, wherein the labeling results are used for identifying themes of the data; and recognizing each piece of theme data of the long text data according to the labeling result of each piece of theme model data, and inputting each piece of identified theme data into the corresponding text region part. Labeling accuracy of the text recognition model can be improved, so that the accuracy of long text data recognition can be improved.
本发明公开了一种 |
---|---|
Bibliography: | Application Number: CN202110217134 |