Method and system for extracting unstructured bid invitation requirement text based on clustering
The invention provides a clustering-based unstructured bid invitation requirement text extraction method and system, and the method comprises the steps: basic text processing: carrying out the word segmentation, text embedding and dimension reduction operation of an unstructured text complete set, a...
Saved in:
Main Authors | , , , , , , , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
04.07.2023
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The invention provides a clustering-based unstructured bid invitation requirement text extraction method and system, and the method comprises the steps: basic text processing: carrying out the word segmentation, text embedding and dimension reduction operation of an unstructured text complete set, and obtaining a low-dimensional feature vector of each text complete set; text clustering: clustering the text complete set according to the low-dimensional feature vector; extracting rules, sampling M sub-data sets of which the sample size is about n from the text complete set according to a classification result, and assigning one of the sub-data sets as a training set and the rest as test sets; annotation and algorithm iteration are carried out, and all training set data and test set data are annotated; performing algorithm inspection, and sampling an inspection set from the text complete set; according to the method, an unsupervised algorithm is used for preprocessing, and a large amount of early-stage data anno |
---|---|
Bibliography: | Application Number: CN202310224764 |