Method and system for extracting unstructured bid invitation requirement text based on clustering

The invention provides a clustering-based unstructured bid invitation requirement text extraction method and system, and the method comprises the steps: basic text processing: carrying out the word segmentation, text embedding and dimension reduction operation of an unstructured text complete set, a...

Full description

Saved in:
Bibliographic Details
Main Authors LI YANBEI, YAO ZEKUN, ZHU JUN, YAN CHENGUANG, SUN ZHIQIANG, SHEN DAFENG, DAI ZHIXIN, XIA JINGXIANG
Format Patent
LanguageChinese
English
Published 04.07.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention provides a clustering-based unstructured bid invitation requirement text extraction method and system, and the method comprises the steps: basic text processing: carrying out the word segmentation, text embedding and dimension reduction operation of an unstructured text complete set, and obtaining a low-dimensional feature vector of each text complete set; text clustering: clustering the text complete set according to the low-dimensional feature vector; extracting rules, sampling M sub-data sets of which the sample size is about n from the text complete set according to a classification result, and assigning one of the sub-data sets as a training set and the rest as test sets; annotation and algorithm iteration are carried out, and all training set data and test set data are annotated; performing algorithm inspection, and sampling an inspection set from the text complete set; according to the method, an unsupervised algorithm is used for preprocessing, and a large amount of early-stage data anno
Bibliography:Application Number: CN202310224764