Text clustering method and device, equipment and medium

The invention provides a text clustering method, device and equipment and a readable medium, and the method comprises the steps: building a vocabulary, and calculating a word vector of each vocabulary in the vocabulary; obtaining a text vector of each to-be-clustered text, forming a text vector set,...

Full description

Saved in:
Bibliographic Details
Main Author SU HAIMING
Format Patent
LanguageChinese
English
Published 11.08.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention provides a text clustering method, device and equipment and a readable medium, and the method comprises the steps: building a vocabulary, and calculating a word vector of each vocabulary in the vocabulary; obtaining a text vector of each to-be-clustered text, forming a text vector set, and calculating the distance between every two text vectors in the text vector set; randomly selecting a threshold number of text vectors from the text vector set as alternative center vectors, and dividing the text vectors into two classes by taking every two text vectors as a group and taking the group as center vectors in sequence in the alternative center vectors; and selecting the center vector with the maximum confusion degree in the group of center vectors with the minimum confusion degree in each division and the text vector of the corresponding classification, and repeatedly executing the previous step by using the selected text vector until a preset condition is met. By means of the scheme, efficient sem
Bibliography:Application Number: CN202310496099