Word order weighting-based sentence similarity calculation method

The invention provides a word order weighting-based sentence similarity calculation method. The method comprises the following steps of: obtaining a corpus set A in a form of (Label1i, Sen1i), and carrying out training to obtain word vector models of all the words in the corpus set A; constructing a...

Full description

Saved in:

Bibliographic Details
Main Authors	WANG QINGCHEN, SHEN SHENGYU
Format	Patent
Language	Chinese English
Published	07.09.2018
Subjects	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The invention provides a word order weighting-based sentence similarity calculation method. The method comprises the following steps of: obtaining a corpus set A in a form of (Label1i, Sen1i), and carrying out training to obtain word vector models of all the words in the corpus set A; constructing a test corpus set B in a form of (Label2j, Sen2j), and obtaining word vector models of all the Sen2jwords in the corpus set B by adoption of an incremental training manner; obtaining sentence vectors SenVec1i and SenVec2j of statements SenVec1i and SenVec2j by adoption of word order weighting manneraccording to the word vector models obtained by the corpus set B; calculating a similarity between one Sen2j and each statement Sen1i one by one, comparing the Label1i corresponding to the statementSen1i with the highest similarity with the Label2j, if the comparison result is consistent, considering that the result is correct, and otherwise, storing (Sen1i, Sen2j) to a training corpus set C; and furthermore processing th
Bibliography:	Application Number: CN20181217211