Learning element weighting for similarity measures

Described is a technology for measuring the similarity between two objects (e.g., documents), via a framework that learns the term-weighting function from training data, e.g., labeled pairs of objects, to develop a learned model. A learning procedure tunes the model parameters by minimizing a define...

Full description

Saved in:
Bibliographic Details
Main Authors HAJISHIRZI HANNANEH, YIH WEN-TAU, MEEK CHRISTOPHER A
Format Patent
LanguageEnglish
Published 10.11.2015
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Described is a technology for measuring the similarity between two objects (e.g., documents), via a framework that learns the term-weighting function from training data, e.g., labeled pairs of objects, to develop a learned model. A learning procedure tunes the model parameters by minimizing a defined loss function of the similarity score. Also described is using the learning procedure and learned model to detect near duplicate documents.
Bibliography:Application Number: US20100715417