An information-theoretic perspective of tf–idf measures

This paper presents a mathematical definition of the “probability-weighted amount of information” (PWI), a measure of specificity of terms in documents that is based on an information-theoretic view of retrieval events. The proposed PWI is expressed as a product of the occurrence probabilities of te...

Full description

Saved in:
Bibliographic Details
Published inInformation processing & management Vol. 39; no. 1; pp. 45 - 65
Main Author Aizawa, Akiko
Format Journal Article
LanguageEnglish
Published Oxford Elsevier Ltd 2003
Elsevier Science
Elsevier Science Ltd
Subjects
Online AccessGet full text

Cover

Loading…