An information-theoretic perspective of tf–idf measures
This paper presents a mathematical definition of the “probability-weighted amount of information” (PWI), a measure of specificity of terms in documents that is based on an information-theoretic view of retrieval events. The proposed PWI is expressed as a product of the occurrence probabilities of te...
Published in | Information Processing & Management, vol. 39, no. 1, pp. 45–65 |
---|---|
Format | Journal Article |
Language | English |
Published | Oxford; Elsevier Ltd; 2003; Elsevier Science; Elsevier Science Ltd |