A Bayesian Technique for Estimating the Credibility of Question Answerers

We address the problem of ranking question answerers according to their credibility, characterized here by the probability that a given question answerer (user) will be awarded a best answer on a question given the answerer's question-answering history. This probability (represented by θ) is co...

Full description

Saved in:
Bibliographic Details
Published inSociety for Industrial and Applied Mathematics. Proceedings of the SIAM International Conference on Data Mining p. 399
Main Authors Dom, Byron, Paranjpe, Deepa
Format Conference Proceeding
LanguageEnglish
Published Philadelphia Society for Industrial and Applied Mathematics 01.01.2008
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We address the problem of ranking question answerers according to their credibility, characterized here by the probability that a given question answerer (user) will be awarded a best answer on a question given the answerer's question-answering history. This probability (represented by θ) is considered to be a hidden variable that can only be estimated statistically from specific observations associated with the user, namely the number b of best answers awarded, associated with the number n of questions answered. The more specific problem addressed is the potentially high degree of uncertainty associated with such credibility estimates when they are based on small numbers of answers. We address this problem by a kind of Bayesian smoothing. The credibility estimate will consist of a mixture of the overall population statistics and those of the specific user. The greater the number of questions asked, the greater will be the contribution of the specific user statistics relative to those of the overall population. We use the Predictive Stochastic Complexity (PSC) as an accuracy measure to evaluate several methods that can be used for the estimation. We compare our technique (Bayesian Smoothing (BS)) with maximum a priori (MAP) estimation, maximum likelihood (ML) estimation and Laplace smoothing. [PUBLICATION ABSTRACT]