Relaxing Pairwise Conditional Dependency of NB Attributes using Hidden Variables

Conditional independence assumption of naive Bayes classifier as one of its major drawbacks, for real-world applications, has been recently tackled by various structural learning approaches. However, finding the optimal structure is itself reported to be an NP-hard problem. Compliance of the devised...

Full description

Saved in:
Bibliographic Details
Published in2020 10th International Symposium onTelecommunications (IST) pp. 116 - 122
Main Authors Alizadeh, Sasan H., Harzevili, Nima Shiri, Hediehloo, Alireza, Zare, Hadi
Format Conference Proceeding
LanguageEnglish
Published IEEE 15.12.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Conditional independence assumption of naive Bayes classifier as one of its major drawbacks, for real-world applications, has been recently tackled by various structural learning approaches. However, finding the optimal structure is itself reported to be an NP-hard problem. Compliance of the devised structure with general rules and concepts of the literature of Bayesian networks is also another challenge facing many researchers. Eventually, the computational complication of structural learning has impelled many scholars to adopt approaches to confine the search space of possible structures. In this paper, we incorporate a set of hidden variables in a predefined Bayesian network structure, for proposing Pairwise Hidden Naive Bayes Classifier (PHNB). In PHNB, a hidden variable is allocated for each pair of features with conditional mutual information higher than a specific threshold in a diverging connection to preserve their dependency whenever the class variable is instantiated. The Expectation-Maximization (EM) algorithm is modified using a Bayesian correction approach so as to consider an infinitesimal possibility for unforeseen events while converging to a local optimum. Experiments on 6 UCI benchmark datasets reveal substantial results based on classification accuracy(ACC) and area under the ROC curve(AUC).
DOI:10.1109/IST50524.2020.9345894