Variable Selection to Determine Majors of Student using K-Nearest Neighbor and Naïve Bayes Classifier Algorithm

Bibliographic Details
Published in Journal of physics. Conference series Vol. 1363; no. 1; pp. 12057 - 12064
Main Authors Mustakim, Hastarimasuci, Reysa, Papilo, Petir, Zarkasih, Zaitun, Nazir, Alwis
Format Journal Article
Language English
Published Bristol IOP Publishing 01.11.2019
Summary: Placing students in an appropriate major in high school can help them improve their academic achievement. Many variables must be considered when determining a student's major: gender, interests, Intelligence Quotient (IQ), grades in four Junior High School (JHS) subjects, the average JHS grade, matriculation scores in four subjects, and the average matriculation score. Using this many variables in the selection has several weaknesses, including the complexity and inefficiency of the variables and the presence of some variables that are merely additional and make no significant contribution. This study aims to reduce the number of variables so that the selection becomes easier to analyze and apply. The reduction was carried out through experiments on combinations of predefined attributes. A total of ten combinations were tested using K-Nearest Neighbor (K-NN) and Naïve Bayes Classifier (NBC), and each was evaluated by confusion-matrix accuracy. The experimental results showed that the 9th and 10th experiments, which used the variables matriculation, interest, and IQ, produced the best accuracy: 96.77% with K-NN and 98.38% with NBC. Combining both algorithms yielded a maximum accuracy of 99.87% from those three variables. The new information extracted from this research is that only three variables are important for determining major placement in Senior High School, namely average matriculation score, interests, and IQ, followed by four supporting variables: the matriculation scores in Mathematics, Physics, English, and Economics.
ISSN: 1742-6588, 1742-6596
DOI: 10.1088/1742-6596/1363/1/012057
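
The procedure outlined in the summary (score several variable combinations with K-NN and NBC, then compare confusion-matrix accuracy) can be sketched roughly as follows. This is an illustration only, not the authors' implementation: the eight data rows are invented, the three candidate combinations are hypothetical stand-ins for the ten predefined ones (which the summary does not list), and the choice of k = 3, a Gaussian likelihood for the Naïve Bayes step, and leave-one-out evaluation are all assumptions.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, x, k=3):
    # Majority vote among the k training rows closest to x (Euclidean distance).
    nearest = sorted(zip(train_X, train_y), key=lambda p: math.dist(p[0], x))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def nb_predict(train_X, train_y, x):
    # Gaussian Naive Bayes (assumed variant): pick the class with the
    # highest log prior plus summed per-feature log likelihoods.
    best_label, best_score = None, -math.inf
    for label in sorted(set(train_y)):
        rows = [r for r, y in zip(train_X, train_y) if y == label]
        score = math.log(len(rows) / len(train_X))
        for j, value in enumerate(x):
            col = [r[j] for r in rows]
            mean = sum(col) / len(col)
            # Small constant keeps the variance non-zero for constant columns.
            var = sum((v - mean) ** 2 for v in col) / len(col) + 1e-6
            score += -0.5 * math.log(2 * math.pi * var) - (value - mean) ** 2 / (2 * var)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

def confusion_accuracy(actual, predicted):
    # Accuracy is the share of the confusion matrix that lies on its diagonal.
    matrix = Counter(zip(actual, predicted))
    correct = sum(n for (a, p), n in matrix.items() if a == p)
    return correct / len(actual)

def loo_accuracy(predict, X, y, columns):
    # Leave-one-out evaluation restricted to the chosen variable columns.
    predictions = []
    for i in range(len(X)):
        train_X = [[r[c] for c in columns] for j, r in enumerate(X) if j != i]
        train_y = [label for j, label in enumerate(y) if j != i]
        predictions.append(predict(train_X, train_y, [X[i][c] for c in columns]))
    return confusion_accuracy(y, predictions)

# Invented toy data. Columns: matriculation average, interest (1 = science,
# 0 = social), IQ, gender (0/1). Features are left unscaled for brevity.
X = [
    [85, 1, 120, 0], [88, 1, 115, 1], [90, 1, 125, 0], [82, 1, 118, 1],
    [65, 0, 100, 0], [70, 0, 105, 1], [68, 0, 98, 1], [72, 0, 102, 0],
]
y = ["science"] * 4 + ["social"] * 4

# Hypothetical combinations, not the ten used in the study.
combinations = {
    "all four variables": (0, 1, 2, 3),
    "matriculation + interest + IQ": (0, 1, 2),
    "gender only": (3,),
}

results = {}
for name, columns in combinations.items():
    for algo, predict in (("K-NN", knn_predict), ("NBC", nb_predict)):
        results[(name, algo)] = loo_accuracy(predict, X, y, columns)
        print(f"{name:32s} {algo:5s} accuracy = {results[(name, algo)]:.2%}")
```

The numbers printed for this toy data say nothing about the accuracies reported in the summary; the sketch only shows the shape of the comparison, in which a smaller combination is preferred when its accuracy matches or exceeds that of the full variable set.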