Parallel mining of frequent patterns for school records analytics at the Universidad Michoacana

This paper presents research results on school record analytics, developed for Universidad Michoacana (UM-SNH), based on a parallel implementation of data mining techniques. Core elements of this research work were finding frequent patterns on academic records for all students of UMSNH from 2005 to...

Full description

Saved in:
Bibliographic Details
Published in2017 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC) pp. 1 - 6
Main Authors Flores, Juan J., Garcia-Nava, J. Luis, Castro-Coria, Monserrat A., Tellez, Victor M., Huerta, B. Emanuel, Espinosa-Romero, Josue, Calderon, Felix
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.11.2017
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This paper presents research results on school record analytics, developed for Universidad Michoacana (UM-SNH), based on a parallel implementation of data mining techniques. Core elements of this research work were finding frequent patterns on academic records for all students of UMSNH from 2005 to 2016, and searching for relevant frequent pattern subsets by using the distributed computing platform Spark. The FP-Growth algorithm used for finding frequent patterns is presented, as well as serial, concurrent, and parallel implementations of the mining process based on it. Experimental results are discussed on two different directions: (a) the superior performance achieved by parallel implementation when compared to serial and concurrent versions of the application, and (b) the advantages that mining at the frequent patterns level provides for information retrieval on this specific problem, when compared to mining at association rules or correlation statistics levels.
ISSN:2573-0770
DOI:10.1109/ROPEC.2017.8261636