Automatic Artificial Data Generator: Framework and implementation

Extracting unknown and possibly useful information from a set of examples that has desired features is crucial and important for data analysis and interpretation. Normally, a public repository has become the most used method in attempting to find a suitable domain. However, relying on the available...

Full description

Saved in:
Bibliographic Details
Published in2016 International Conference on Information and Communication Technology (ICICTM) pp. 56 - 60
Main Authors Syahaneim, Hazwani, Raja Asilah, Wahida, Nur, Shafikah, Siti Intan, Zuraini, Ellyza, Puteri Nor
Format Conference Proceeding
LanguageEnglish
Published IEEE 2016
Subjects
Online AccessGet full text
DOI10.1109/ICICTM.2016.7890777

Cover

More Information
Summary:Extracting unknown and possibly useful information from a set of examples that has desired features is crucial and important for data analysis and interpretation. Normally, a public repository has become the most used method in attempting to find a suitable domain. However, relying on the available data in the public repository has several disadvantages. In this case, an automatic problem generation system would be valuable to provide several advantages over the traditional methods. This paper focuses more on data extraction and artificial data generation. Here, a framework is proposed that consists of four main phases: 1) Data extraction, 2) Data characterization, 3) Artificial data generation and 4) Artificial data creation. The approach systematically creates testing datasets based on real data that is extracted from a reliable sources. The system uses random permutation algorithm to generate a large number of artificial data that resembles real data.
DOI:10.1109/ICICTM.2016.7890777