A System for Parallel Data Mining Service on Cloud
Published in | 2012 Second International Conference on Cloud and Green Computing, pp. 329-330 |
---|---|
Main Authors | , , |
Format | Conference Proceeding |
Language | English |
Published | IEEE, 01.11.2012 |
Subjects | |
Summary: | We present a cloud-based data mining platform that demonstrates data mining as a service (DMaaS). In the backend, the data processing engine is based on Hadoop, an open-source implementation of Google MapReduce. The data mining algorithms implemented in Apache Mahout are deployed on the platform. Users can access DMaaS from a web browser to analyze general-purpose data mining problems. In this paper, we give an overview of DMaaS, present the system architecture and implementation techniques, and elaborate on a demonstration scenario. |
---|---|
ISBN: | 1467330272 9781467330275 |
DOI: | 10.1109/CGC.2012.49 |