A prediction-based dynamic replication strategy for data-intensive applications

•Intelligent Replica Manager – a new strategy for prediction-based dynamic replication is proposed.•Multi-criteria based replication algorithm is proposed for replica placement.•Modified apriori algorithm is designed for frequent datasets prediction.•Comparison with existing prediction-based dynamic...

Full description

Saved in:
Bibliographic Details
Published inComputers & electrical engineering Vol. 57; pp. 281 - 293
Main Authors Nagarajan, Vijaya, Maluk Mohamed, Mulk Abdul
Format Journal Article
LanguageEnglish
Published Amsterdam Elsevier Ltd 01.01.2017
Elsevier BV
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•Intelligent Replica Manager – a new strategy for prediction-based dynamic replication is proposed.•Multi-criteria based replication algorithm is proposed for replica placement.•Modified apriori algorithm is designed for frequent datasets prediction.•Comparison with existing prediction-based dynamic replication methods.•Comparison and testing of the proposed strategy with other strategies for different job size. Data-intensive applications produce huge amount of data sets which need to be analyzed among geographically distributed nodes in grid computing environment. Data replication is essential in this environment to reduce the data access latency and to improve the data availability across several grid sites. In this work, an Intelligent Replica Manager (IRM) is designed and incorporated in the middleware of the grid for scheduling data-intensive applications. IRM uses a Multi-criteria based replication algorithm which considers multiple parameters like storage capacity, bandwidth and communication cost of the neighboring sites before taking decisions for the selection and placement of replica. Additionally, future needs of the grid site are predicted in advance using modified apriori algorithm, which is an association rule based mining technique. This IRM based strategy reduces the data availability time, data access time and make span. The simulation results prove that the proposed strategy outperforms the existing strategies.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0045-7906
1879-0755
DOI:10.1016/j.compeleceng.2016.11.036