Improving Data Availability through Dynamic Model-Driven Replication in Large Peer-to-Peer Communities

Efficient data sharing in global peer-to-peer systems is complicated by erratic node failure, unreliable network connectivity and limited bandwidth. Replicating data on multiple nodes can improve availability and response time. Yet determining when and where to replicate data in order to meet perfor...

Full description

Saved in:

Bibliographic Details
Published in	Cluster Computing and the Grid (CCGrid 2002): 2nd IEEE International Symposium p. 376
Main Authors	Ranganathan, K., Iamnitchi, A., Foster, I.
Format	Conference Proceeding
Language	English
Published	IEEE 2002
Subjects	Aggregates Availability Bandwidth Computer science Costs Delay Microcomputers Peer to peer computing Personal communication networks Power system reliability
Online Access	Get full text
ISBN	9780769515823 0769515827
DOI	10.1109/CCGRID.2002.1017164

Cover

More Information
Summary:	Efficient data sharing in global peer-to-peer systems is complicated by erratic node failure, unreliable network connectivity and limited bandwidth. Replicating data on multiple nodes can improve availability and response time. Yet determining when and where to replicate data in order to meet performance goals in large-scale systems with many users and files, dynamic network characteristics, and changing user behavior is difficult. We propose an approach in which peers create replicas automatically in a decentralized fashion, as required to meet availability goals. The aim of our framework is to maintain a threshold level of availability at all times. We identify a set of factors that hinder data availability and propose a model that decides when more replication is necessary. We evaluate the accuracy and performance of the proposed model using simulations. Our preliminary results show that the model is effective in predicting the required number of replicas in the system.
ISBN:	9780769515823 0769515827
DOI:	10.1109/CCGRID.2002.1017164