Service reliability modeling and evaluation of active-active cloud data center based on the IT infrastructure

As cloud data center has caught the eye of the information-intensive society since its birth, it keeps on a flourishing development due to its advantages such as high availability and resource utilization, rapid elasticity and disaster recovery. However, as a complex system, it means the centralizat...

Full description

Saved in:
Bibliographic Details
Published inMicroelectronics and reliability Vol. 75; pp. 271 - 282
Main Authors Li, Xiaoyang, Liu, Yue, Kang, Rui, Xiao, Lianghua
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.08.2017
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:As cloud data center has caught the eye of the information-intensive society since its birth, it keeps on a flourishing development due to its advantages such as high availability and resource utilization, rapid elasticity and disaster recovery. However, as a complex system, it means the centralization of failures and risks, which exerts great influence on the service that cloud data center provides to customers. Thus, the service reliability of the cloud data center is always a key concern. In order to evaluate and further improve the service reliability of the cloud data center, modeling analysis is absolutely necessary but difficult to apply because of the complicated cloud control flows, massive-scale service sharing and complex real-world infrastructures. Focused on the active-active data center, which is a typical mode of the cloud data center, a methodology of the service reliability modeling and analysis based on IT infrastructure is proposed. In this methodology, the queuing theory and graph theory are used to formulate the service reliability model, but the Monte Carlo simulation is used for statistical evaluation. During the modeling, the operational process of the active-active data center is divided into two parts—the request stage and execution stage, which are modeled respectively. Then, the modeling and evaluation approaches are applied in a use case, which verifies the applicability and creditability of our approach. Meanwhile, by sensitivity analysis considering the variation of uncertain or key factors both internal and external, several parameters are identified as high-sensitive factors, which can enlighten service providers on service reliability improvement. •A service reliability model of active-active cloud DC on IT infrastructures is proposed.•Queuing theory and graph theory are used to model and Monte Carlo to simulate.•The failure causes are classified into 5 categories including 12 influence factors.•Each major influence factor is incorporated quantitatively in the proposed model.•The practical use case verifies the applicability of the modeling approach.
ISSN:0026-2714
1872-941X
DOI:10.1016/j.microrel.2017.03.009