An Energy Efficient Fault Tolerance Technique Based on Load Balancing Algorithm for High-Performance Computing in Cloud Computing

Background and Objectives: Cloud Computing has brought a new dimension to the IT world. The technology of cloud computing allows employing a large number of Virtual Machines to run intensive applications. Each failure in running applications fails system operations. To solve the problem, it is requi...

Full description

Saved in:
Bibliographic Details
Published inJournal of electrical and computer engineering innovations (Online) Vol. 8; no. 2; pp. 169 - 182
Main Authors H. Jahanpour, H. Barati, A. Mehranzadeh
Format Journal Article
LanguageEnglish
Published Shahid Rajaee Teacher Training University 01.07.2020
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Background and Objectives: Cloud Computing has brought a new dimension to the IT world. The technology of cloud computing allows employing a large number of Virtual Machines to run intensive applications. Each failure in running applications fails system operations. To solve the problem, it is required to restart the systems.Methods: In this paper, to predict and avoid failure in HPC systems, a method of fault tolerance to High-Performance Computing systems (HPC) in the cloud is called Daemon-COA-MMT (DCM), has been proposed. In the proposed method, the Daemon Fault Tolerance technique has been enhanced, and COA-MMT has been utilized for load balancing. The method consists of four modules, which are used to determine the host state. When the system is in the alarm state, the current host may face failure. Then the most optimal host for migration is selected, and process-level migration is performed. The method causes decreased migration overheads, decreased system performance speed, optimal use of underutilized hosts instead of leasing new hosts, appropriate load balancing, equal use of hardware resources by all hosts, focusing on QoS and SLA, and the significant decrease of energy consumption.Results: The simulation results revealed that in terms of parameters, the proposed method declines average job makespan, average response time, and average task execution cost by 18.06%, 35.68%, and 24.6%, respectively. The proposed fault tolerance algorithm has improved energy consumption by 30% and decreased the HPC systems' failure rate.Conclusion: In this study, the Daemon Fault Tolerance technique has been enhanced, and COA-MMT has been utilized for load balancing in high performance computing in the cloud computing.
AbstractList Background and Objectives: Cloud Computing has brought a new dimension to the IT world. The technology of cloud computing allows employing a large number of Virtual Machines to run intensive applications. Each failure in running applications fails system operations. To solve the problem, it is required to restart the systems.Methods: In this paper, to predict and avoid failure in HPC systems, a method of fault tolerance to High-Performance Computing systems (HPC) in the cloud is called Daemon-COA-MMT (DCM), has been proposed. In the proposed method, the Daemon Fault Tolerance technique has been enhanced, and COA-MMT has been utilized for load balancing. The method consists of four modules, which are used to determine the host state. When the system is in the alarm state, the current host may face failure. Then the most optimal host for migration is selected, and process-level migration is performed. The method causes decreased migration overheads, decreased system performance speed, optimal use of underutilized hosts instead of leasing new hosts, appropriate load balancing, equal use of hardware resources by all hosts, focusing on QoS and SLA, and the significant decrease of energy consumption.Results: The simulation results revealed that in terms of parameters, the proposed method declines average job makespan, average response time, and average task execution cost by 18.06%, 35.68%, and 24.6%, respectively. The proposed fault tolerance algorithm has improved energy consumption by 30% and decreased the HPC systems' failure rate.Conclusion: In this study, the Daemon Fault Tolerance technique has been enhanced, and COA-MMT has been utilized for load balancing in high performance computing in the cloud computing.
Author A. Mehranzadeh
H. Barati
H. Jahanpour
Author_xml – sequence: 1
  fullname: H. Jahanpour
  organization: Department of Computer Engineering, Dezful Branch, Islamic Azad University, Dezful, Iran
– sequence: 2
  fullname: H. Barati
  organization: Department of Computer Engineering, Dezful Branch, Islamic Azad University, Dezful, Iran
– sequence: 3
  fullname: A. Mehranzadeh
  organization: Department of Computer Engineering, Dezful Branch, Islamic Azad University, Dezful, Iran
BookMark eNqtTz1OwzAYtVCRKNAbMPgCCf5J3GYsUaoiMTBkj1z7c-LIsYuTDB25OWmFxAWY3q-e9B7RygcPCL1QkjJGBH3tQYFNGWEk3TJapHxL79Ca8SxPOMmy1ZUzlvAiZw9oM449IYQVlBZCrNH33uPKQ2wvuDLGKgt-wgc5uwnXwUGUXgGuQXXefs2A3-QIGgePP4LUi3JLbn2L964N0U7dgE2I-GjbLvmEuPDhNlCG4TxP16L1uHRh1n_WM7o30o2w-cUn9H6o6vKY6CD75hztIOOlCdI2NyPEtpFxsspBo4gxVGhzWu5kpDjtcmGYAK5zwbMd0fw_t34ATFp0Mg
ContentType Journal Article
DBID DOA
DOI 10.22061/jecei.2020.7219.371
DatabaseName DOAJ Directory of Open Access Journals
DatabaseTitleList
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISSN 2345-3044
EndPage 182
ExternalDocumentID oai_doaj_org_article_c0ff16dfb291409b856f26e3d563480d
GroupedDBID ALMA_UNASSIGNED_HOLDINGS
GROUPED_DOAJ
OK1
ID FETCH-doaj_primary_oai_doaj_org_article_c0ff16dfb291409b856f26e3d563480d3
IEDL.DBID DOA
ISSN 2322-3952
IngestDate Thu Jul 04 20:58:23 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Language English
LinkModel DirectLink
MergedId FETCHMERGED-doaj_primary_oai_doaj_org_article_c0ff16dfb291409b856f26e3d563480d3
OpenAccessLink https://doaj.org/article/c0ff16dfb291409b856f26e3d563480d
ParticipantIDs doaj_primary_oai_doaj_org_article_c0ff16dfb291409b856f26e3d563480d
PublicationCentury 2000
PublicationDate 2020-07-01
PublicationDateYYYYMMDD 2020-07-01
PublicationDate_xml – month: 07
  year: 2020
  text: 2020-07-01
  day: 01
PublicationDecade 2020
PublicationTitle Journal of electrical and computer engineering innovations (Online)
PublicationYear 2020
Publisher Shahid Rajaee Teacher Training University
Publisher_xml – name: Shahid Rajaee Teacher Training University
SSID ssj0002911966
Score 4.3181977
Snippet Background and Objectives: Cloud Computing has brought a new dimension to the IT world. The technology of cloud computing allows employing a large number of...
SourceID doaj
SourceType Open Website
StartPage 169
SubjectTerms cloud computing
fault tolerance
high performance computing
load balancing
virtual machines
Title An Energy Efficient Fault Tolerance Technique Based on Load Balancing Algorithm for High-Performance Computing in Cloud Computing
URI https://doaj.org/article/c0ff16dfb291409b856f26e3d563480d
Volume 8
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrZ07T8MwEMct1AkGxFO8dQNr2sSJk3hsq1YFAWIoUrfIsS-lKDhVlX4Avjl-FLUbA4yxrDwuJ9-ddP_fEXKPFS0zQTFQKTcFCqMyyAUzvswjlCJDmQgrcH5-SSdvyeOMzXZGfdmeMI8H9obrybCqolRVJeWWzVTmLK1oirFiaZzkoXKnb8R2iil7BpvNxrX8ZDnbrs4Z9bo5Sk0A632gxIUpDmnYNRUQ78ZOQb9l9rvgMj4ih5usEPr-bY7JHuoTcrDDCjwlX30NIyfUg5HDPphoAWOxrluYNjXa-RgI0x8iKwxMdFLQaHhqhDJXteVq6Dn063mzWrTvn2CyVbBdHsHrVjsAfsiD3bjQMKybtdounZGH8Wg6nAT2C4qlx1QUFhztFow5i405i9_MGZ-Tjm40XhCwqto4UVLFSZYgD4XkoaSS5qWIqiyXl2Tw9-dd_cdNrsm-_Ze-i_aGdNrVGm9NrtCWd84tvgFrdcET
link.rule.ids 315,786,790,870,2115,27957,27958
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=An+Energy+Efficient+Fault+Tolerance+Technique+Based+on+Load+Balancing+Algorithm+for+High-Performance+Computing+in+Cloud+Computing&rft.jtitle=Journal+of+electrical+and+computer+engineering+innovations+%28Online%29&rft.au=H.+Jahanpour&rft.au=H.+Barati&rft.au=A.+Mehranzadeh&rft.date=2020-07-01&rft.pub=Shahid+Rajaee+Teacher+Training+University&rft.issn=2322-3952&rft.eissn=2345-3044&rft.volume=8&rft.issue=2&rft.spage=169&rft.epage=182&rft_id=info:doi/10.22061%2Fjecei.2020.7219.371&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_c0ff16dfb291409b856f26e3d563480d
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2322-3952&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2322-3952&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2322-3952&client=summon