Flexible Supervision System: A Fast Fault-Tolerance Strategy for Cloud Applications in Cloud-Edge Collaborative Environments

With the development of cloud-edge collaborative computing technology, more and more cloud applications are transferred to edge devices. Some cloud applications in relatively unstable edge scenarios put forward higher requirements for fault tolerance. Therefore, we design and implement a flexible su...

Full description

Saved in:
Bibliographic Details
Published inNetwork and Parallel Computing pp. 108 - 113
Main Authors Cai, Weilin, Chen, Heng, Zhuo, Zhimin, Wang, Ziheng, An, Ninggang
Format Book Chapter
LanguageEnglish
Published Cham Springer Nature Switzerland
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:With the development of cloud-edge collaborative computing technology, more and more cloud applications are transferred to edge devices. Some cloud applications in relatively unstable edge scenarios put forward higher requirements for fault tolerance. Therefore, we design and implement a flexible supervision system. The system provides a higher frequency of fault detection than existing cloud management platforms like Kubernetes. And It implements a more efficient checkpoint-restart fault handling scheme based on the distributed in-memory database. Meanwhile, we also consider minimizing the extra time costs caused by the fault-tolerance operations and saving cloud system resources including computing, storage, and network.
ISBN:9783031213946
3031213947
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-031-21395-3_10