Efficient Analysis of Repairable Computing Systems Subject to Scheduled Checkpointing
Mo, Yuchang, Xing, Liudong, Lin, Yi-Kuei, Guo, Wenzhong
Published in IEEE transactions on dependable and secure computing (01.01.2021)
Journal Article
Checkpointing Workflows for Fail-Stop Errors
Han, Li, Canon, Louis-Claude, Casanova, Henri, Robert, Yves, Vivien, Frederic
Published in IEEE transactions on computers (01.08.2018)
Journal Article
Scalable I/O aggregation for asynchronous multi-level checkpointing
Gossman, Mikaila J., Nicolae, Bogdan, Calhoun, Jon C.
Published in Future generation computer systems (01.11.2024)
Journal Article
Heterogeneous 1-out-of-N warm standby systems with online checkpointing
Levitin, Gregory, Xing, Liudong, Dai, Yuanshun
Published in Reliability engineering & system safety (01.01.2018)
Journal Article
Node failure resiliency for Uintah without checkpointing
Sahasrabudhe, Damodar, Berzins, Martin, Schmidt, John
Published in Concurrency and computation (25.10.2019)
Journal Article
Optimal checkpointing of fault tolerant systems subject to correlated failure
Jafary, Bentolhoda, Fiondella, Lance
Published in 2017 Annual Reliability and Maintainability Symposium (RAMS) (2017)
Conference Proceeding
Checkpointing Workflows for Fail-Stop Errors
Han, Li, Canon, Louis-Claude, Casanova, Henri, Robert, Yves, Vivien, Frederic
Published in Proceedings / IEEE International Conference on Cluster Computing (01.09.2017)
Conference Proceeding
Combining Checkpointing and Replication for Reliable Execution of Linear Workflows with Fail-Stop and Silent Errors
Benoit, Anne, Cavelan, Aurélien, Ciorba, Florina M., Le Fèvre, Valentin, Robert, Yves
Published in International Journal of Networking and Computing (2019)
Journal Article
FPGA Checkpointing for Scientific Computing
Bacardit, Marc Perello, Bautista-Gomez, Leonardo, Unsal, Osman
Published in Proceedings / IEEE International On-Line Testing Symposium (28.06.2021)
Conference Proceeding
SweeD: Likelihood-Based Detection of Selective Sweeps in Thousands of Genomes
Pavlidis, Pavlos, Živković, Daniel, Stamatakis, Alexandros, Alachiotis, Nikolaos
Published in Molecular biology and evolution (01.09.2013)
Journal Article
2PACA: Two phases algorithm of checkpointing for Ad hoc mobile networks
Benkaouha, Haroun, Mokdad, Lynda, Abdelli, Abdelkrim
Published in 2013 9th International Wireless Communications and Mobile Computing Conference (IWCMC) (01.07.2013)
Conference Proceeding
Checkpointing strategies for parallel jobs
Bougeret, Marin, Casanova, Henri, Rabie, Mikael, Robert, Yves, Vivien, Frédéric
Published in 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (12.11.2011)
Conference Proceeding
An optimal checkpointing-strategy for real-time control systems under transient faults
Kwak, Seong Woo, Choi, Byung Jae, Kim, Byung Kook
Published in IEEE transactions on reliability (01.09.2001)
Journal Article
Evaluating Multi-Level Checkpointing for Distributed Deep Neural Network Training
Anthony, Quentin, Dai, Donglai
Published in 2021 SC Workshops Supplementary Proceedings (SCWS) (01.11.2021)
Conference Proceeding