[1] Heithem Abbes and Thouraya Louati. PastryGridCP: A Decentralized Rollback-Recovery Protocol for Desktop Grid Systems.
In Proceedings of the 13th International Conference on Algorithms and Architectures for Parallel Processing – Volume (LNCS 8285) Part I,
ICA3PP’13, pages 143–152, Dec 2013. (Springer International Publishing Switzerland).
Abstract: Desktop Grids are composed of several thousands of resources. They are characterized by high volatility of resources, due to
voluntary disconnections or failures. This could affect the proper termination of applications execution. PastryGrid is a decentralized
system which manages desktop grid resources and user applications over a fully decentralized P2P network. In this paper we present
PastryGridCP: our rollback-recovery protocol, which is based on checkpoints designed for the decentralized Desktop Grid system
PastryGrid. It provides fault tolerance for grid applications and ensures the termination of the execution of applications in a transparent
way to users. We have conducted out experimentations on 110 nodes of Grid’5000. Obtained results validate our protocol and improve
the performance of applications
Keywords:
Desktop Grid, fault tolerance, rollback-recovery, checkpoints, decentralization, Grid’5000
Lien de la conférence
À voir sur:
Bibtex:
@incollection{abbes2013pastrygridcp,
title={PastryGridCP: A Decentralized Rollback-Recovery Protocol for Desktop Grid Systems}, author={Abbes, Heithem and Louati, Thouraya}, booktitle={Algorithms and Architectures for Parallel Processing},
pages={143--152},
year={2013},
publisher={Springer}
}