GVR (Global View Resilience) is a user-level library that enables portable, efficient, application-controlled resilience. The primary target of GVR is HPC applications that require extreme scalability and performance as well as resilience. GVR's key approaches include independent versioning of application arrays, efficient partial or whole restoration, and open resilience, which maximizes the number of errors that can be handled and so minimizes fail-stop occurrences. Application knowledge can be exploited to control overhead, maximize error coverage, and maximize the number of recoverable errors.
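As a rough illustration of independent versioning and partial restoration, the following toy sketch in Python keeps a separate snapshot history for each array and restores only a damaged range. It is not the GVR API: the class VersionedArray and its methods commit_version and restore are hypothetical names invented for this example, and real GVR arrays are distributed rather than in-memory lists.

```python
# Toy model only: every name here is hypothetical and is NOT part of the GVR API.
class VersionedArray:
    """An array that keeps its own history of snapshots."""

    def __init__(self, size, fill=0.0):
        self.data = [fill] * size
        self._versions = []  # immutable snapshots, oldest first

    def commit_version(self):
        """Snapshot the current contents and return the new version number."""
        self._versions.append(tuple(self.data))
        return len(self._versions) - 1

    def restore(self, version, start=0, stop=None):
        """Copy elements [start, stop) of a snapshot back into the array."""
        snapshot = self._versions[version]
        stop = len(snapshot) if stop is None else stop
        self.data[start:stop] = snapshot[start:stop]


def demo():
    # Two arrays versioned at different rates, independently of each other.
    pressure = VersionedArray(4)
    mesh = VersionedArray(4)
    mesh.commit_version()  # the mesh changes rarely: one version is enough
    for step in range(3):
        pressure.data = [float(step)] * 4
        pressure.commit_version()  # pressure is versioned every step

    # Simulate an error confined to one element, then repair only that element.
    pressure.data[1] = float("nan")
    pressure.restore(version=2, start=1, stop=2)
    return pressure.data, len(pressure._versions), len(mesh._versions)
```

In this sketch the application decides how often each array is versioned and how much of it to restore, which is the kind of application knowledge the paragraph above refers to.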
The latest release, GVR 1.0.0, was made available under a BSD license on Oct. 4, 2014, and includes the following features:
The documentation is available as Global View Resilience (GVR) Documentation, Release 1.0, University of Chicago, Computer Science Technical Report 2014-10.
More information about the GVR project is available at http://gvr.cs.uchicago.edu/
GVR has been developed by the University of Chicago and Argonne National Laboratory, under the leadership of Prof. Andrew A. Chien and Dr. Pavan Balaji. It has been supported by the U.S. Department of Energy, Office of Science / ASCR under awards DE-SC0008603/57K68-00-145.
If you are a new user, you must provide your name, organization, and organization-level e-mail address to download the GVR library.