Storage

Storage is a very important component of the High Performance Computing Cluster at Case Western Reserve University.  Without it, data cannot be stored or processed or distributed.  The storage also needs to support running thousands of jobs on the cluster concurrently.

There are 2 main storage systems directly mounted on the HPC:

HPC Storage

High Performance storage that is mounted on the HPC. The current capacity is 750TB and it has the general 7-day snapshot policy. HPC storage has several important filesystems that form the backbone of the cluster:

Research Storage

General-purpose storage that is mounted on the HPC and can be accessible from other campus locations as well. Research Storage capacity is currently 1.1PB and it is replicated to a duplicate site and has the general 7-day snapshot policy.

Additional Storage Options

Research Dedicated Storage

For research groups that require more than 100TB of storage, an inexpensive option is to acquire 2 storage servers (that replicate each other). Such storage servers would cost around $45k and can provide ~400TB of storage.

The RDS servers are mounted to login and data transfer servers, but not to the compute nodes.  Copying data for use in workflows is described at the following link: https://sites.google.com/a/case.edu/hpcc/data-transfer#h.nltp3jo2szbu

Research Archival Storage

If the data is no longer analyzed, it can be placed in a cold/archival storage, to be used rarely. For such cases, the Research Computing group provides Research Archival Storage service.

We use Ohio Supercomputing Project Storage/Tape system for the Archival service, and Globus Data Transfer tool to move the data into the Archive.  The Archive service would force for an encrypted transfer, making the transfer more secure. 

The data from both HPC Storage and Research Storage can be archived using Globus where users can manage their archival process mostly by themselves. 

Visit HPC Guide to Archival Storage for detail information.