TORQUE is the software used for scheduling compute jobs on our clusters.
The following is an example TORQUE script that runs an MPI program on 5 nodes. This script can be scheduled with the qsub command. See the man page for the qsub command for information on what options go into TORQUE scripts. Programs must use MPI in order to run in parallel on multiple nodes on the cluster.
The MCNPX software can be used to perform Monte Carlo physics simulations on our clusters. Because MCNPX is export controlled, in order to access this software you must provide proof of access and identification to the CoSE IT staff. Contact us at email@example.com for more information.
Make sure to move/remove any existing data files (the files that end with o, r, and m) from old jobs because these will break MCNPX. Then execute the following command to queue your jobs:
Run the qstat command to check your running jobs. To kill jobs that have hung or were started by mistake, run the qdel command followed by the ID of the job you want to kill.
MCNPX has a habit of getting stuck in a loop when a job breaks or wasn't started properly. If your job isn't updating its output files and appears to be frozen, please kill it so that it doesn't waste cluster resources.