Open OnDemand (OOD or OoD) is a service providing access to HPC resources through a web browser. Manage your files, check the cluster job status, submit and monitor your own jobs, obtain a shell session, or choose from a set of "interactive" applications -- each provided through a browser tab.
Access the OnDemand server using your Case Single Sign-On (SSO) credentials. No campus network or VPN connection is required.
Use ondemand-pioneer.case.edu to access the Pioneer cluster.
Alternatively, use ondemand.case.edu to access the Markov cluster.
OnDemand needs to write files to disk to record the resources requested for each app session. These files are stored under /home/<caseid>/ondemand/data/sys/dashboard/batch_connect/.
Write access may be denied if the group's storage usage reaches the group hard quota. Launching an interactive app then requires either resolving the disk usage or the following workaround: log out of OnDemand and use AMARA to change your primary Linux group. Then start a new OnDemand session (in a private window, or after deleting any cookies) and request the interactive session resources.
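A quick way to tell whether a failed launch is caused by a write problem is to try writing a small file in that directory from a shell. The sketch below assumes the default location given above; check_ood_writable is a hypothetical helper name, not part of OnDemand, and a successful probe is only a rough indicator because a quota can still be hit by larger writes.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of OnDemand): probe whether the directory that
# OnDemand uses for session data accepts a small write.
check_ood_writable() {
    local dir="${1:-$HOME/ondemand/data/sys/dashboard/batch_connect}"
    local probe

    if [ ! -d "$dir" ]; then
        echo "missing: $dir"
        return 2
    fi

    # Create a temporary file inside the directory; failure here already
    # means the directory cannot be written to.
    probe=$(mktemp "$dir/.ood_write_test.XXXXXX" 2>/dev/null) || {
        echo "not writable: $dir"
        return 1
    }

    # Write a few bytes, then clean up the probe file either way.
    if printf 'probe\n' > "$probe" 2>/dev/null; then
        rm -f "$probe"
        echo "writable: $dir"
        return 0
    fi
    rm -f "$probe"
    echo "not writable: $dir"
    return 1
}
```

Calling check_ood_writable with no argument tests the default path under your home directory; pass another directory as the first argument to test it instead.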
The Shell Access (via the 'cluster' pulldown) doesn't support visual access. Use Desktop under Interactive Apps for visual access. Details on visual access are available at HPC Guide to Graphical Access.
The fields under Desktop must be filled in appropriately, or you will get an error. For example, if the "Node Type (partition)" field is set to GPU, then "Number of GPU" and "CPU or GPU Node Type" must also be filled in appropriately.
We have observed that commands with progress output (like a progress bar or a percent progress) can sometimes run slower in the OnDemand terminal than in a regular terminal (see the User Quick Start Guide for details).
The OOD Dashboard is the landing page:
Most major browsers support all features of OnDemand; however, the Safari browser does not implement all features, due to the authentication method used in the CWRU implementation of OOD. In Google Chrome, the Markov shell may produce truncated output (e.g. compiler errors may not display properly), and some commands (e.g. systemctl) may not work as expected.
A video overview of features is available.
The dropdown menus provide:
Files: Home Directory, with view, edit, upload and other capabilities.
Jobs: Choose Active Jobs to review status, or Job Composer to prepare job submissions.
Clusters: A bash shell on the selected cluster
Interactive Apps: These sessions run on compute nodes, scheduled through the Slurm workload manager. Applications include Jupyter/Tensorflow, RStudio, a desktop session and CodeServer. Check the cluster for up-to-date application availability.
The apps organized under Files, Jobs and Clusters access headnode resources and personal account data. These apps help you manage and conduct your work.
To see the current set of interactive apps, use the 'Interactive Apps' pulldown menu. These apps require computational resources both for their graphics environment and to run the underlying app, so they run on compute nodes obtained through the Slurm scheduler. Clicking an item in the pulldown menu presents a form through which compute resources are specified. Some applications, particularly on the Pioneer cluster, are modified to allow customization of the environment. Examples are presented below.
RStudio Server: within a batch job on a compute node.
Slurm Account
Node Type (slurm partition)
Duration -- up to 60 hours
Cores -- up to 16
GPU -- applies only for gpu node type
Memory -- from 4gb to 64gb
There is an alternative approach when starting the RStudio session from the OOD 'connect' icon fails.
Open a new tab.
Create a url with the format https://ondemand-pioneer.case.edu/rnode/<compute-node>/<port>
Obtain the <port> from the file connection.yml, accessible through the 'session id' link of the rstudio session information
The <compute-node> is visible in the 'launch' window.
Once the url is complete in the appropriate format, navigate to the url
Once RStudio loads, select 'Global options' from the 'Tools' menu.
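The url construction described above can be sketched in a few lines of shell. This is a minimal sketch that assumes connection.yml stores the port as a flat "port: <number>" line; read_port and rnode_url are hypothetical helper names, and the node name in the usage note is a placeholder.

```shell
#!/usr/bin/env bash
# Hypothetical helpers for building the direct RStudio url by hand.

# read_port FILE
# Print the value of the "port:" key from a connection.yml file.
# Assumption: the file uses a flat "key: value" layout.
read_port() {
    sed -n 's/^port:[[:space:]]*//p' "$1" | head -n 1 | tr -d "\"' "
}

# rnode_url NODE PORT
# Print the url in the format given in the steps above.
rnode_url() {
    local node="$1" port="$2"
    printf 'https://ondemand-pioneer.case.edu/rnode/%s/%s\n' "$node" "$port"
}
```

For example, rnode_url <compute-node> "$(read_port connection.yml)" prints the url to paste into the new tab, where <compute-node> is the node name shown in the 'launch' window.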
Desktop Sessions: general purpose HPC graphical interface
Account
Desktop application
xfce4
mate
Duration -- up to 60 hours
Cores -- up to 16
Memory -- from 4gb to 64gb
To stop the Xfce desktop from locking the screen, log in to the desktop and find the "Applications" menu in the upper left corner. Navigate to: Applications->Settings->Screensaver. Ensure that both "Enable Screensaver" and "Enable Lock Screen" are set to OFF.
When the "Launch Desktop" button is pressed in Interactive Desktop, the following error appears: "sbatch: error: Batch job submission failed: Requested node configuration is not available"
Solution: Fill in the resource fields appropriately.
Host Not Specified:
Access to the portal fails with a message such as: Host "xxx.case.edu" not specified in allowlist or cluster configs, or with a proxy error.
Solution: Select "restart webserver" from the dropdown under Help (?). Log out of the portal and try opening the portal again.
Proxy Error:
If you encounter errors like "The proxy server received an invalid response from an upstream server. The proxy server could not handle the request" when accessing Open OnDemand, you need to identify the unresponsive processes from the log file and kill them (e.g. a request for a file may have left a process unresponsive).
When the "Launch Desktop" button is pressed in Interactive Desktop, the NoVNC screen appears with "Failed to Connect to Server"
Solution: Your ~/.bashrc or ~/.bash_profile file may have been modified during software installation (for example, by answering yes to the conda init option during Anaconda/Miniconda installation, or by setting environment variables for the installed software). Undo those changes and launch the session again.
An old Gaussian config file can also cause issues, so you can try deleting the older config file:
rm -f ~/.config/gaussian.com/GaussView.conf
If you don't know how to undo the updates, please contact hpc-supportATcase.edu.
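To see whether conda init has touched your startup files, you can search them for the marker comments that conda init writes around its block. The sketch below only reports matches and changes nothing; find_conda_init is a hypothetical helper name, and other kinds of edits (such as exported environment variables) still need to be reviewed by hand.

```shell
#!/usr/bin/env bash
# Hypothetical helper: report the line numbers of the block that
# "conda init" adds to shell startup files. Nothing is modified.
find_conda_init() {
    local file
    for file in "$@"; do
        # Skip startup files that do not exist.
        [ -f "$file" ] || continue
        # conda init wraps its block in these two marker comments.
        grep -n -H \
            -e '>>> conda initialize >>>' \
            -e '<<< conda initialize <<<' \
            "$file" || true
    done
    return 0
}
```

Running find_conda_init ~/.bashrc ~/.bash_profile prints one line per marker in the form file:line:text. The lines between the two markers form the block that conda init added.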
Explore the interface, and let us know whether it supports your needs.
More detailed documentation will be developed, particularly for using the Job Composer capability.
email: hpc-supportATcase.edu
You get a disk quota violation.
Solution: Check for a group quota violation (HPC Guide to Storage and Quota) and ask your group members to reduce their disk usage. If you belong to multiple groups, switch to another group via the Amara Portal (https://amara.case.edu/profile) and wait a couple of hours.
Connection Issue Related to Duo Security and two factor authentication:
Solution: Contact CWRU HelpDesk.
Problem: If you encounter a Firefox issue, start a new session. If the same problem persists, the steps below may resolve it.
Solutions:
For the Firefox error, run this script, which cleans up lock files left behind after a Firefox crash:
/mnt/vstor/UGEN_RCCI/STM/cleanFirefox.sh
For the endpoint not connected error, run the following command to unmount the stale .gvfs mount so that it can be remounted:
fusermount -u .gvfs
An OnDemand Desktop session starts but completes without presenting the Launch button.
Solution: Try to reconnect a few times. If the issue persists, identify the cause by checking the log file:
List the app's output directory and find the <session-ID> for your job by checking the time stamp:
ll -t ~/ondemand/data/sys/dashboard/batch_connect/sys/<apps e.g. Jupyter-Tensorflow>/output/
output:
drwxr-xr-x 2 <caseID> hpcadmin 4096 Sep 21 18:19 <session-ID e.g. ae477524-3bd1-4d6f-99b3-dd4a26ee1051>
drwxr-xr-x 2 <caseID> hpcadmin 4096 Sep 20 16:45 <session-ID>
Check the log file associated with that session ID:
cat ~/ondemand/data/sys/dashboard/batch_connect/sys/Jupyter-Tensorflow/output/ae477524-3bd1-4d6f-99b3-dd4a26ee1051/output.log
output:
ImportError: cannot import name 'soft_unicode' from 'markupsafe' (/home/<caseID>/.local/lib/python3.7/site-packages/markupsafe/__init__.py)
In this scenario, the markupsafe package under ~/.local had been upgraded to a version that was incompatible with the Jupyter Notebook app. Uninstalling it resolved the issue.
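The two lookup steps above (find the newest session directory, then open its output.log) can be combined. The following is a minimal sketch that assumes the output directory contains only session directories, as in the listing above; latest_session_log is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the path of output.log for the most recently
# modified session directory under an app's output directory.
latest_session_log() {
    local app_output="$1" newest

    # ls -t lists the newest entry first. Session directories are named with
    # a UUID, so their names contain no whitespace.
    newest=$(ls -1t "$app_output" 2>/dev/null | head -n 1)
    [ -n "$newest" ] || return 1

    printf '%s/%s/output.log\n' "${app_output%/}" "$newest"
}
```

For example, tail -n 50 "$(latest_session_log ~/ondemand/data/sys/dashboard/batch_connect/sys/Jupyter-Tensorflow/output)" shows the end of the newest Jupyter-Tensorflow session log.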
If you don't request enough memory, your job may time out or run out of memory. When using OnDemand Desktop, you may get a blank page. Check your job status using the sacct command:
sacct -u aps223 -S 080124
JobID           JobName  Partition    Account  AllocCPUS      State ExitCode
------------ ---------- ---------- ---------- ---------- ---------- --------
2042066      sys/dashb+      batch  axa1826_1          4    TIMEOUT      0:0
2042066.bat+      batch             axa1826_1          4 OUT_OF_ME+    0:125
2042066.ext+     extern             axa1826_1          4  COMPLETED      0:0
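When there are many jobs to scan, it can help to ask sacct for untruncated, pipe-separated output and keep only the job steps that ended in TIMEOUT or OUT_OF_MEMORY. The filter below is a sketch that reads "JobID|State" lines on standard input; failed_for_resources is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: keep only job steps whose state indicates that the
# requested time or memory was not enough.
# Input (stdin): lines of "JobID|State", e.g. from
#   sacct -n -P --format=JobID,State
failed_for_resources() {
    awk -F'|' '$2 ~ /^(TIMEOUT|OUT_OF_MEMORY)/ { print $1 " " $2 }'
}
```

A possible invocation is sacct -u <caseID> -S 080124 -n -P --format=JobID,State | failed_for_resources, where -n drops the header and -P prints full-width fields separated by "|". If anything is listed, request more memory or a longer duration in the app form.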
Problem: I am getting KDE Error related to baloo_file_extractor in a dialog.
Solution: In a terminal on the OnDemand desktop, run the below command[1]:
balooctl disable
Startup problems for RStudio, such as HTTP 502 errors or a lengthy delay while the app loads, may be due to settings that load large data files at startup. The settings can be changed through the graphical interface after it loads, although starting in Safe Mode may be necessary. If prompted to start in Safe Mode, select this option and update the settings. Use the 'Tools' pulldown menu to open 'Global options', where you may deselect the options that restore data or settings from previous sessions by default.
Newer R installations use a different default setting when calling png().
To restore the previous behavior, add the following somewhere in your workflow:
options(bitmapType="cairo")
This could be:
1) Called in the RStudio interpreter prior to running your main script
2) Added to the top of the R script itself
3) Added to /home/<CaseID>/.Rprofile prior to launching the RStudio job in ondemand
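For the third option, the line can be appended to .Rprofile from a shell before the RStudio job is launched. The sketch below adds it only if it is not already present, so it is safe to run more than once; ensure_cairo_option is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: make sure .Rprofile contains the bitmapType option
# exactly once.
ensure_cairo_option() {
    local rprofile="${1:-$HOME/.Rprofile}"
    local line='options(bitmapType="cairo")'

    # Nothing to do if the exact line is already there.
    if [ -f "$rprofile" ] && grep -qxF "$line" "$rprofile"; then
        return 0
    fi

    # If the file exists and does not end with a newline, add one first so
    # the option lands on its own line.
    if [ -s "$rprofile" ] && [ -n "$(tail -c 1 "$rprofile")" ]; then
        printf '\n' >> "$rprofile"
    fi
    printf '%s\n' "$line" >> "$rprofile"
}
```

Calling ensure_cairo_option with no argument edits ~/.Rprofile; pass a path to edit a different file.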
We advise installing packages directly in R (https://sites.google.com/a/case.edu/hpcc/hpc-cluster/pioneer-software/programming-languages/r-project). The RStudio app presents a container environment which lacks the full environment necessary to install software in many cases.
A session may begin in which, at launch, the connection is made to the compute node 'remote host', and yet the RStudio session is not displayed in the window. The url shows the session details, and the 'spinning wheel' may appear for a few seconds, but after that a blank window persists for many minutes, or even multiple hours if you wait.
In this case, check for settings left over from previous 'active' sessions. By default, this information is in the following location:
/home/<caseid>/.local/share/rstudio/sessions/active/
Look for subdirectories that contain session information, and specifically for a file named 'session-persistent-state', for example:
/home/<caseid>/.local/share/rstudio/sessions/active/session-efe57ea5/session-persistent-state
That file may contain information similar to:
abend="1"
active-client-id="99ee0b2a-6998-48ea-a099-0d7eab5b0327"
If so, delete the file and/or directory contents, and relaunch the connection to the 'blank' session.
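The check described above can be scripted. This sketch lists the session directories whose session-persistent-state file contains abend="1" and deletes nothing, so you can review the list before removing anything; find_abended_sessions is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: list RStudio session directories whose
# session-persistent-state file records an abnormal end (abend="1").
find_abended_sessions() {
    local active="${1:-$HOME/.local/share/rstudio/sessions/active}"
    local state

    for state in "$active"/session-*/session-persistent-state; do
        # The glob stays unexpanded when nothing matches; skip that case.
        [ -f "$state" ] || continue
        if grep -q '^abend="1"' "$state"; then
            dirname "$state"
        fi
    done
    return 0
}
```

Each printed directory is a candidate for the cleanup step above, for example rm -r on that directory once you have confirmed that it belongs to the stuck session.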