We are currently experiencing temporary instability on the Ascend login nodes.

A rolling reboot is in progress to address CVE-2026-23111 for all clusters, including Ascend, Cardinal, and Pitzer.

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Issue with GPFS on Owens since April 14, 2017 Batch, filesystem, Owens Resolved

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address this GPFS issue. 

We have had issues with GPFS mounts on Owens Cluster since Friday afternoon,...

Read more
9 years 2 months ago 9 years 1 month ago
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node....

Read more
2 years 11 months ago 2 years 5 months ago
Certain modules not accessible Software Resolved

Certain modules are not working for all clusters since the downtime.  We have reports specifically that Amber, Gaussian, and Turbomole are not working.  We are working to resolve the issue, but...

Read more
11 years 11 months ago 11 years 11 months ago
Spurious warnings about balance being exhausted client portal Resolved

Due to the price changes and some specifics about MyOSC, you may get warnings...

Read more
5 years 11 months ago 5 years 10 months ago
Ls-Dyna license outage Licensing Resolved

4:30 PM 10/18/2018 Original Post:

Ls-Dyna license is not available now.

We are actively investigating the issue with LSTC. Please contact ...

Read more
7 years 8 months ago 7 years 8 months ago
Core label on OnDemand app is incorrect OnDemand Resolved

The core label on the OnDemand app incorrectly displays as '1', regardless of the requested number of cores for a job. While this label is incorrect, the job is still allocated the correct number...

Read more
1 year 5 months ago 1 year 4 months ago
GPU/VIS nodes for various OOD apps are broken Cardinal, OnDemand, Outage, Software Resolved

After the most recent downtime we discovered that various OOD apps relying on the "virtualgl" module on Cardinal were broken. We have since updated and pinned the latest virtualgl version...

Read more
4 months 2 weeks ago 4 months 1 week ago
Owens login nodes are impacted due to switch replacement on Thursday, Jan 17, 2019 login, Owens Resolved

We will perform the replacement work of Ethernet switches from 12pm to 3pm on Thursday, Jan 17, which includes all login nodes and 2 quick nodes on Owens. As a result, users won't be able to log...

Read more
7 years 5 months ago 7 years 5 months ago
OSC will remove Jupyter MATLAB Kernel Cardinal, OnDemand Resolved

OSC will remove the default MATLAB Jupyter Kernel on Tuesday, May 20th, 2025. To create your own Jupyter MATLAB Kernel please follow the documentation on the MATLAB Page...

1 year 1 month ago 1 year 1 month ago
NFS service disruption 6/29/16 filesystem Resolved

OSC experienced errors with NFS services the morning of June 29 between 08:37 and 09:12 that may have caused some jobs to fail, or other unexpected behavior.  The...

Read more
9 years 11 months ago 9 years 11 months ago

Pages