We are currently experiencing temporary instability on the Ascend login nodes.

A rolling reboot is in progress to address CVE-2026-23111 for all clusters, including Ascend, Cardinal, and Pitzer.

Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Title Category Resolution Description Posted Updated
Large MPI job startup hang with mvapich2/2.3 and mvapich2/2.3.1 Owens, Pitzer, Software Resolved (workaround)

We have found that large MPI jobs may hang at startup with mvapich2/2.3 and mvapich2/2.3.1 (with any compiler dependency) due to a known bug that has been fixed in release 2...

6 years 7 months ago 1 year 1 month ago
"pbsdcp" is not working on Oakley Resolved

12:35PM 5/24/2017 Update: pbsdcp has been fixed on Oakley.

pbsdcp is not working on Oakley and returns a missing library error as below:...

9 years 3 weeks ago 9 years 3 weeks ago
Backup Issues Backups Resolved

Updates on 04/17:

OSC has conducted thorough validations to ensure the integrity of our backup data for user home directories and the /fs/ess filesystem. 

...

2 years 2 months ago 2 years 2 months ago
Temporary StarCCM License Outage Cardinal Resolved

StarCCM software will be temporarily unavailable starting February 21, 2026.

Why: Our current license expires on Feb 21 at 11:59pm EST.
...

3 months 3 weeks ago 3 months 3 weeks ago
my.osc.edu logins failing Account Management Resolved

Logins to my.osc.edu are failing. This is unrelated to our InfiniBand issue; a router change at OARnet is believed to be the cause. OARnet is working on re-establishing the necessary routing.

11 years 10 months ago 11 years 10 months ago
Rolling reboot of Pitzer cluster, starting from Feb 03, 2021 Batch, login, Pitzer Resolved

Updates at 10AM Feb 11, 2021:

The rolling reboot is completed. 

Original Post:

We will have rolling reboots of Pitzer cluster including...

5 years 4 months ago 5 years 4 months ago
LAMMPS ppn=40 GPU problem on Pitzer Pitzer, Software Resolved

On Pitzer, GPU jobs using either LAMMPS 22Aug18 or 5Jun19 with ppn=40 hang immediately after LAMMPS begins execution. A workaround is to use ppn=39.

7 years 7 months ago 6 years 5 months ago
OSC will remove Jupyter MATLAB Kernel Cardinal, OnDemand Resolved

OSC will remove the default MATLAB Jupyter Kernel on Tuesday, May 20th, 2025. To create your own Jupyter MATLAB Kernel please follow the documentation on the MATLAB Page...

1 year 1 month ago 1 year 1 month ago
NCCL hang on Ascend dual-GPU nodes Ascend, GPU, Software Resolved (workaround)

Users may encounter the following message and experience NCCL hangs if the first operation is a barrier when running multi-GPU training. We have identified...

1 year 2 weeks ago 1 year 2 weeks ago
GPFS hang Issue on 09/08/2016 filesystem Resolved

On Thursday, Sept 8, starting at 19:37, we had a bad interaction that appears to have been caused by the backup client and the GPFS servers. This resulted in a GPFS hang that propagated I/O...

9 years 9 months ago 9 years 9 months ago
