A rolling reboot is in progress to address CVE-2026-23111 for all clusters, including Ascend, Cardinal, and Pitzer.

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
STAR-CCM+ MPI job failure Cardinal, Software Resolved
(workaround)

STAR-CCM+ encounters errors when running MPI jobs using Intel MPI or Open MPI, displaying the following message:

ib_iface.c:1139 UCX ERROR Invalid active_speed on...
Read more
1 year 8 months ago 23 hours 51 min ago
Temporary Login Node Instability on Ascend Ascend Unresolved

We are currently experiencing temporary instability on the Ascend login nodes, which may result in slow response times or unexpected session disconnects. Our team is actively...

Read more
3 weeks 1 day ago 3 days 11 hours ago
Rolling Reboot for Security Fix Unresolved

A rolling reboot is in progress to address CVE-2026-23111 (nf_tables logic bug) for all clusters, including Ascend, Cardinal, and Pitzer. Login nodes will be rebooted first and access...

Read more
1 week 2 days ago 3 days 14 hours ago
cuMemHostRegister Fails with CUDA_ERROR_INVALID_VALUE on RHEL 9.6 Ascend, Cardinal, GPU, system software Unresolved

After upgrading the operating system to RHEL 9.6 during the scheduled downtime on May 12, 2026,  applications utilizing UCX (...

Read more
3 weeks 2 days ago 3 weeks 2 days ago
STAR-CCM+ OpenMPI Job Failed due to Out-of-Memory Cardinal, Software Unresolved
(workaround)

After the scheduled downtime on May 12, 2026, STAR-CCM+ encounters out-of-memory errors when running OpenMPI jobs. A message...

Read more
3 weeks 2 days ago 3 weeks 2 days ago
Nsight GPU profiler not working due to DCGM conflict GPU, Infrastructure Resolved

UPDATE (Mar 15, 2023)

After the downtime on Mar. 14, 2023, OSC enabled a new Slurm option --gres=nsight. DCGM will be disabled on the nodes for the...

Read more
3 years 3 months ago 1 month 13 hours ago
MATLAB (legacy) r2024a fails to launch on OnDemand Desktop Software Resolved
(workaround)

As of the downtime on 05/12/26 MATLAB (legacy) has stopped working when users try to open .m files in the GUI application for version r2024a. At the time we suggested to please use the...

Read more
1 month 1 week ago 1 month 2 days ago
ptrace Disabled Across OSC Systems Unresolved

ptrace has been disabled globally across all OSC systems to mitigate a newly identified Linux kernel vulnerability. If this security mitigation impacts your active research...

Read more
1 month 5 days ago 1 month 5 days ago
MVAPICH 3.0 hang due to PMI mismatch with Slurm Software Resolved
(workaround)

Applications such as Quantum ESPRESSO, LAMMPS, and NWChem experienced hangs with MVAPICH 3.0 due to a PMI mismatch. MVAPICH 3.0 was built with PMI-1, while newer Slurm versions on RHEL 9...

Read more
8 months 2 weeks ago 1 month 1 week ago
Reboot of Public-Facing Systems for Security Fix Resolved

Updates on May 12:

All  public-facing systems have been rebooted. 

Original Post:

Due to a critical...

Read more
1 month 3 weeks ago 1 month 1 week ago

Pages