We are currently experiencing temporary instability on the Ascend login nodes.

A rolling reboot is in progress to address CVE-2026-23111 for all clusters, including Ascend, Cardinal, and Pitzer.

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updatedsort descending
autodock-gpu 1.5.2 does not work on Cardinal Software Resolved

autodock-gpu/1.5.2 does not work on Cardinal. We suggest using autodock-gpu/1.6 instead

1 month 1 week ago 1 month 1 week ago
Reboot of Public-Facing Systems for Security Fix Resolved

Updates on May 12:

All  public-facing systems have been rebooted. 

Original Post:

Due to a critical...

Read more
1 month 2 weeks ago 1 month 5 days ago
MVAPICH 3.0 hang due to PMI mismatch with Slurm Software Resolved
(workaround)

Applications such as Quantum ESPRESSO, LAMMPS, and NWChem experienced hangs with MVAPICH 3.0 due to a PMI mismatch. MVAPICH 3.0 was built with PMI-1, while newer Slurm versions on RHEL 9...

Read more
8 months 1 week ago 1 month 5 days ago
ptrace Disabled Across OSC Systems Unresolved

ptrace has been disabled globally across all OSC systems to mitigate a newly identified Linux kernel vulnerability. If this security mitigation impacts your active research...

Read more
1 month 1 day ago 1 month 1 day ago
MATLAB (legacy) r2024a fails to launch on OnDemand Desktop Software Resolved
(workaround)

As of the downtime on 05/12/26 MATLAB (legacy) has stopped working when users try to open .m files in the GUI application for version r2024a. At the time we suggested to please use the...

Read more
1 month 3 days ago 4 weeks 22 hours ago
Nsight GPU profiler not working due to DCGM conflict GPU, Infrastructure Resolved

UPDATE (Mar 15, 2023)

After the downtime on Mar. 14, 2023, OSC enabled a new Slurm option --gres=nsight. DCGM will be disabled on the nodes for the...

Read more
3 years 3 months ago 3 weeks 5 days ago
STAR-CCM+ OpenMPI Job Failed due to Out-of-Memory Cardinal, Software Unresolved
(workaround)

After the scheduled downtime on May 12, 2026, STAR-CCM+ encounters out-of-memory errors when running OpenMPI jobs. A message...

Read more
2 weeks 5 days ago 2 weeks 5 days ago
cuMemHostRegister Fails with CUDA_ERROR_INVALID_VALUE on RHEL 9.6 Ascend, Cardinal, GPU, system software Unresolved

After upgrading the operating system to RHEL 9.6 during the scheduled downtime on May 12, 2026,  applications utilizing UCX (...

Read more
2 weeks 5 days ago 2 weeks 5 days ago
Temporary Login Node Instability on Ascend Ascend Unresolved

We are currently experiencing temporary instability on the Ascend login nodes, which may result in slow response times or unexpected session disconnects. Our team is actively...

Read more
2 weeks 4 days ago 2 weeks 4 days ago
Rolling Reboot for Security Fix Unresolved

A rolling reboot is in progress to address CVE-2026-23111 (nf_tables logic bug) for all clusters, including Ascend, Cardinal, and Pitzer. Login nodes will be rebooted first and access...

Read more
5 days 19 hours ago 4 days 21 hours ago

Pages