Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Postedsort ascending Updated
Apptainer sandbox fails on GPFS due to stale NFS file handles Software Resolved

Users may encounter errors when attempting to run a sandbox on GPFS-mounted directories such as /fs/scratch or /fs/ess. The error... Read more

9 months 2 weeks ago 5 months 3 weeks ago
CP package hang with MVAPICH3 in Quantum-Espresso Software Resolved

While using MVAPICH3 builds of Quantum ESPRESSO (QE), users may encounter hangs when running the CP package, which can lead to job timeouts. We recommend switching to the OpenMPI build of... Read more

10 months 3 weeks ago 5 months 2 weeks ago
OSC Service Outage Notification Outage Resolved

We are currently experiencing outages affecting multiple services, including OnDemand (ondemand.osc.edu) and login nodes of HPC systems. Our team is actively investigating and working to resolve... Read more

11 months 5 days ago 11 months 4 days ago
Some MKL environment variables have incorrect paths Ascend, Cardinal, Pitzer, Software Resolved

MKL module files define some helper environment variables with incorrect paths.  This can yield link time errors.  All three clusters are affected.  We are working to correct the module... Read more

11 months 6 days ago 9 months 4 days ago
OneDrive Connector File Transfer Issue with Globus Resolved

Update:

We deployed the fix from Globus: https://docs.globus.org/globus-connect-server/... Read more

11 months 1 week ago 10 months 4 weeks ago
Resolved: Home directory space Issue with MATLAB 2024a Software Resolved

Users may experience their home directory running out of space after executing multiple MATLAB 2024a jobs. This issue is caused by the accumulation of multiple copies of the MathWorks Service... Read more

11 months 2 weeks ago 11 months 2 weeks ago
NCCL hang on Ascend dual-GPU nodes Ascend, GPU, Software Resolved
(workaround)

Users may encounter the following message and experience NCCL hangs if the first operation is a barrier when running multi-GPU training. We have identified... Read more

11 months 3 weeks ago 11 months 2 weeks ago
Python version mismatch in Jupyter + Spark instance Software Resolved
(workaround)

You may encounter the following error message when running a Spark instance using a custom kernel in the Jupyter + Spark app:

25/04/25 10:49:01 WARN TaskSetManager:... Read more          
12 months 3 days ago 5 months 4 days ago
Instability on Clusters after May 13 Downtime Resolved

We've been experiencing some instability on the clusters (particularly Cardinal and Ascend) following the recent May 13 downtime, especially with parallel job processing. If you notice any unusual... Read more

12 months 3 days ago 11 months 2 weeks ago
STAR error bgzf_open: Assertion failed Cardinal, Software Resolved
(workaround)

You may encounter errors that look similar to these when running STAR 2.7.10b:

STAR: bgzf.c:158: bgzf_open: Assertion `compressBound(0xff00) < 0x10000' failed.

Cause... Read more

12 months 3 days ago 11 months 3 weeks ago

Pages