We are currently experiencing outages affecting multiple services, including OnDemand (ondemand.osc.edu) and login nodes of HPC systems.

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Project space giving errors "No space left on device" filesystem Resolved

11/01/2016 11:52AM Update: This issue has been fixed. 

We have become aware of a problem with the Project storage space that gives errors "No space left on device". The... Read more

8 years 7 months ago 8 years 7 months ago
Fail to connect with VS Code 1.86 Resolved

VS Code 1.86 (aka the ‘January 2024’ update) requires ≥glibc 2.28 which are not supported on Pitzer and Owens clusters. Please downgrade to VS Code 1.85. 

See this link for more... Read more

1 year 4 months ago 1 year 1 month ago
Brief disruption to external network, 2013/12/29 Connectivity Resolved

Between 5:00AM and 9:00AM EDT on Sunday,... Read more

11 years 5 months ago 11 years 5 months ago
GPFS errors on compute nodes filesystem Resolved

We've seen an increase in transient problems that result in compute nodes losing access to the GPFS file systems for ~5 minutes.

Any jobs running on these nodes accessing files on GPFS may... Read more

4 years 6 months ago 3 years 6 months ago
Rolling reboot of Owens cluster, starting from Monday, April 16, 2018 Owens Resolved

12:00 PM 5/7/2018 Update:

The rolling reboot of Owens has been completed. 

Posted on April 11, 2018, at 3:45... Read more

7 years 2 months ago 7 years 1 month ago
PyTorch hangs on dual-gpu node on Ascend Ascend, GPU Resolved
(workaround)

PyTorch can hang on Ascend on dual-GPU nodes

Through internal testing, we have confirmed that the hang issue only occurs on Ascend dual-GPU (nextgen) nodes. We’re still unsure why... Read more

1 month 2 weeks ago 1 month 1 week ago
STAR error bgzf_open: Assertion failed Cardinal, Software Resolved
(workaround)

You may encounter errors that look similar to these when running STAR 2.7.10b:

STAR: bgzf.c:158: bgzf_open: Assertion `compressBound(0xff00) < 0x10000' failed.

Cause... Read more

4 weeks 13 hours ago 2 weeks 2 days ago
Problems with Project Space (/nfs/gpfs) filesystem Resolved

(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.


As of early afternoon, Sept. 8,... Read more

9 years 9 months ago 9 years 9 months ago
Some backups of ESS project directories are missed Resolved

Backups of ESS project directories (/fs/ess) are missed on August 10 and 13, 2022. We apologize for the inconvenience.

Please contact OSC Help if you... Read more

2 years 9 months ago 2 years 9 months ago
Segmentation fault from openmpi/1.10-hpcx and 2.0-hpcx on Owens Owens, Software Resolved

We have found that recent MPI jobs using openmpi/1.10-hpcx and openmpi/2.0-hpcx on Owens may complete or hang until the job is killed, but receive segmentation fault. Some applications might be ... Read more

5 years 10 months ago 5 years 10 months ago

Pages