We are currently experiencing temporary instability on the Ascend login nodes.

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Rolling reboot of all clusters, starting from 8 AM Tuesday, June 19, 2018 Batch, Owens, Ruby Resolved

Posted on June 12, 2018, at 4:40 PM:

We will have rolling reboots of three clusters (Owens, Ruby, and Oakley) including login and compute nodes, starting from 8 AM Tuesday... Read more

8 years 10 hours ago 7 years 11 months ago
Jobs reports 'excessive memory usage' message Owens, Pitzer Resolved

... Read more

6 years 3 months ago 4 years 1 month ago
Intermittent DNS issues Resolved

3/9/15 Update: The DNS issues have been resolved.  In total, the following services may have been affected by the DNS issues:

  • Logging in / Connecting out to... Read more
11 years 3 months ago 11 years 3 months ago
HCOLL-related failures in OpenMPI applications Cardinal, Software Resolved
(workaround)

Several applications using OpenMPI, including HDF5, Boost, Rmpi, ORCA, and CP2K, may fail with errors such as

mca_coll_hcoll_module_enable() coll_hcol: mca_coll_hcoll_save_coll_handlers... Read more          
1 year 7 months ago 1 year 1 month ago
Issues with GPFS filesystem since Saturday May 18 filesystem Resolved

Update Posted on 20 May 2019 13:24

We fixed the problem with both project and scratch filesystem. Please contact oschelp@osc.edu if... Read more

7 years 3 weeks ago 7 years 3 weeks ago
Apptainer container builds fail on compute nodes due to /tmp namespace behavior Software Resolved
(workaround)

Building Apptainer containers on compute nodes may fail during apt update or other package operations... Read more

5 months 5 days ago 4 months 15 hours ago
Correction on OSC project restriction email Account Management Resolved

Updated:

OSC has resolved this morning's issue and reverted impacted projects back to an ACTIVE status. Queued jobs under those projects will be able to start once today's... Read more

4 years 11 months ago 4 years 11 months ago
Dec 27, 2016: Issues with /fs/project filesystem Resolved

Dec 27, 2016 3:46PM Update: Both project and scratch file systems (/fs/project and /fs/scratch ) are back to normal now.  Some users' jobs may be... Read more

9 years 5 months ago 9 years 5 months ago
Resolved: Home directory space Issue with MATLAB 2024a Software Resolved

Users may experience their home directory running out of space after executing multiple MATLAB 2024a jobs. This issue is caused by the accumulation of multiple copies of the MathWorks Service... Read more

1 year 1 week ago 1 year 1 week ago
OnDemand experiencing difficulties Web Services Resolved

OnDemand is experiencing some difficulties that may be related to the changes from the downtime. We are aware of these problems and are working on resolving them. We appreciate your patience.

12 years 4 months ago 12 years 4 months ago

Pages