Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Lustre jobs suspended filesystem Resolved

The Lustre filesystem ($PFSDIR and /fs/lustre) has crashed several times Friday evening (8/15). We have degraded this service temporarily, while we work to isolate the actions that are triggering... Read more

11 years 5 months ago 11 years 4 months ago
Login Issues with my.osc.edu Outage Resolved

Update: this is fixed. 

Original post:

We are aware that some users may be experiencing issues logging into ... Read more

1 year 3 months ago 1 year 3 months ago
A reboot of the NetApp as part of an upgrade, starting from Monday, November 19, 2018 Maintenance Resolved

Updated on 12:49 PM Nov... Read more

7 years 2 months ago 7 years 2 months ago
Reached your pull rate limit while pull from Docker hub Software Resolved
(workaround)

You might encounter an error when pulling from Docker hub:

ERROR: toomanyrequests: Too Many Requests.

or

You have reached your... Read more          
4 years 7 months ago 1 month 3 weeks ago
NAMD 2.11 precompiled binaries do not work Software Resolved

NAMD 2.11 precompiled binaries do not work.  Please use NAMD 2.11 installed from the source and available via module namd/2.11.

The NAMD 2.11 issue involves changes to the command charmrun... Read more

9 years 11 months ago 6 years 11 months ago
Python version mismatch in Jupyter + Spark instance Software Resolved
(workaround)

You may encounter the following error message when running a Spark instance using a custom kernel in the Jupyter + Spark app:

25/04/25 10:49:01 WARN TaskSetManager:... Read more          
8 months 6 days ago 1 month 1 week ago
System Downtime 9/29/13 Outage Resolved

OSC systems will be offline on September 29th, 2013 for maintenance. Please visit osc.edu/n for more information.

12 years 3 months ago 12 years 3 months ago
PyTorch jobs timeout and hanging GPU Resolved

We have observed that many PyTorch users frequently encounter random timeouts, which result in the termination of their jobs but leave the process running on the node.... Read more

2 years 6 months ago 2 years 1 week ago
Testing Issue for Quantum Espresso 7.4.1 on Ascend Ascend, Software Resolved

Benchmark AUSURF112 for quantum-espresso/7.4.1 on Ascend aborts.  We suspect that this is a lurking bug in Quantum Espresso and are reporting it as a convenience.  Concerned users can use... Read more

3 months 3 weeks ago 1 month 3 weeks ago
Rolling reboots of all clusters starting from Monday Feb 5, 2018 Batch, Owens, Ruby Resolved

Posted on Feb 22 at 1:25PM:

The rolling reboots have been completed. 

Posted on Jan 30, 2018 at 4:00PM:

We will have rolling reboots of... Read more

7 years 11 months ago 7 years 11 months ago

Pages