Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem for which a permanent workaround is available; the workaround may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Slurm on Pitzer is offline Resolved

The Slurm scheduler for Pitzer is currently offline. We are working with the vendor on a fix. We apologize for the inconvenience.

1 year 9 months ago 1 year 9 months ago
Network card re-seat Network Resolved

At 8AM on Tuesday, July 9th 2013, we will be re-seating a network card in a switch at our operations center. A brief (~10 minute) outage may occur. Jobs will pause for the...

12 years 5 months ago 12 years 5 months ago
Rolling reboot of Oakley and Ruby clusters, starting from 8:30AM October 9, 2017 Batch, login, Ruby Resolved

Updates on 1:00PM October 16, 2017: 

The rolling reboots of Oakley and Ruby are completed. 

...
8 years 2 months ago 8 years 2 months ago
OnDemand error when terminating interactive app OnDemand Resolved (workaround)

When trying to delete an interactive session through OnDemand, you may receive an error page about 'No such file'. This can be disregarded. Simply navigate back to the interactive sessions page...

5 years 2 weeks ago 3 years 8 months ago
OpenMPI-HPCX 4.1.x hangs on writing files on a shared file system Software Resolved (workaround)

Your job utilizing openmpi/4.1.x-hpcx (or 4.1.x on Ascend) might hang while writing files on a shared file system. This issue is caused by a ...

7 months 2 weeks ago 7 months 2 weeks ago
Oakley login node instability Operations Resolved

Oakley login nodes are seeing some instability related to Lustre. We will reboot the nodes on Thursday, October 2nd 2014 to resolve the issue. If a login node crashes before then and we have the...

11 years 2 months ago 11 years 1 month ago
MVAPICH broken on Ruby Ruby Resolved

Update Monday February 16th -- Ruby MVAPICH2 build fixed.

Ruby's MVAPICH2 build has been fixed.  Please email oschelp@osc.edu with any issues.

...
10 years 10 months ago 10 years 10 months ago
Application Errors client portal Resolved

When beginning a major or discovery-level application for resources at OSC, you are asked for a required justification on the Additional Documents page. However, there is no mechanism for you to...

6 years 7 months ago 6 years 4 months ago
GPFS problems with /fs/project and possibly /fs/scratch filesystem Resolved

There was an issue with GPFS clients that affected /fs/project and possibly /fs/scratch between around 3:30AM and 8:30AM on Sunday September 4th. Some jobs from clients were also impacted. 

...
3 years 3 months ago 3 years 3 months ago
Project space giving errors "No space left on device" filesystem Resolved

11/01/2016 11:52AM Update: This issue has been fixed. 

We have become aware of a problem with the Project storage space that produces the error "No space left on device". The...

9 years 1 month ago 9 years 1 month ago
