Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Slurm on Pitzer is offline Resolved

The Slurm scheduler for Pitzer is currently offline. We are working with the vendor for the fix. Sorry for the inconvenience.

1 year 4 months ago 1 year 4 months ago
Incorrect RU Balances client portal Resolved

RESOLVED 2/20/2019

We deployed a new version of the Client Portal during our downtime on Tuesday, 2/5, and a bug has been introduced.

The Client Portal (my.osc.edu) and OSCUsage... Read more

6 years 4 months ago 6 years 3 months ago
Unavailability of intel channel among the conda channel list: Software Resolved

If you are getting an error: 

UnavailableInvalidChannel: HTTP 403 FORBIDDEN for channel intel <... Read more

9 months 3 weeks ago 9 months 3 weeks ago
libibumad.so.2 missing on Oakley Software Resolved

Update:  We think this is fixed.  Please submit a ticket if you encounter further problems.

 

As a result of updates made during yesterday's downtime, software built with mvapich2/... Read more

8 years 8 months ago 8 years 8 months ago
Problems with GPFS filesystem, and OnDemand is not working filesystem Resolved

Updates on 12:00pm June 10:

The issue is fixed. All impacted services return to production.

We apologize for the inconvience. If you have any questions, please... Read more

4 years 3 weeks ago 4 years 3 weeks ago
System Downtime 9/29/13 Outage Resolved

OSC systems will be offline on September 29th, 2013 for maintenance. Please visit osc.edu/n for more information.

11 years 9 months ago 11 years 9 months ago
Instability on Clusters after May 13 Downtime Resolved

We've been experiencing some instability on the clusters (particularly Cardinal and Ascend) following the recent May 13 downtime, especially with parallel job processing. If you notice any unusual... Read more

1 month 2 weeks ago 1 month 11 hours ago
Rolling reboots of all clusters starting from Monday Feb 5, 2018 Batch, Owens, Ruby Resolved

Posted on Feb 22 at 1:25PM:

The rolling reboots have been completed. 

Posted on Jan 30, 2018 at 4:00PM:

We will have rolling reboots of... Read more

7 years 5 months ago 7 years 4 months ago
GPFS problems with /fs/project and possibly /fs/scratch filesystem Resolved

There was an issue with GPFS clients that affected /fs/project and possibly /fs/scratch between around 3:30AM and 8:30AM on Sunday September 4th. Some jobs from clients were also impacted. 

... Read more
2 years 10 months ago 2 years 10 months ago
Maintenance for OnDemand and other web based services Resolved

Update (12/13/14 10am): Maintenance has finished as planned.

 

OnDemand, AweSim applications, and other web based services will be down starting Wednesday, January 31 at 8:30AM for... Read more

10 years 6 months ago 10 years 6 months ago

Pages