Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort ascending Description Posted Updated
Brief interruption of batch services on 4/17 Batch Resolved

On April 17th 2013, at roughly 2PM, we will be rebooting the batch server on the Oakley cluster. Running jobs will not be affected, but there will be a brief disruption in scheduling, as well as... Read more

11 years 5 months ago 11 years 5 months ago
Slurm database repair on 01/25/2024 Outage Resolved

We have scheduled a Slurm database repair, which is planned to start at 8:30 am US/Eastern on Thursday, January 25, 2024. During the repair, Slurm database will be offline; running jobs and... Read more

8 months 3 weeks ago 8 months 2 weeks ago
Rolling reboot of Owens cluster, starting from 9AM June 28, 2017 Owens Resolved

Update posted on July 7, 2017 at 2:00PM:

Rolling reboot of login and compute nodes of Owens cluster is completed. 

... Read more
7 years 3 months ago 7 years 3 months ago
MVAPICH2 build of CP2K 6.1 Pitzer Resolved

We have found some types of CP2K jobs would fail or have poor performance using cp2k.popt and cp2k.psmp from MVAPICH2 build (gnu/4.8.5 mvapich2/2.3). This version will be removed on December 15th... Read more

3 years 10 months ago 3 years 7 months ago
Lustre jobs suspended filesystem Resolved

The Lustre filesystem ($PFSDIR and /fs/lustre) has crashed several times Friday evening (8/15). We have degraded this service temporarily, while we work to isolate the actions that are triggering... Read more

10 years 1 month ago 10 years 1 month ago
XDMOD outage: 9AM-4PM June 2 2021 Outage Resolved

There is a scheduled outage for XDMOD tool (xdmod.osc.edu) between 9AM-4PM June 2 2021 for upgrading to 9.5.0. During the outage XDMOD will be in maintain mode and not accessible by OSC users. ... Read more

3 years 4 months ago 3 years 4 months ago
Maintenance for OnDemand and other web based services Resolved

Update (12/13/14 10am): Maintenance has finished as planned.

 

OnDemand, AweSim applications, and other web based services will be down starting Wednesday, January 31 at 8:30AM for... Read more

9 years 9 months ago 9 years 9 months ago
Incorrect RU Balances client portal Resolved

RESOLVED 2/20/2019

We deployed a new version of the Client Portal during our downtime on Tuesday, 2/5, and a bug has been introduced.

The Client Portal (my.osc.edu) and OSCUsage... Read more

5 years 8 months ago 5 years 7 months ago
issues accessing /fs/ess/ locations filesystem, Operations, Outage Resolved

15 Aug 2022 resolved

... Read more

2 years 2 months ago 2 years 1 month ago
libibumad.so.2 missing on Oakley Software Resolved

Update:  We think this is fixed.  Please submit a ticket if you encounter further problems.

 

As a result of updates made during yesterday's downtime, software built with mvapich2/... Read more

7 years 12 months ago 7 years 12 months ago

Pages