We are currently experiencing outages affecting multiple services, including OnDemand (ondemand.osc.edu) and login nodes of HPC systems.

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Owens login nodes are impacted due to switch replacement on Thursday, Jan 17, 2019 login, Owens Resolved

We will perform the replacement work of Ethernet switches from 12pm to 3pm on Thursday, Jan 17, which includes all login nodes and 2 quick nodes on Owens. As a result, users won't be able to log... Read more

6 years 5 months ago 6 years 4 months ago
starccm/15.02.007 with intelmpi after Mar 22, 2022 Resolved
(workaround)

STAR-CCM+ 15.02.007 and 15.02.007-mixed with intelMPI would fail on multiple node jobs after the downtime on Mar 22, 2022. Please use openmpi instead. You can find more... Read more

3 years 2 months ago 1 month 13 hours ago
NFS service disruption 6/29/16 filesystem Resolved

OSC experienced errors with NFS services the morning of June 29 between 08:37 and 09:12 that may have caused some jobs to fail, or other unexpected behavior.  The... Read more

8 years 11 months ago 8 years 11 months ago
GPFS filesystem Problem on Oct 24 2019 filesystem Resolved

Updated on 4:45 PM Oct 24, 2019

The issue is fixed. GPFS filesystems and OnDemand are back. 

Original Post

We are having issues with GPFS filesystem... Read more

5 years 7 months ago 5 years 7 months ago
Brief disruption on 8/1/2013 at 8AM Network Resolved

At 8AM on the morning of 8/1/2013, we will be replacing some faulty hardware in our network infrastructure. Unfortunately, this work cannot be delayed until the next downtime, and the replacement... Read more

11 years 10 months ago 11 years 10 months ago
Storage Problems - GPFS services (Project / Scratch) filesystem Resolved

Updates at 15:52 March 11, 2020:

The issue with the Project file system that causes deletes of file system snapshots to fail has now been resolved. OSC Project file system... Read more

5 years 3 months ago 5 years 3 months ago
2/13/2014 0730 - Reboot of login nodes Outage Resolved

We need to reboot all of the login nodes on our production clusters to fix a minor issue from the downtime. We will be conducting this reboot at 7:30AM on Thursday, February 13th 2014. We expect... Read more

11 years 4 months ago 11 years 4 months ago
Unavailability of intel channel among the conda channel list: Software Resolved

If you are getting an error: 

UnavailableInvalidChannel: HTTP 403 FORBIDDEN for channel intel <... Read more

9 months 5 days ago 9 months 5 days ago
Issue with submitting job array Batch, Owens Resolved

3:30 PM 5/10/2018 Original Post:

User may have been getting the following error message when trying to submit a PBS job using job arrays:

qsub: submit error (Maximum number of... Read more          
7 years 1 month ago 3 years 6 months ago
Problems with GPFS filesystem, and OnDemand is not working filesystem Resolved

Updates on 12:00pm June 10:

The issue is fixed. All impacted services return to production.

We apologize for the inconvience. If you have any questions, please... Read more

4 years 3 days ago 4 years 3 days ago

Pages