Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Rolling reboots of all three clusters, starting from Tuesday, September 4, 2018 Batch, Owens, Ruby Resolved

10:14 AM 09/17/2018... Read more

6 years 10 months ago 6 years 9 months ago
MVAPICH2 and/or STAR-CCM+ MPI job failure and workaround Cardinal, Software Resolved
(workaround)

... Read more

8 months 2 weeks ago 4 days 9 hours ago
warning: libhwloc.so.1 may conflict with libhwloc.so.5 Resolved

Sometimes when building MPI programs the following warning appears.  It is harmless and can be safely ignored.

ld: warning: libhwloc.so.1, needed by /usr/local/mvapich2/1.7-intel/lib/... Read more          
10 years 2 months ago 9 years 8 months ago
CP2K 6.1 would fail on Pitzer Cascade Lakes (48-core) node: Pitzer Resolved
(workaround)

CP2K 6.1 would fail with the following error when running on Pitzer Cascade Lakes (48-core) node:

Program received signal SIGFPE: Floating-point exception - erroneous arithmetic... Read more          
4 years 1 week ago 1 month 2 weeks ago
A bug in the trigger that sends automated emails from client portal client portal Resolved

We deployed a new version to OSC Client Portal (my.osc.edu) at 3 pm Tuesday, July 9th, which involves a bug in the trigger that sends automated emails to some OSC clients with the subject 'Your... Read more

5 years 11 months ago 5 years 11 months ago
NCCL hang on Ascend dual-GPU nodes Ascend, GPU, Software Resolved
(workaround)

Users may encounter the following message and experience NCCL hangs if the first operation is a barrier when running multi-GPU training. We have identified... Read more

1 month 5 days ago 4 weeks 1 day ago
Rolling reboot of compute and login nodes of all clusters, starting from Wednesday morning, March 22, 2017 login, Owens, Ruby Resolved

4:56PM 3/28/2017 Update: The rolling reboots of all systems are completed. 

All compute nodes and login nodes of Owens, Oakley, and Ruby clusters will need to be rebooted... Read more

8 years 3 months ago 8 years 3 months ago
OnDemand has NOT been working with external providers since 08/22 OnDemand Resolved

Updates on 9:40AM August 23, 2017: this issue has been resolved. 

>>>

Issue:

User can NOT login to OnDemand (ondemand.osc.edu)... Read more

7 years 10 months ago 7 years 10 months ago
Outbound Emails from MyOSC are Blocked at MS 365 Servers Account Management, client portal Resolved

Outbound emails, including account verification emails, from MyOSC to institutional email addresses utilizing MS 365 are being blocked as phishing attempts at the institution's server.

... Read more

1 year 10 months ago 1 year 9 months ago
Lustre Updates filesystem Resolved

9/10/14 - We have not seen any additional crashes of the Lustre servers since making this change.

8/26/14 
- Lustre jobs are being accepted as of 10AM this... Read more

10 years 10 months ago 10 years 9 months ago

Pages