Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
Rolling reboots of all three clusters, starting from Tuesday, September 4, 2018 Batch, Owens, Ruby Resolved

10:14 AM 09/17/2018... Read more

6 years 4 months ago 6 years 4 months ago
Handling full-node MPI warnings with MVAPICH 3.0 Ascend, Cardinal Resolved
(workaround)

When running a full-node MPI job with MVAPICH 3.0 , you may encounter the following warning message:

[][mvp_generate_implicit_cpu_mapping] WARNING: You appear to be running at full... Read more          
2 months 2 weeks ago 2 months 2 weeks ago
warning: libhwloc.so.1 may conflict with libhwloc.so.5 Resolved

Sometimes when building MPI programs the following warning appears.  It is harmless and can be safely ignored.

ld: warning: libhwloc.so.1, needed by /usr/local/mvapich2/1.7-intel/lib/... Read more          
9 years 8 months ago 9 years 3 months ago
/fs/ess and OnDemand not accessible filesystem, OnDemand Resolved

/fs/ess and OnDemand are not accessible now. We are working on this.

Sorry for the inconvenience. Please contact OSC Help if you have any questions. 

3 years 5 months ago 3 years 5 months ago
A bug in the trigger that sends automated emails from client portal client portal Resolved

We deployed a new version to OSC Client Portal (my.osc.edu) at 3 pm Tuesday, July 9th, which involves a bug in the trigger that sends automated emails to some OSC clients with the subject 'Your... Read more

5 years 6 months ago 5 years 6 months ago
Rolling reboot of compute and login nodes of all clusters, starting from Wednesday morning, March 22, 2017 login, Owens, Ruby Resolved

4:56PM 3/28/2017 Update: The rolling reboots of all systems are completed. 

All compute nodes and login nodes of Owens, Oakley, and Ruby clusters will need to be rebooted... Read more

7 years 10 months ago 7 years 9 months ago
Nsight GPU profiler not working due to DCGM conflict GPU, Infrastructure Resolved

UPDATE (Mar 15, 2023)

After the downtime on Mar. 14, 2023, OSC enabled a new Slurm option --gres=nsight. DCGM will be disabled on the nodes for the job with the Slurm option,... Read more

1 year 10 months ago 1 year 10 months ago
Account changes temporarily suspended Account Management Resolved

We are still experiencing some account problems related to Thursday's issue. As a result, we have taken my.osc.edu offline and cannot process email changes or password resets, either via self-... Read more

10 years 7 months ago 10 years 7 months ago
Security Vulnerability for GPFS filesystem Resolved

Update: The fix was deployed during May 19 Downtime. 

Clients are not able to use mm* commands to manipulate GPFS ACLs on most OSC systems, due to a security vulnerability... Read more

4 years 8 months ago 4 years 7 months ago
Lustre Updates filesystem Resolved

9/10/14 - We have not seen any additional crashes of the Lustre servers since making this change.

8/26/14 
- Lustre jobs are being accepted as of 10AM this... Read more

10 years 4 months ago 10 years 4 months ago

Pages