Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort descending Description Posted Updated
MPI fails with UCX 1.18 Software Resolved
(workaround)

After the downtime on August 19, 2025, users may encounter UCX errors such as:

UCX ERROR no active messages transport to <no debug data>: self/memory -... Read more          
7 months 3 weeks ago 6 months 1 week ago
Poor network performance on some filesystems filesystem Resolved

We are experiencing some network performance issues on a cluster of servers involved with providing GPFS and some project filesystems. GPFS appears to be functioning acceptably, but proj01, proj02... Read more

12 years 8 months ago 12 years 8 months ago
MOE license server down Licensing Resolved

The MOE license server is experiencing an unknown issue and potentially down.  We are working to resolve the issue.

2 years 6 months ago 2 years 6 months ago
Rolling reboot of Owens cluster, starting from 8:30AM Oct 30, 2017 Batch, Owens Resolved

Updated on Nov 21, 2017 at 3:33PM:

It has been completed. 

Updated on October 20, 2017 at 4:19PM:

We will have a rolling reboot of Owens... Read more

8 years 5 months ago 8 years 4 months ago
OpenMPI job stopped at 'There are not enough slots available in the system to satisfy the slots' Owens, Pitzer, Software Resolved

Users would encounter a MPI job failed with openmpi/3.1.0-hpcx on Owens and Pitzer. The job would stop with the error  like "There are not enough slots available in the system to... Read more

5 years 8 months ago 5 years 7 months ago
Scheduling suspended Batch Resolved

We have temporarily suspended scheduling due to some problems with the parallel scratch file system.

11 years 6 months ago 11 years 6 months ago
starccm outage Feb 21, 2021 Licensing, Outage, Owens, Software Resolved

Updated on Feb 25: 

StarCCM license outage is restored.

Original post:

OSC's starccm software license will expire at 12 a.m., Sunday, Feb... Read more

5 years 1 month ago 5 years 1 month ago
Ruby Rolling Reboot Resolved

2015/02/16 RUBY Rolling Reboot starting Today

 

A rolling reboot is required on Ruby to update a critical... Read more

11 years 1 month ago 11 years 1 month ago
Multiple definition error Software Resolved
(workaround)

GNU compiler versions 10+ may have C compiler errors like

/.libs/libmca_mtl_psm.a(mtl_psm_component.o): multiple definition of `mca_mtl_psm_component'

This is a ... Read more

11 months 4 days ago 11 months 4 days ago
Password Expiration Emails client portal Resolved

Password expiration notices are still being sent after you change your password.

To ensure your password change date has been updated and the account will not expire, please look at... Read more

6 years 11 months ago 6 years 11 months ago

Pages