Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort ascending Description Posted Updated
Segmentation fault from openmpi/1.10-hpcx and 2.0-hpcx on Owens Owens, Software Resolved

We have found that recent MPI jobs using openmpi/1.10-hpcx and openmpi/2.0-hpcx on Owens may complete or hang until the job is killed, but receive segmentation fault. Some applications might be ... Read more

6 years 6 months ago 6 years 5 months ago
MyOSC unavailable on Tuesday, August 2, 2022, from 10 am to 12 pm client portal Resolved

We will be adding a charge account structure. MyOSC will be unavailable on Tuesday, August 2, 2022, from 10 am to 12 pm while the update occurs. See... Read more

3 years 5 months ago 3 years 5 months ago
Poor network performance on some filesystems filesystem Resolved

We are experiencing some network performance issues on a cluster of servers involved with providing GPFS and some project filesystems. GPFS appears to be functioning acceptably, but proj01, proj02... Read more

12 years 6 months ago 12 years 6 months ago
MPI_THREAD_MULTIPLE is not supported with OpenMPI-HPCX 4.x Owens, Software Resolved

A threading code with MPI where MPI_Init_thread uses MPI_THREAD_MULTIPLE will fail because UCX from HPCX package is built without enabling multi-threading. UCX is the... Read more

2 years 10 months ago 8 months 3 weeks ago
Rolling reboot of Owens cluster, starting from 8:30AM Oct 30, 2017 Batch, Owens Resolved

Updated on Nov 21, 2017 at 3:33PM:

It has been completed. 

Updated on October 20, 2017 at 4:19PM:

We will have a rolling reboot of Owens... Read more

8 years 3 months ago 8 years 2 months ago
Maintenance outage on the cluster export services Maintenance, OnDemand, Ruby Resolved

Update on 14 April 2020, 0903:

Work is completed.

Original message:

There will be maintenance on cluster export services on Tuesday, April... Read more

5 years 9 months ago 5 years 9 months ago
Scheduling suspended Batch Resolved

We have temporarily suspended scheduling due to some problems with the parallel scratch file system.

11 years 4 months ago 11 years 4 months ago
Rolling reboots on all HPC systems starting Oct 31 2024 Owens, Pitzer Resolved

Updates on Nov 13 2024:

Pitzer is completed. 

Updates... Read more

1 year 2 months ago 1 year 2 months ago
NFS outage on Thursday Jan 17 from 7am to 8am filesystem Resolved

Update:

This work has been canceled and will be done during downtime on Feb. 5. 

Original Post:

On Thursday, January 17th from 7 am to 8 am OSC will have a GPFS... Read more

7 years 5 days ago 7 years 5 days ago
Backup failures on ess filesystem Backups, filesystem Resolved

The backups on the /fs/ess filesystem are having issues running. There has not been a successful backup of this filesystem since Sunday, 08 August 2021.

OSC is working with the vendor to... Read more

4 years 5 months ago 4 years 5 months ago

Pages