Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolutionsort ascending Description Posted Updated
Segmentation fault from openmpi/1.10-hpcx and 2.0-hpcx on Owens Owens, Software Resolved

We have found that recent MPI jobs using openmpi/1.10-hpcx and openmpi/2.0-hpcx on Owens may complete or hang until the job is killed, but receive segmentation fault. Some applications might be ... Read more

3 years 10 months ago 3 years 9 months ago
OnDemand Apps are not working OnDemand, Outage Resolved

Apps through OnDemand (ondemand.osc.edu) can't be launched successfully. You will see the error like:


Failed to submit session with the following error:
sbatch: error: 'pitzer... Read more

2 years 1 day ago 2 years 1 day ago
Poor network performance on some filesystems filesystem Resolved

We are experiencing some network performance issues on a cluster of servers involved with providing GPFS and some project filesystems. GPFS appears to be functioning acceptably, but proj01, proj02... Read more

9 years 10 months ago 9 years 10 months ago
Rolling reboots on owens and pitzer starting 18 Aug 2021 Batch, Connectivity, Maintenance Resolved

We will have rolling reboots of Owens and Pitzer cluster, including login and compute nodes, starting from 9am on August 18, 2021. The rolling reboot is for urgent security updates.

The... Read more

1 year 9 months ago 1 year 8 months ago
Rolling reboot of Owens cluster, starting from 8:30AM Oct 30, 2017 Batch, Owens Resolved

Updated on Nov 21, 2017 at 3:33PM:

It has been completed. 

Updated on October 20, 2017 at 4:19PM:

We will have a rolling reboot of Owens... Read more

5 years 7 months ago 5 years 6 months ago
Scheduling suspended Batch Resolved

We have temporarily suspended scheduling due to some problems with the parallel scratch file system.

8 years 8 months ago 8 years 8 months ago
Possible job failures due to MPI library change on Pitzer after May 20 Software Resolved

There are changes on MPI libraries on Pitzer after May 20. We will upgrade MOFED from 4.9 to 5.6 and recompile all OpenMPI and Mvapich2 against the newer MOFED version. Users with their own MPI... Read more

3 weeks 5 days ago 3 weeks 5 days ago
NFS outage on Thursday Jan 17 from 7am to 8am filesystem Resolved

Update:

This work has been canceled and will be done during downtime on Feb. 5. 

Original Post:

On Thursday, January 17th from 7 am to 8 am OSC will have a GPFS... Read more

4 years 4 months ago 4 years 4 months ago
Problems with the home directories filesystem Resolved

 We are currently seeing problems with the home directories at OSC's HPC facility.... Read more

3 years 2 days ago 3 years 2 days ago
June 7th downtime to finish at 6:30PM Connectivity, filesystem, Infrastructure, login, Login Problems, Maintenance, Operations, Outage Resolved

Update: Downtime completed at 6:30PM, June 7th.

 

The June 7th downtime is now slated to be completed at 6:30PM.  Previous estimate was 5PM.

All systems and services will... Read more

6 years 11 months ago 6 years 11 months ago

Pages