Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available, which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Rolling reboot of all clusters, starting from Wednesday morning, April 19, 2017 Batch, Maintenance, Owens, Ruby Resolved

1:40PM 4/27/2017 Update: Rolling reboots are completed. 

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address GPFS errors occurred...

8 years 8 months ago 8 years 7 months ago
Poor performance with hybrid MPI+OpenMP jobs and more than 4 MPI Tasks on multiple nodes Software Resolved (workaround)

RELION versions prior to 5 may exhibit suboptimal performance in hybrid MPI+OpenMP jobs when the number of MPI tasks exceeds four across multiple nodes.

Workaround...

4 months 5 days ago 4 months 5 days ago
Lustre is still offline. HPC systems back up Maintenance Resolved

Day One of the scheduled downtime has been completed, and HPC operations have resumed. As planned, Lustre work will extend into Day Two. Jobs using /fs/lustre or $PFSDIR cannot run until this work...

11 years 5 months ago 11 years 5 months ago
Intermittent home directory performance issues filesystem Resolved

Users may experience performance issues in their home directory. It is recommended to use a temporary directory ($TMPDIR, or scratch) or project storage to minimize the impact on...

2 years 5 months ago 2 years 4 months ago
Scheduling suspended Batch Resolved

We have temporarily suspended scheduling due to some problems with the parallel scratch file system.

11 years 2 months ago 11 years 2 months ago
Slurm to be Upgraded to Version 23.11.4 Owens, Pitzer Resolved

Updates on 04/08/2024:

The rolling reboots are completed. 

Updates:

We will perform rolling reboots on this...

1 year 9 months ago 1 year 8 months ago
NFS outage on Thursday Jan 17 from 7am to 8am filesystem Resolved

Update:

This work has been canceled and will be done during downtime on Feb. 5. 

Original Post:

On Thursday, January 17th from 7 am to 8 am OSC will have a GPFS...

6 years 11 months ago 6 years 11 months ago
Client Portal inaccessible client portal Resolved

All users are currently unable to log in to the OSC client portal.

When attempting to log in to OSC...

4 years 11 months ago 4 years 11 months ago
June 7th downtime to finish at 6:30PM Connectivity, filesystem, Infrastructure, login, Login Problems, Maintenance, Operations, Outage Resolved

Update: Downtime completed at 6:30PM, June 7th.

The June 7th downtime is now slated to be completed at 6:30PM. The previous estimate was 5PM.

All systems and services will...

9 years 6 months ago 9 years 6 months ago
Intermittent failure of default CPU binding Software Resolved
(workaround)

The default CPU binding for ORCA jobs can fail sporadically. The failure is almost immediate and produces a cryptic error message, e.g.

$ORCA/orca h2o.in...

7 months 1 week ago 7 months 1 week ago
