Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Titlesort ascending Category Resolution Description Posted Updated
Scratch and Project are hung; schedulings have been paused Batch, filesystem Resolved

1:00PM 4/6/2017 Update:  The Scratch and Project file systems are back to normal service. Scheduling on systems are resumed. We are still investigating the causes to this problem... Read more

8 years 2 months ago 8 years 2 months ago
Schrodinger license check in issue Licensing Resolved

Schrödinger is an application that uses a FlexNet license server. To run a job, the application needs to check out licenses from the server and check it back in once the job is completed. However... Read more

11 months 3 weeks ago 6 months 2 weeks ago
Scheduling temporarily suspended on Oakley Batch Resolved

We are migrating the batch scheduler on Oakley to a new virtual machine. In order to accomplish this, the scheduler will be temporarily offline for about four hours on December 16th. Running jobs... Read more

9 years 6 months ago 9 years 6 months ago
Scheduling suspended Batch Resolved

We have temporarily suspended scheduling due to some problems with the parallel scratch file system.

10 years 9 months ago 10 years 9 months ago
Running jobs requeued on all clusters Owens, Pitzer Resolved

The Slurm upgrades during rolling reboots of Ascend, Owens and Pitzer we performed today (Oct 25 2023) cause all running jobs on the systems requeued around 8:45am. You will not be billed for the... Read more

1 year 7 months ago 1 year 7 months ago
Ruby Rolling Reboot Resolved

2015/02/16 RUBY Rolling Reboot starting Today

 

A rolling reboot is required on Ruby to update a critical... Read more

10 years 4 months ago 10 years 4 months ago
Ruby is offline Operations Resolved

The Ruby Transitional Cluster (only open to select research groups) is currently offline due to network problems. We expect it will return to service some time after the downtime.

11 years 8 months ago 11 years 4 months ago
Rolling reboots on owens and pitzer starting 18 Aug 2021 Batch, Connectivity, Maintenance Resolved

We will have rolling reboots of Owens and Pitzer cluster, including login and compute nodes, starting from 9am on August 18, 2021. The rolling reboot is for urgent security updates.

The... Read more

3 years 10 months ago 3 years 9 months ago
Rolling reboots on all HPC systems starting Oct 31 2024 Owens, Pitzer Resolved

Updates on Nov 13 2024:

Pitzer is completed. 

Updates... Read more

7 months 3 weeks ago 7 months 1 week ago
Rolling reboots of Owens cluster, starting from Feb 18, 2021 Owens Resolved

Updated on March 2:

This is completed.

Original Post:

We will have rolling reboots of Owens cluster including login and compute nodes,... Read more

4 years 4 months ago 4 years 3 months ago

Pages