Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available, which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Owens batch is down Owens Resolved

Updated at 9:07PM on Dec 20, 2017:

Owens batch was restored by updating the Torque resource manager at 6:37PM on Dec 19, 2017.

Original Post at 4:45PM on Dec 19...

5 years 1 month ago 5 years 1 month ago
Rolling reboot of login nodes of clusters at 7:00AM Dec 19, 2017 login Resolved

We will have a rolling reboot of the cluster login nodes at 7:00AM on Dec 19, 2017 for a GPFS version upgrade. It is expected to be completed in a short period of time. If you encounter any login issues,...

5 years 1 month ago 5 years 1 month ago
DOWNTIME EXTENDED UNTIL MORNING OF 12/13/17 Resolved

We have extended the 12/12/2017 downtime until 7AM on 12/13/17 to complete filesystem maintenance that has taken longer than expected.

5 years 1 month ago 5 years 1 month ago
qstat error on Oakley Nov 21, 2017 Batch Resolved

A misconfiguration of the Oakley system meant that users who logged in to Oakley between approximately 3:00PM and 3:30PM on Nov 21, 2017 may receive the following error message when trying to submit jobs:

...
5 years 2 months ago 5 years 2 months ago
Rolling reboot of Owens cluster, starting from 8:30AM Oct 30, 2017 Batch, Owens Resolved

Updated on Nov 21, 2017 at 3:33PM:

The rolling reboot has been completed.

Updated on October 20, 2017 at 4:19PM:

We will have a rolling reboot of Owens...

5 years 3 months ago 5 years 2 months ago
Rolling reboot of Oakley and Ruby clusters, starting from 8:30AM October 9, 2017 Batch, login, Ruby Resolved

Updated at 1:00PM on October 16, 2017:

The rolling reboots of Oakley and Ruby are completed.

...
5 years 4 months ago 5 years 3 months ago
Rolling reboot of Owens cluster, starting from 9AM September 11, 2017 Batch, Owens Resolved

Updated at 12:20PM on September 25, 2017:

The rolling reboot of Owens is completed.

...
5 years 5 months ago 5 years 4 months ago
Some issues remain after downtime Login Problems, Operations, Outage Resolved

15 July 2016, 5:00PM update: some additional issues we are facing

  • We are experiencing periodic hangs of the GPFS client file system software used with the new storage environment. We...
6 years 6 months ago 5 years 4 months ago
OnDemand has NOT been working with external providers since 08/22 OnDemand Resolved

Updated at 9:40AM on August 23, 2017: this issue has been resolved.

>>>

Issue:

Users can NOT log in to OnDemand (ondemand.osc.edu)...

5 years 5 months ago 5 years 5 months ago
PBS commands on Owens are not working Batch, Owens Resolved

Update posted on July 12, 2017 at 1:50PM:

We have fixed the problem with the batch management system on Owens, and the Owens queues have been reopened for jobs.

...

5 years 7 months ago 5 years 6 months ago
