The Ohio Supercomputer Center (OSC) is experiencing an email delivery problem with several types of messages from MyOSC. 

 OSC is preparing to update Slurm on its production systems to version 23.11.4 on March, 27. 

Known issues

Unresolved known issues

Known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available which may include using different software or hardware.

A known issue with Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Postedsort descending Updated
OnDemand has NOT been working with external providers since 08/22 OnDemand Resolved

Updates on 9:40AM August 23, 2017: this issue has been resolved. 

>>>

Issue:

User can NOT login to OnDemand (ondemand.osc.edu)... Read more

6 years 7 months ago 6 years 7 months ago
Rolling reboot of Owens cluster, starting from 9AM September 11, 2017 Batch, Owens Resolved

Updates on 12:20PM September 25, 2017: 

The rolling reboot of Owens is completed. 

... Read more
6 years 6 months ago 6 years 6 months ago
Rolling reboot of Oakley and Ruby clusters, starting from 8:30AM October 9, 2017 Batch, login, Ruby Resolved

Updates on 1:00PM October 16, 2017: 

The rolling reboots of Oakley and Ruby are completed. 

... Read more
6 years 5 months ago 6 years 5 months ago
Rolling reboot of Owens cluster, starting from 8:30AM Oct 30, 2017 Batch, Owens Resolved

Updated on Nov 21, 2017 at 3:33PM:

It has been completed. 

Updated on October 20, 2017 at 4:19PM:

We will have a rolling reboot of Owens... Read more

6 years 5 months ago 6 years 4 months ago
qstat error on Oakley Nov 21, 2017 Batch Resolved

We had mis-configuration of Oakley system such that users who logged in on Oakley between around 3~3:30pm Nov 21, 2017 may receive the following error message when trying to submit jobs:

... Read more
6 years 4 months ago 6 years 4 months ago
DOWNTIME EXTENDED UNTIL MORNING OF 12/13/17 Resolved

We have extended the 12/12/2017 downtime until 7AM on 12/13/17 to complete filesystem maintenance that has taken longer than expected.

6 years 3 months ago 6 years 3 months ago
Rolling reboot of login nodes of clusters at 7:00AM Dec 19, 2017 login Resolved

We will have rolling reboot of login nodes of clusters at 7:00AM Dec 19, 2017 for GPFS version upgrade. It is supposed to be completed in a short period of time. f you encounter any login issues,... Read more

6 years 3 months ago 6 years 3 months ago
Owens batch is down Owens Resolved

Updated at 9:07PM on Dec 20, 2017 :

Owens batch was restored by updating Torque resource manager at 6:37pm Dec 19, 2017. 

Original Post at 4:45PM on Dec 19... Read more

6 years 3 months ago 6 years 3 months ago
Oakley and Owens queue issue Batch Resolved

We are experiencing a problem with the queuing system on oakley and owens that is delaying or preventing new jobs from running. Our systems staff is investigating.

 

6 years 3 months ago 6 years 3 months ago
Rolling reboots of all clusters starting from Monday Feb 5, 2018 Batch, Owens, Ruby Resolved

Posted on Feb 22 at 1:25PM:

The rolling reboots have been completed. 

Posted on Jan 30, 2018 at 4:00PM:

We will have rolling reboots of... Read more

6 years 1 month ago 6 years 1 month ago

Pages