We have been experiencing an issue with the Ethernet switches in the Owens cluster.
There have been problems with PBS Torque commands, including qstat, from the Ruby login nodes since this morning.
Rolling reboots of all three clusters, starting from Tuesday, September 4, 2018
We've been having intermittent issues connecting to the batch server hosts on Owens.
Rolling reboots of all clusters, starting from 8 AM Tuesday, June 19, 2018
Users may have been getting the following error message when trying to submit a PBS job using job arrays
We will have rolling reboots of the Oakley, Ruby, and Owens clusters starting Monday, Feb 5, 2018.
We are experiencing a problem with the queuing system on Oakley and Owens that is delaying or preventing new jobs from running. Our systems staff is investigating.
qstat: cannot connect to server oak-batch-test.osc.edu on Oakley between approximately 3:00 and 3:30 PM on Nov 21, 2017.
Rolling reboot of Owens cluster, starting from 8:30AM Oct 30, 2017