We've been having intermittent issues with connecting to the batch server hosts on Owens.
Rolling reboots of all clusters, starting from 8 AM Tuesday, June 19, 2018
User may have been getting the following error message when trying to submit a PBS job using job arrays
We will have rolling reboots of Oakley, Ruby and Owens clusters starting from Monday Feb 5, 2018.
We are experiencing a problem with the queuing system on oakley and owens that is delaying or preventing new jobs from running. Our systems staff is investigating.
qstat: cannot connect to server oak-batch-test.osc.edu on Oakley between around 3~3:30pm Nov 21, 2017.
Rolling reboot of Owens cluster, starting from 8:30AM Oct 30, 2017
We will have rolling reboots of Oakley and Ruby clusters starting from 8:30AM on Monday October 9, 2017.
We will have a rolling reboot of Owens starting from 9AM on Monday, September 11 2017.
All PBS commands on Owens are working now