8AM 9/11/13 - Brief network disruption to reboot a switch

At 8AM on September 11, 2013, we will reboot a network switch to replace a failed card. Network service will be disrupted for 10 to 15 minutes while the work is done. Filesystem mounts may experience difficulties, and running jobs may hang for the duration of the reboot but are expected to resume without failure. If you experience any unexpected problems, please contact OSC Help.

Brief disruption on 8/1/2013 at 8AM

At 8AM on 8/1/2013, we will be replacing some faulty hardware in our network infrastructure. Unfortunately, this work cannot be delayed until the next downtime, and the replacement will cause a short disruption of network services for our compute nodes. Jobs may temporarily hang if they are attempting to communicate with network-provided storage or with other nodes. A few jobs may fail to complete properly, but only under a very specific set of circumstances.

Network card re-seat

At 8AM on Tuesday, July 9th, 2013, we will be re-seating a network card in a switch at our operations center. A brief (~10 minute) outage may occur. Jobs will pause for the duration of any outage and resume once the network becomes available again. If a job's walltime expires during an outage, the job may be terminated. Connections to OSC systems may be terminated, and attempts to log in may generate a "no route to host" error. Please contact OSC Help if you have any concerns.
