|Oakley login node down||login||Resolved||
One of the Oakley login nodes is down. We are currently working on bringing it back online. SSH connections to oakley.osc.edu may time out. A workaround is to connect directly to oakley01.osc.edu... (Read more)
|1 year 7 months ago||1 year 7 months|
|Network card re-seat||Network||Resolved||
At 8AM on Tuesday, July 9th 2013, we will be re-seating a network card in a switch at our operations center. It is possible that a brief (~10 minute) outage may occur. Jobs will pause for the... (Read more)
|2 years 3 months ago||2 years 3 months|
|my.osc.edu logins failing||Account Management||Resolved||
Logins to my.osc.edu are failing. This is unrelated to our InfiniBand issue; a router change at OARnet is the believed cause. They are working on re-establishing the necessary routing.
|1 year 2 months ago||1 year 2 months|
|MVAPICH broken on Ruby||Ruby||Resolved||
Update Monday February 16th -- Ruby MVAPICH2 build fixed.
Ruby's MVAPICH2 build has been fixed. Please email firstname.lastname@example.org with any issues.... (Read more)
|7 months 4 weeks ago||7 months 3 weeks|
|module spider/avail/show not showing MPI dependent modules||Ruby||Resolved||
On Ruby, the commands:
|5 months 2 weeks ago||1 day 17 hours|
|Matlab PCT broken due to pbsrsh modification||Matlab||Resolved||
A change was made to the system wide
|5 months 2 weeks ago||3 days 19 hours|
|Maintenance for OnDemand and other web based services||Resolved||
Update (12/13/14 10am): Maintenance has finished as planned.
OnDemand, AweSim applications, and other web based services will be down starting Wednesday, January 31 at 8:30AM for... (Read more)
|9 months 1 week ago||9 months 1 week|
|Lustre, Infiniband Operational and Being Monitored Closely||filesystem||Resolved||
UPDATE: Most users should no longer see any issues with Lustre.
Again, please continue to notify OSC Help of any errors you see in job output. For example, you might see "... (Read more)
|1 year 2 months ago||1 year 1 month|
9/10/14 - We have not seen any additional crashes of the Lustre servers since making this change.
|1 year 1 month ago||1 year 4 weeks|
|Lustre jobs suspended||filesystem||Resolved||
The Lustre filesystem ($PFSDIR and /fs/lustre) has crashed several times Friday evening (8/15). We have degraded this service temporarily, while we work to isolate the actions that are triggering... (Read more)
|1 year 1 month ago||1 year 1 month|