15 July 2016, 5:00PM update: some additional issues we are facing

Known Issues

Titlesort descending Cat. Res. Description Post Upd.
Cannot login to clusters Resolved

As of around 3PM today (Thursday 6/12), we have reports of users being unable to login in to the clusters.  The error message given will make it sound like your password is incorrect, although it... (Read more)

2 years 1 month ago 2 years 1 month
Certain modules not accessible Software Resolved

Certain modules are not working for all clusters since the downtime.  We have reports specifically that Amber, Gaussian, and Turbomole are not working.  We are working to resolve the issue, but... (Read more)

2 years 2 weeks ago 2 years 2 weeks
Downtime Update: All Major Services Online Resolved

Friday, Sept 25th 12PM Noon:

  • Oakley is back online and has resumed running jobs.  
  • Ruby... (Read more)
10 months 1 week ago 10 months 4 days
Emergency InfiniBand Shutdown (All systems) Network Resolved

We have returned to service. It appears that we have resolved the networking issues enough to allow jobs to run safely. We will continue working with our vendors to fix any remaining hardware... (Read more)

1 year 12 months ago 1 year 11 months
February 11 2014 Scheduled Downtime Outage Resolved

HPC systems are offline today for scheduled quarterly maintenance activity. For details, please visit osc.edu/n

2 years 5 months ago 2 years 5 months
Intermittent DNS issues Resolved

3/9/15 Update: The DNS issues have been resolved.  In total, the following services may have been affected by the DNS issues:

1 year 4 months ago 1 year 4 months
Issue when loading multiple Fluent or ANSYS modules simultaneously Software Resolved

Due to the way our Fluent and ANSYS modules are configured, simultaneously loading multiple of either module will cause a cryptic error.  The most common case of this happening is when multiple of... (Read more)

1 year 9 months ago 9 months 1 day
June 7th downtime to finish at 6:30PM Connectivity, filesystem, Infrastructure, login, Login Problems, Maintenance, Operations, Outage Resolved

Update: Downtime completed at 6:30PM, June 7th.

 

The June 7th downtime is now slated to be completed at 6:30PM.  Previous estimate was 5PM.

All systems and services will... (Read more)

1 month 2 weeks ago 1 month 2 weeks
Login Shell Issues on Oakley Account/Shell Resolved

UPDATE: The shells have all been switched back for affected users, and you can submit jobs normally again.  Additionally, if you are still logged in and have the incorrect shell, logging back out... (Read more)

2 years 1 month ago 2 years 1 month
Lustre bug causing Oakley login node crashes filesystem, login, Oakley Resolved

Over the past two weeks we have experienced Oakely login node crashes potentially caused by a Lustre bug.  The bug (or issue otherwise) seems to be activated when a user does operations on a... (Read more)

11 months 5 days ago 9 months 3 weeks

Pages