Known Issues

Title Category Resolution Description Posted Updated
Lustre, InfiniBand Operational and Being Monitored Closely filesystem Resolved

UPDATE: Most users should no longer see any issues with Lustre.


Again, please continue to notify OSC Help of any errors you see in job output. For example, you might see "...

9 months 1 week ago 8 months 3 weeks
Emergency InfiniBand Shutdown (All systems) Network Resolved

We have returned to service. It appears that we have resolved the networking issues enough to allow jobs to run safely. We will continue working with our vendors to fix any remaining hardware...

9 months 1 week ago 9 months 1 week
Certain modules not accessible Software Resolved

Certain modules are not working for all clusters since the downtime. We have reports specifically that Amber, Gaussian, and Turbomole are not working. We are working to resolve the issue, but...

10 months 1 day ago 10 months 1 day
Lustre is still offline. HPC systems back up Maintenance Resolved

Day One of the scheduled downtime has been completed, and HPC operations have resumed. As planned, Lustre work will extend into Day Two. Jobs using /fs/lustre or $PFSDIR cannot run until this work...

10 months 2 days ago 9 months 4 weeks
Login Shell Issues on Oakley Account/Shell Resolved

UPDATE: The shells have all been switched back for affected users, and you can submit jobs normally again. Additionally, if you are still logged in and have the incorrect shell, logging back out...

10 months 3 weeks ago 10 months 2 weeks
Oakley Login Node Issues Login Problems Resolved

Currently, users connecting via SSH to Oakley may receive "connection refused" or "connection failed" errors if they were not logged in before this occurred. Glenn is currently functioning...

10 months 3 weeks ago 10 months 3 weeks
Account changes temporarily suspended Account Management Resolved

We are still experiencing some account problems related to Thursday's issue. As a result, we have taken my.osc.edu offline and cannot process email changes or password resets, either via self-...

10 months 3 weeks ago 10 months 3 weeks
Cannot log in to clusters Resolved

As of around 3 PM today (Thursday 6/12), we have reports of users being unable to log in to the clusters. The error message given will make it sound like your password is incorrect, although it...

10 months 4 weeks ago 10 months 4 weeks
ARMSTRONG is offline Outage Resolved

ARMSTRONG is experiencing an unexpected outage. We are working on a resolution.

1 year 3 weeks ago 1 year 3 weeks
Oakley login node down login Resolved

One of the Oakley login nodes is down. We are currently working on bringing it back online. SSH connections to oakley.osc.edu may time out. A workaround is to connect directly to oakley01.osc.edu...

1 year 1 month ago 1 year 1 month
