Known Issues

Title Category Resolution Description Posted Updated

logins failing Account Management Resolved

Logins to are failing. This is unrelated to our InfiniBand issue; the cause is believed to be a router change at OARnet, which is working on re-establishing the necessary routing.

1 year 2 months ago 1 year 2 months
Lustre, InfiniBand Operational and Being Monitored Closely filesystem Resolved

UPDATE: Most users should no longer see any issues with Lustre.

Again, please continue to notify OSC Help of any errors you see in job output. For example, you might see "...

1 year 2 months ago 1 year 1 month
Cannot change GPU compute mode on Oakley GPU Resolved

Update: The driver version has been updated and the issue has been fixed.


In updating the driver version for Oakley's NVIDIA GPUs the NVML libraries that are used in conjunction...

10 months 2 weeks ago 8 months 3 weeks
Oakley login node problems Oakley Resolved

One of the Oakley login nodes (oakley01) has experienced some hardware failures and is temporarily out of service while repairs are ongoing.

Please limit your interactive use of the...

9 months 3 weeks ago 9 months 3 weeks
Maintenance for OnDemand and other web-based services Resolved

Update (12/13/14 10am): Maintenance has finished as planned.


OnDemand, AweSim applications, and other web-based services will be down starting Wednesday, January 31 at 8:30AM for...

9 months 1 week ago 9 months 1 week
Problems with Project Space (/nfs/gpfs) filesystem Resolved

(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.

As of early afternoon, Sept. 8,...

4 weeks 5 hours ago 4 weeks 10 min
Lustre bug causing Oakley login node crashes filesystem, login, Oakley Resolved

Over the past two weeks we have experienced Oakley login node crashes potentially caused by a Lustre bug. The bug (or issue otherwise) seems to be activated when a user does operations on a...

1 month 1 week ago 8 hours 29 min
Unscheduled GPFS Outage filesystem Resolved

As of 11:30PM on June 16th, we have removed the GPFS filesystem from service due to a number of hardware failures. At this point, further hardware failures would put a large portion of the entire...

3 months 3 weeks ago 3 months 2 weeks
Matlab PCT broken due to pbsrsh modification Matlab Resolved

A change was made to the system-wide pbsrsh script that Matlab relies on. It has been discovered that this change has broken the parallel computing toolbox (...

5 months 1 week ago 7 hours 16 min
Armstrong inaccessible Resolved

Update: 2PM March 12th: Armstrong is back up and running. Please notify OSC Help of any lingering issues.

As of 10AM Thursday March 12th...

6 months 4 weeks ago 6 months 4 weeks