Over the past two weeks we have experienced Oakely login node crashes potentially caused by a Lustre bug.
|Can not change GPU compute mode on Oakley||GPU||Resolved||
Update: The driver version has been updated and the issue has been fixed.
In updating the driver version for Oakley's NVIDIA GPUs the NVML libraries that are used in conjunction... (Read more)
|9 months 1 week ago||7 months 2 weeks|
|Abaqus Service Disruption Expected||Software||Resolved||
OSC has acquired its new ABAQUS license with new license terms. Use is now limited by both institution and for use in educational, institutional, instructional... (Read more)
|10 months 5 days ago||9 months 6 hours|
|OnDemand Apps will not open||OnDemand||Resolved||
Since the 10.9.5 OS X update, Apple has changed its security model to only support applications from the Mac App Store and identified developers. This has caused OnDemand apps to fail to run with... (Read more)
|11 months 5 days ago||11 months 1 day|
We have temporarily suspended scheduling due to some problems with the parallel scratch file system.
|11 months 1 week ago||11 months 1 week|
|Oakley login node instability||Operations||Resolved||
Oakley login nodes are seeing some instability related to Lustre. We will reboot the nodes on Thursday, October 2nd 2014 to resolve the issue. If a login node crashes before then and we have the... (Read more)
|11 months 1 week ago||10 months 4 days|
|Statewide Intel compiler license checkout failures||Licensing||Resolved||
This morning (9/10/14) we updated our Intel compiler licenses. We are seeing some unexpected license checkout failures in the logs (please click through to see details):
10:44:... (Read more)
|11 months 3 weeks ago||11 months 1 week|
9/10/14 - We have not seen any additional crashes of the Lustre servers since making this change.
|1 year 5 days ago||11 months 3 weeks|
|Armstrong offline until Noon||Armstrong||Resolved||
Armstrong will need to be taken down today until Noon. In the meantime, contact OSCHelp (OSCHelp@osc.edu) for account assistance.
|1 year 1 week ago||1 year 1 week|
|Lustre jobs suspended||filesystem||Resolved||
The Lustre filesystem ($PFSDIR and /fs/lustre) has crashed several times Friday evening (8/15). We have degraded this service temporarily, while we work to isolate the actions that are triggering... (Read more)
|1 year 2 weeks ago||1 year 5 days|
|issue with OnDemand 6:09 - 8:39 pm||Resolved||
OnDemand, epi accounting queries, the Viper DB, the Medline DB, the Eweld DB,... (Read more)
|1 year 2 weeks ago||1 year 2 weeks|