|Ruby Rolling Reboot||Resolved||
2015/02/16 RUBY Rolling Reboot starting Today
A rolling reboot is required on Ruby to update a critical... Read more
|2 years 3 months ago||2 years 3 months ago|
|Ruby is offline||Operations||Resolved||
The Ruby Transitional Cluster (only open to select research groups) is currently offline due to network problems. We expect it will return to service some time after the downtime.
|3 years 8 months ago||3 years 3 months ago|
|Rolling reboot of compute and login nodes of all clusters, starting from Wednesday morning, March 22, 2017||login, Oakley, Owens, Ruby||Resolved||
4:56PM 3/28/2017 Update: The rolling reboots of all systems are completed.
All compute nodes and login nodes of Owens, Oakley, and Ruby clusters will need to be rebooted... Read more
|2 months 2 weeks ago||2 months 2 days ago|
|Rolling reboot of all clusters, starting from Wednesday morning, April 19, 2017||Batch, Maintenance, Oakley, Owens, Ruby||Resolved||
1:40PM 4/27/2017 Update: Rolling reboots are completed.
3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address GPFS errors occured... Read more
|1 month 1 week ago||4 weeks 20 hours ago|
|qsub filter rejects valid jobs||Resolved||
Job scripts submitted on Glenn, Oakley, or Ruby all go a submit filter before reaching the resource manager, Torque. A bug has been discovered in our submit filter which prevents jobs with the... Read more
|2 years 2 months ago||1 year 7 months ago|
|Project space giving errors "No space left on device"||filesystem||Resolved||
11/01/2016 11:52AM Update: This issue has been fixed.
We have become aware of a problem with the Project storage space that gives errors "No space left on device". The... Read more
|7 months 14 hours ago||6 months 4 weeks ago|
|Proj13 file system difficulties||filesystem||Resolved||
We are currently experiencing difficulties with the servers for the filesystem mounted at /nfs/proj13.
|3 years 7 months ago||3 years 7 months ago|
|Problems with Project Space (/nfs/gpfs)||filesystem||Resolved||
(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.
As of early afternoon, Sept. 8,... Read more
|1 year 8 months ago||1 year 8 months ago|
|Problems with LAMMPS 14May16||Software||Resolved||
LAMMPS 14May16 was built with the USER-OMP package on Oakley, Ruby, and Owens. Its default behavior is to spawn too many OpenMP threads. lammps/14May16 batch scripts that do not use the USER-OMP... Read more
|8 months 2 weeks ago||6 months 13 hours ago|
|Poor network performance on some filesystems||filesystem||Resolved||
We are experiencing some network performance issues on a cluster of servers involved with providing GPFS and some project filesystems. GPFS appears to be functioning acceptably, but proj01, proj02... Read more
|3 years 10 months ago||3 years 10 months ago|