Known Issues

Known Issues

Titlesort ascending Category Resolution Description Posted Updated
Ruby Rolling Reboot Resolved

2015/02/16 RUBY Rolling Reboot starting Today

 

A rolling reboot is required on Ruby to update a critical... Read more

2 years 3 months ago 2 years 3 months ago
Ruby is offline Operations Resolved

The Ruby Transitional Cluster (only open to select research groups) is currently offline due to network problems. We expect it will return to service some time after the downtime.

3 years 8 months ago 3 years 3 months ago
Rolling reboot of compute and login nodes of all clusters, starting from Wednesday morning, March 22, 2017 login, Oakley, Owens, Ruby Resolved

4:56PM 3/28/2017 Update: The rolling reboots of all systems are completed. 

All compute nodes and login nodes of Owens, Oakley, and Ruby clusters will need to be rebooted... Read more

2 months 2 weeks ago 2 months 2 days ago
Rolling reboot of all clusters, starting from Wednesday morning, April 19, 2017 Batch, Maintenance, Oakley, Owens, Ruby Resolved

1:40PM 4/27/2017 Update: Rolling reboots are completed. 

3:10PM 4/18/2017 Update: Rolling reboots on Owens have started to address GPFS errors occured... Read more

1 month 1 week ago 4 weeks 20 hours ago
qsub filter rejects valid jobs Resolved

Job scripts submitted on Glenn, Oakley, or Ruby all go a submit filter before reaching the resource manager, Torque.  A bug has been discovered in our submit filter which prevents jobs with the... Read more

2 years 2 months ago 1 year 7 months ago
Project space giving errors "No space left on device" filesystem Resolved

11/01/2016 11:52AM Update: This issue has been fixed. 

We have become aware of a problem with the Project storage space that gives errors "No space left on device". The... Read more

7 months 14 hours ago 6 months 4 weeks ago
Proj13 file system difficulties filesystem Resolved

We are currently experiencing difficulties with the servers for the filesystem mounted at /nfs/proj13.

3 years 7 months ago 3 years 7 months ago
Problems with Project Space (/nfs/gpfs) filesystem Resolved

(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.


As of early afternoon, Sept. 8,... Read more

1 year 8 months ago 1 year 8 months ago
Problems with LAMMPS 14May16 Software Resolved

LAMMPS 14May16 was built with the USER-OMP package on Oakley, Ruby, and Owens. Its default behavior is to spawn too many OpenMP threads. lammps/14May16 batch scripts that do not use the USER-OMP... Read more

8 months 2 weeks ago 6 months 13 hours ago
Poor network performance on some filesystems filesystem Resolved

We are experiencing some network performance issues on a cluster of servers involved with providing GPFS and some project filesystems. GPFS appears to be functioning acceptably, but proj01, proj02... Read more

3 years 10 months ago 3 years 10 months ago

Pages