Known Issues

Title Category Resolution Description Posted Updated
Dec 27, 2016: Issues with /fs/project filesystem Resolved

Dec 27, 2016 3:46PM Update: Both project and scratch file systems (/fs/project and /fs/scratch) are back to normal now. Some users' jobs may be... Read more

5 months 3 days ago 5 months 3 days ago
Project space giving errors "No space left on device" filesystem Resolved

11/01/2016 11:52AM Update: This issue has been fixed. 

We have become aware of a problem with the Project storage space that produces "No space left on device" errors. The... Read more

7 months 14 hours ago 6 months 4 weeks ago
Oakley login nodes and ruby02 will not be accessible between 9:00-9:30am on 10/18/2016 login Resolved

We upgraded both the Oakley and Ruby clusters to RHEL 6.8 during the October 12th downtime. Unfortunately, we are seeing an NFS problem that has been causing rsh or ssh sessions to hang on... Read more

7 months 2 weeks ago 7 months 2 weeks ago
Interruption of both Project and Scratch filesystems on 10/13/2016 filesystem Resolved

We experienced a brief interruption of both Project and Scratch filesystems at about 5:15PM October 13, 2016. User jobs may have been affected.

We will post more details later.

7 months 2 weeks ago 7 months 2 weeks ago
libibumad.so.2 missing on Oakley Software Resolved

Update: We think this is fixed. Please submit a ticket if you encounter further problems.

As a result of updates made during yesterday's downtime, software built with mvapich2/... Read more

7 months 2 weeks ago 7 months 2 weeks ago
Problems with LAMMPS 14May16 Software Resolved

LAMMPS 14May16 was built with the USER-OMP package on Oakley, Ruby, and Owens. Its default behavior is to spawn too many OpenMP threads. lammps/14May16 batch scripts that do not use the USER-OMP... Read more

8 months 2 weeks ago 6 months 14 hours ago
GPFS hang Issue on 09/08/2016 filesystem Resolved

On Thursday, Sept 8 starting at 19:37, we had a bad interaction that appears to have been caused by the backup client and the GPFS servers. This resulted in a GPFS hang that propagated I/O... Read more

8 months 3 weeks ago 8 months 3 weeks ago
All HPC systems are available Login Problems, Operations, Outage Resolved

8/24/16 3:57PM: All HPC systems are available, including:

  • Oakley cluster for general access
  • Ruby cluster for restricted access
  • Owens cluster for... Read more
9 months 1 week ago 9 months 1 week ago
NFS service disruption 6/29/16 filesystem Resolved

OSC experienced errors with NFS services on the morning of June 29 between 08:37 and 09:12 that may have caused some jobs to fail or behave unexpectedly. The... Read more

11 months 4 days ago 11 months 4 days ago
June 7th downtime to finish at 6:30PM Connectivity, filesystem, Infrastructure, login, Login Problems, Maintenance, Operations, Outage Resolved

Update: Downtime completed at 6:30PM, June 7th.

The June 7th downtime is now slated to be completed at 6:30PM. The previous estimate was 5PM.

All systems and services will... Read more

11 months 3 weeks ago 11 months 3 weeks ago
