Due to a critical security vulnerability, we need to reboot public-facing systems to deploy a mitigation against the vulnerability

Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available, which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Ruby Rolling Reboot Resolved

2015/02/16 RUBY Rolling Reboot starting Today

A rolling reboot is required on Ruby to update a critical...

11 years 2 months ago 11 years 2 months ago
Torque module on Oakley improperly setting environment variables Resolved

Intel library paths are being added to the environment variable LD_LIBRARY_PATH incorrectly when loading torque.  Additionally the Intel paths remain when the torque...

11 years 2 months ago 7 years 11 months ago
pdsh -j broken on Oakley Batch, system software Resolved

pdsh -j is broken on Oakley.  It was broken by updates during the September downtime.  We are currently working on resolving the issue.

Users who require...

10 years 5 months ago 7 years 11 months ago
Scheduling temporarily suspended on Oakley Batch Resolved

We are migrating the batch scheduler on Oakley to a new virtual machine. In order to accomplish this, the scheduler will be temporarily offline for about four hours on December 16th. Running jobs...

10 years 4 months ago 10 years 4 months ago
Glenn module lammps-7Dec15 bug Software Resolved

Batch scripts loading module lammps-7Dec15 should use the user's login shell or the Korn shell, e.g. #PBS -S /bin/ksh...

10 years 3 months ago 9 years 8 months ago
libibumad.so.2 missing on Oakley Software Resolved

Update:  We think this is fixed.  Please submit a ticket if you encounter further problems.

As a result of updates made during yesterday's downtime, software built with mvapich2/...

9 years 6 months ago 9 years 6 months ago
Interruption of both Project and Scratch filesystems on 10/13/2016 filesystem Resolved

We experienced a brief interruption of both the Project and Scratch filesystems at about 5:15PM on October 13, 2016. User jobs may have been affected.

We will provide more details later.

9 years 6 months ago 9 years 6 months ago
Nvidia drivers on Oakley GPU Resolved

We upgraded the drivers for the Nvidia GPUs on all of our clusters during the downtime this week. Unfortunately, we are noticing some subtle problems with the GPUs on Oakley. We will be rolling...

9 years 6 months ago 7 years 11 months ago
Oakley login nodes and ruby02 will not be accessible between 9:00-9:30am on 10/18/2016 login Resolved

We upgraded both the Oakley and Ruby clusters to RHEL 6.8 during the October 12th downtime. Unfortunately, we are noticing an NFS problem that has been causing rsh or ssh sessions to hang on...

9 years 6 months ago 9 years 6 months ago
Project space giving errors "No space left on device" filesystem Resolved

11/01/2016 11:52AM Update: This issue has been fixed. 

We have become aware of a problem with the Project storage space that produces "No space left on device" errors. The...

9 years 6 months ago 9 years 6 months ago
