Known issues

Unresolved known issues

A known issue with an Unresolved resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) resolution state is an ongoing problem for which a permanent workaround is available; the workaround may involve using different software or hardware.

A known issue with a Resolved resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Abaqus license contention Batch, Licensing Resolved

We have noticed that some Abaqus jobs end up in BatchHold. Once a job is in BatchHold, it will never start. This happens because the Abaqus licenses are shared between Oakley and Owens. We have opened a...

6 years 4 days ago 4 years 8 months ago
Issues with VDI through OnDemand Batch Resolved

Update: Jan 23, 2017 3PM: this issue has been fixed. 

There is an issue with OSC OnDemand -> Desktops -> Virtual Desktop Interface (VDI) such that you get "qsub: submit error ..."...

6 years 2 weeks ago 6 years 2 weeks ago
Dec 27, 2016: Issues with /fs/project filesystem Resolved

Dec 27, 2016 3:46PM Update: Both project and scratch file systems (/fs/project and /fs/scratch) are back to normal now. Some users' jobs may be...

6 years 1 month ago 6 years 1 month ago
Performance Regression of GPU Nodes on Ruby GPU, Ruby Resolved

We are currently seeing a performance regression on Ruby's GPU nodes. Some of the GPU nodes on Ruby will remain in a power-saving state even after an application starts using them, resulting in...

6 years 2 months ago 4 years 8 months ago
LAMMPS 14May16 velocity command problem on Owens Software Resolved (workaround)

LAMMPS 14May16 on Owens can hang when using the velocity command. Inputs that hang on Owens work on Oakley and Ruby. LAMMPS 31Mar17 on Owens also works. Here is an example failing input snippet...

6 years 2 months ago 9 months 3 weeks ago
Project space giving errors "No space left on device" filesystem Resolved

11/01/2016 11:52AM Update: This issue has been fixed. 

We have become aware of a problem with the Project storage space that gives errors "No space left on device". The...

6 years 3 months ago 6 years 3 months ago
Oakley login nodes and ruby02 will not be accessible between 9:00-9:30am on 10/18/2016 login Resolved

We upgraded both the Oakley and Ruby clusters to RHEL 6.8 during the October 12th downtime. Unfortunately, we are noticing an NFS problem that has been causing rsh or ssh sessions to hang on...

6 years 3 months ago 6 years 3 months ago
Nvidia drivers on Oakley GPU Resolved

We upgraded the drivers for the Nvidia GPUs on all of our clusters during the downtime this week. Unfortunately, we are noticing some subtle problems with the GPUs on Oakley. We will be rolling...

6 years 3 months ago 4 years 8 months ago
Interruption of both Project and Scratch filesystems on 10/13/2016 filesystem Resolved

We experienced a brief interruption of both Project and Scratch filesystems at about 5:15PM October 13, 2016. User jobs may have been affected.

We will post more details later.

6 years 3 months ago 6 years 3 months ago
libibumad.so.2 missing on Oakley Software Resolved

Update: We think this is fixed. Please submit a ticket if you encounter further problems.

As a result of updates made during yesterday's downtime, software built with mvapich2/...

6 years 3 months ago 6 years 3 months ago
